#145: Index text inside text document files (pdf, odt, doc, etc)

Type: FeatureItem Feature: ContentManagement Tags: colivre
ScheduledFor: N/A Assigned to:   Sites:  
Priority: 0 Status: Pending  

When a user uploads a text document (as an UploadedFile article), the contents of the file should be indexed where possible, so that searches can match text inside the document.

File types to be supported:

| *Format* | *Possible path* |
| Plain text | trivial |
| PDF | pdf2ps → ps2txt (ghostscript) |
| PS | ps2txt (ghostscript) |
| Microsoft Word (.doc) | antiword |
| OpenOffice.org | odt2txt |
| RTF | unrtf |
| Abiword | ? |
| kword | ? |
| others | ... |
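
The format-to-tool mapping above could be expressed as a small dispatch table. The following is only a minimal sketch in Python, not Noosfero code: the names `CONVERTERS`, `converter_command` and `extract_text` are hypothetical, and the argument forms (`antiword FILE`, `odt2txt FILE`, `unrtf --text FILE`, each writing text to standard output) are assumptions that should be checked against each tool's manual. The two-step PDF and PS paths, and the formats still marked `?`, are left out.

```python
import shutil
import subprocess
from pathlib import Path

# Hypothetical mapping from file extension to a converter command prefix.
# Tool names come from the table above; the arguments are assumptions.
CONVERTERS = {
    ".doc": ["antiword"],
    ".odt": ["odt2txt"],
    ".rtf": ["unrtf", "--text"],
}


def converter_command(path):
    """Return the argv list that would convert `path` to text, or None."""
    prefix = CONVERTERS.get(Path(path).suffix.lower())
    if prefix is None:
        return None
    return prefix + [str(path)]


def extract_text(path):
    """Return indexable text for `path`, or None when it cannot be extracted."""
    path = Path(path)
    if path.suffix.lower() == ".txt":
        # Plain text is the trivial case: read the file directly.
        return path.read_text(encoding="utf-8", errors="replace")
    command = converter_command(path)
    if command is None or shutil.which(command[0]) is None:
        # Unsupported format, or the converter is not installed: skip indexing.
        return None
    result = subprocess.run(
        command,
        capture_output=True,
        encoding="utf-8",
        errors="replace",
        check=False,
    )
    return result.stdout if result.returncode == 0 else None
```

Returning `None` rather than raising reflects the "where possible" wording of the request: an upload whose text cannot be extracted would still be stored, just without its contents in the search index.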

ActionItemForm

Title Index text inside text document files (pdf, odt, doc, etc)
ActionItemType FeatureItem
Priority Low
Tags colivre
Feature ContentManagement
ResponsibleDevelopers
ScheduledFor N/A
AffectsVersion
Status Pending
Ticket SAC:
Topic revision: r4 - 07 Apr 2010, JoenioCosta

Copyright © 2007-2018 by the Noosfero contributors
Colivre - Cooperativa de Tecnologias Livres