Thanks for the responses and to answer both questions/comments: 1. I tried the Word filter and it fails with "Invalid Word Format" 2. My other comment was regarding that it doesn't seem to index TEXT files in the ORIGINAL bundle but only the TEXT bundle which is where the problem originally stemmed from. It would seem that having just a filter that puts a copy of the ORIGINAL file in the TEXT bundle would suffice to get the indexer to work properly.
Any other insight is greatly appreciated. Thanks. Tom Autry Coffing Corporation 3136 Presidential Drive Fairborn, Ohio 45324 Office: 937-458-6100 Cell: 937-361-4680 Email: tom.au...@coffingco.com -----Original Message----- From: helix84 [mailto:heli...@centrum.sk] Sent: Tuesday, September 18, 2012 9:51 AM To: dspace-tech@lists.sourceforge.net Subject: Re: [Dspace-tech] Filter-media on TEXT files On Tue, Sep 18, 2012 at 3:46 PM, Mark H. Wood <mw...@iupui.edu> wrote: > I don't understand: why would there be any need to extract plain text > from a bitstream that's already plain text? Just index it. The point > of text extraction is to create a plain-text bitstream for the indexer > to digest. Mark, does the indexer index text from plain text files in the ORIGINAL bundle? Regards, ~~helix84 ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech This e-mail message and any attachments may contain legally privileged, confidential or proprietary information. If you are not the intended recipient(s),or the employee or agent responsible for delivery of this message to the intended recipient(s), you are hereby notified that any dissemination, distribution or copying of this e-mail message is strictly prohibited. If you have received this message in error, please immediately notify the sender and delete this e-mail message from your computer. Any views expressed in this message are those of the individual sender and may not necessarily reflect the views of the company. ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech