Hi, Has anyone written a document parser for Zend_Search_Lucene to extract text from, and index, Microsoft Word documents or Adobe PDF files? I see there is a specialised one for HTML documents, but nothing in SVN for other formats.
If no one has written such code, are there work-arounds you would propose? The wvware library appears deprecated, and if possible I am trying not to rely on server software needing to be installed. Thanks, Peter -- View this message in context: http://www.nabble.com/Zend_Search_Lucene%3A-indexing-PDF---Doc-files-tf4278749s16154.html#a12178692 Sent from the Zend Framework mailing list archive at Nabble.com.
