[fw-general] Zend_Search_Lucene: indexing PDF & Doc files

Peter Bowyer Thu, 16 Aug 2007 03:15:29 -0700

Hi,

Has anyone written a document parser for Zend_Search_Lucene to extract text
from, and index, Microsoft Word documents or Adobe PDF files?  I see there
is a specialised one for HTML documents, but nothing in SVN for other
formats.


If no one has written such code, are there work-arounds you would propose? 
The wvware library appears deprecated, and if possible I am trying not to
rely on server software needing to be installed.

Thanks,
Peter
-- 
View this message in context: 
http://www.nabble.com/Zend_Search_Lucene%3A-indexing-PDF---Doc-files-tf4278749s16154.html#a12178692
Sent from the Zend Framework mailing list archive at Nabble.com.

[fw-general] Zend_Search_Lucene: indexing PDF & Doc files

Reply via email to