There's also a WAR that's already built, that's available at http://www.brownsite.net/docsearch.htm
It works with OpenOffice documents, Word doc, Excel, PDF, XML, RTF, TXT, etc. It can work via a servlet interface or a standalone application. Eric Anderson LanRx Network Solutions Quoting "Wilton, Reece" <[EMAIL PROTECTED]>: > The Lucene FAQ on Java Guru gives some hints on this: > http://www.jguru.com/faq/Lucene > > -----Original Message----- > From: Maurice Coyle [mailto:[EMAIL PROTECTED] > Sent: Monday, July 07, 2003 9:07 AM > To: [EMAIL PROTECTED] > Subject: lucene handling different document formats > > > could anyone tell me if there's some sort of repository somewhere that > contains parsers for document types such as .doc, .pdf, .xls? or how > i'd begin to go about thinking to write one (tutorials etc much > appreciated) > > thanks, > maurice > > ____________________________________________________ > <http://www.incredimail.com/redir.asp?ad_id=309&lang=9> > IncrediMail - Email has finally evolved - Click Here > <http://www.incredimail.com/redir.asp?ad_id=309&lang=9> > > LanRx Network Solutions, Inc. Providing Enterprise Level Solutions...On A Small Business Budget --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
