There's also a WAR that's already built, that's available at 
http://www.brownsite.net/docsearch.htm

It works with OpenOffice documents, Word doc, Excel, PDF, XML, RTF, TXT, etc.

It can work via a servlet interface or a standalone application.


Eric Anderson
LanRx Network Solutions


Quoting "Wilton, Reece" <[EMAIL PROTECTED]>:

> The Lucene FAQ on Java Guru gives some hints on this:
> http://www.jguru.com/faq/Lucene
> 
>       -----Original Message-----
>       From: Maurice Coyle [mailto:[EMAIL PROTECTED] 
>       Sent: Monday, July 07, 2003 9:07 AM
>       To: [EMAIL PROTECTED]
>       Subject: lucene handling different document formats
>       
>       
> could anyone tell me if there's some sort of repository somewhere that
> contains parsers for document types such as .doc, .pdf, .xls?  or how
> i'd begin to go about thinking to write one (tutorials etc much
> appreciated)
>  
> thanks,
> maurice
>                       
>       ____________________________________________________
>        <http://www.incredimail.com/redir.asp?ad_id=309&lang=9>
> IncrediMail - Email has finally evolved - Click Here
> <http://www.incredimail.com/redir.asp?ad_id=309&lang=9>  
> 
> 

LanRx Network Solutions, Inc.
Providing Enterprise Level Solutions...On A Small Business Budget

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to