Lucene cannot parse the document formats you mentioned; it only
indexes plain text that you hand to it.  You need 3rd party parsers to
extract the text first.  For example, POI will parse Excel and MS Word
docs, and PDFBox will parse PDF.
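
To make the split concrete, here is a rough sketch of the "extract
first, index second" flow. It uses only the JDK so it runs on its own.
The names TextExtractor, EXTRACTORS and toPlainText are made up for
illustration and are not part of Lucene, POI or PDFBox. In a real
setup you would presumably register adapters that call into PDFBox or
POI for their formats, and pass the returned string to a Lucene
Document field.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

public class ExtractThenIndex {

    /** Hypothetical interface: turns raw file bytes into plain text. */
    interface TextExtractor {
        String extract(byte[] content);
    }

    /** Hypothetical registry: file extension -> extractor for that format. */
    static final Map<String, TextExtractor> EXTRACTORS = new HashMap<>();

    static {
        // Plain text needs no parser, just decode the bytes.
        EXTRACTORS.put("txt", content -> new String(content, StandardCharsets.UTF_8));
        // Assumption: adapters wrapping PDFBox ("pdf") or POI ("doc", "xls")
        // would be registered here. They are left out to keep this JDK-only.
    }

    /** Returns the lower-cased extension, or "" if the name has none. */
    static String extensionOf(String fileName) {
        int dot = fileName.lastIndexOf('.');
        return dot < 0 ? "" : fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    /** Picks the extractor by extension and returns the text to be indexed. */
    static String toPlainText(String fileName, byte[] content) {
        TextExtractor extractor = EXTRACTORS.get(extensionOf(fileName));
        if (extractor == null) {
            throw new IllegalArgumentException("No parser registered for: " + fileName);
        }
        return extractor.extract(content);
    }

    public static void main(String[] args) {
        byte[] raw = "hello lucene".getBytes(StandardCharsets.UTF_8);
        // This string is what you would add to a Lucene Document field.
        String text = toPlainText("notes.txt", raw);
        System.out.println(text);
    }
}
```

Asking for a format with no registered extractor (for example a .pdf
here) fails with an exception, which is the same situation as handing
Lucene a binary file without a parser in front of it.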

Otis

--- "Natarajan.T" <[EMAIL PROTECTED]> wrote:
> Hi Guys,
>  
> I have a small query: if the Lucene 1.4 APIs can directly index all
> kinds of documents (PPT, PDF, Word, etc.), then why do we need
> converters or parsers?
>  
>  
> Thanks,
> Natarajan.
>  
> 

