I'm using Lucene with TXT and HTML files, and it's working. The only problem with HTML files is that I have to index them as TXT first, before indexing them as HTML.
Has anyone tried indexing PDF files? I'm trying PDFBox. Are there any samples for indexing PDF files with any of the parsers (PDFBox, JPedal, etc.)? I haven't found any.
Thanks for helping,
Alvaro, from Lima - Peru
