Lucene itself indexes java.lang.String or java.io.Reader data. It is completely up to your application to parse the data out of whatever source it is in and hand it to Lucene. There are a number of open-source libraries that make parsing XML, MS Word, Excel, HTML, and other formats trivial. If you search the e-mail list archives you'll find pointers to tons of options.
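
As a rough illustration of the "parse it yourself" step, here is a minimal sketch that pulls the plain text out of an XML document using only the JDK's built-in DOM parser. The class name XmlTextExtractor and the method extractText are hypothetical names made up for this example; they are not part of Lucene. The resulting String is the kind of value your application would then hand to Lucene as a field value. The Lucene call itself is left out because it depends on the version you use.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical helper (not a Lucene class): turns XML into plain text.
final class XmlTextExtractor {

    // Parses the XML and returns the text of all elements,
    // with surrounding whitespace trimmed and inner runs collapsed.
    static String extractText(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));

        // getTextContent() concatenates the text of all descendant nodes
        // and drops the markup.
        String raw = doc.getDocumentElement().getTextContent();
        return raw.trim().replaceAll("\\s+", " ");
    }
}
```

Calling XmlTextExtractor.extractText(...) on the contents of an XML file gives you a java.lang.String ready for indexing. For Word, Excel, or HTML you would swap in a library that understands that format and produce a String or Reader in the same way.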

        Erik



On Dec 2, 2004, at 7:36 AM, Daniel Cortes wrote:

Hi, I'm new to this mailing list and, as you can see, my English is terrible.
I'm doing a study to select the best technology for a search engine for a web application with roughly 1000 users/day.
I have read a little about Lucene, but I don't know which file types it supports for searching.
If you can reply to me or point me to a page that explains this, I would be grateful.
Thanks from a "novatillo" (newbie)



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




