Lucene itself indexes java.lang.String or java.io.Reader data. It is
completely up to your application to parse the data out of whatever
source it is in and hand it to Lucene. There are a number of
open-source libraries that make parsing XML, MS Word, Excel, HTML, and
other formats trivial. If you search the e-mail list archives you'll
find pointers to tons of options.
Erik
On Dec 2, 2004, at 7:36 AM, Daniel Cortes wrote:
Hi I''m newer in this mail list and what you can see my English is
very terrible.
I 'm having a study to select the best technology for a motor
serching of an application web with a ratio of 1000 users/day.
I read a little bit of Lucene what I don't know what file types
support the search.
If you can reply my or say me a page that tells this I regret you.
Thanks of a "novatillo"
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]