I would start at the Lucene Java home page (http://lucene.apache.org/java) and dig in from there. There are a number of good docs on scoring and on the IR model used (Boolean model plus Vector Space Model). Next, I would read through the javadocs and whip up some example code that indexes a small set of documents built from a controlled vocabulary, so you know exactly which tokens to expect in the index. After that, you can dig into the source itself, especially the new DocumentsWriter class. And, of course, along the way, please feel free to submit documentation patches!
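
To make the "controlled vocabulary" exercise concrete before touching the Lucene API, here is a rough sketch of the core idea behind an index: a sorted term dictionary where each term points to a postings list of document ids and term frequencies. This is plain Java and not Lucene code; the class name ToyInvertedIndex and its methods are made up for illustration, and the real on-disk format (segments, stored fields, norms and so on) is considerably more involved.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

/** Toy in-memory inverted index (hypothetical, for illustration only; not Lucene's implementation). */
final class ToyInvertedIndex {
    // Term dictionary: term -> postings (docId -> term frequency).
    // TreeMap keeps both terms and doc ids in sorted order.
    private final Map<String, Map<Integer, Integer>> postings = new TreeMap<>();
    private int docCount = 0;

    /** Lowercases the text, splits it on whitespace, indexes each token, and returns the new doc id. */
    int addDocument(String text) {
        int docId = docCount++;
        for (String token : text.toLowerCase(Locale.ROOT).split("\\s+")) {
            if (token.isEmpty()) {
                continue; // leading whitespace produces an empty first token
            }
            postings.computeIfAbsent(token, t -> new TreeMap<>())
                    .merge(docId, 1, Integer::sum);
        }
        return docId;
    }

    /** Returns the ids of the documents containing the term, in ascending order. */
    List<Integer> docsFor(String term) {
        Map<Integer, Integer> p = postings.get(term.toLowerCase(Locale.ROOT));
        return p == null ? new ArrayList<>() : new ArrayList<>(p.keySet());
    }

    /** Returns how many times the term occurs in the given document (0 if absent). */
    int termFreq(String term, int docId) {
        Map<Integer, Integer> p = postings.get(term.toLowerCase(Locale.ROOT));
        return p == null ? 0 : p.getOrDefault(docId, 0);
    }

    /** Returns the number of documents that contain the term. */
    int docFreq(String term) {
        return docsFor(term).size();
    }

    public static void main(String[] args) {
        ToyInvertedIndex index = new ToyInvertedIndex();
        // Controlled vocabulary: only "apple", "banana" and "cherry" ever appear.
        index.addDocument("apple banana");
        index.addDocument("banana cherry banana");
        index.addDocument("cherry");

        System.out.println("docs with banana: " + index.docsFor("banana"));
        System.out.println("tf(banana, doc 1): " + index.termFreq("banana", 1));
        System.out.println("df(cherry): " + index.docFreq("cherry"));
    }
}
```

Because the vocabulary is fixed, you can predict every postings list by hand and compare it with what the program reports. The same trick works once you switch to a real Lucene index and start inspecting its terms.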

Also, this mailing list and the java-dev mailing list have a wealth of information about the internals of Lucene, so please dig through the archives and ask questions here as well.

-Grant

On Dec 22, 2007, at 9:10 PM, Berlin Brown wrote:

Do you guys have article links or other documents that describe the
Lucene database? E.g., what is it composed of?

--
Berlin Brown
http://botspiritcompany.com/botlist/spring/help/about.html

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


--------------------------
Grant Ingersoll
http://lucene.grantingersoll.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ



