I would start at the Lucene Java home page
(http://lucene.apache.org/java) and dig in from there. There are a
number of good docs on Scoring and the IR model used (Boolean plus
Vector Space). Next, I would dig into the javadocs and whip up some
example code that indexes a set of tokens and documents with a
controlled vocabulary. After that, you can dig into the source itself,
especially the new DocumentsWriter class. And, of course, along the
way, please feel free to submit documentation patches!
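As a rough illustration of the "controlled vocabulary" experiment
mentioned above, here is a toy inverted index in plain Java. It is only
a sketch of the general idea (terms mapped to the ids of the documents
that contain them) and is not Lucene code: the class ToyInvertedIndex
and its methods addDocument, docsFor and docsForAll are made up for this
example, and Lucene's real index structures are considerably more
involved.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

// Toy model only: hypothetical class, NOT part of Lucene and not its file format.
final class ToyInvertedIndex {
    // term -> ascending set of ids of the documents containing that term
    private final Map<String, SortedSet<Integer>> postings = new TreeMap<>();
    private int docCount = 0;

    // Splits the text on whitespace, lowercases each token, records it,
    // and returns the id assigned to the new document.
    int addDocument(String text) {
        int docId = docCount++;
        for (String token : text.trim().toLowerCase(Locale.ROOT).split("\\s+")) {
            if (token.isEmpty()) {
                continue; // blank input produces one empty token; skip it
            }
            postings.computeIfAbsent(token, t -> new TreeSet<>()).add(docId);
        }
        return docId;
    }

    // Ids of the documents containing the term, ascending; empty if unknown.
    List<Integer> docsFor(String term) {
        SortedSet<Integer> ids = postings.get(term.toLowerCase(Locale.ROOT));
        return ids == null ? new ArrayList<>() : new ArrayList<>(ids);
    }

    // Boolean AND: ids of the documents containing both terms.
    List<Integer> docsForAll(String first, String second) {
        List<Integer> result = docsFor(first);
        result.retainAll(docsFor(second));
        return result;
    }

    public static void main(String[] args) {
        ToyInvertedIndex index = new ToyInvertedIndex();
        index.addDocument("red green blue"); // doc 0
        index.addDocument("green yellow");   // doc 1
        index.addDocument("blue green red"); // doc 2
        System.out.println("green -> " + index.docsFor("green"));                 // green -> [0, 1, 2]
        System.out.println("red AND blue -> " + index.docsForAll("red", "blue")); // red AND blue -> [0, 2]
    }
}
```

With a small, known vocabulary like this you can predict every posting
list by hand, which makes it easier to compare against what a real
Lucene index returns for the same documents.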
Also, this mailing list and the java-dev mailing list have a wealth of
information about the internals of Lucene, so please dig through the
archives and ask questions here as well.
-Grant
On Dec 22, 2007, at 9:10 PM, Berlin Brown wrote:
Do you guys have article links or other documents that describe the
Lucene database? E.g., what is it composed of?
--
Berlin Brown
http://botspiritcompany.com/botlist/spring/help/about.html
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------
Grant Ingersoll
http://lucene.grantingersoll.com
Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ