If I create an standard index, what does Lucene store in this index?
What should be stored in an index at least? Just a link to the file and keywords? Or also wordnumbers? What else?
Does somebody know a paper which discusses this problem of "what to put in an good universal IR index" ?
Well if you want a textbook I found "Managing Gigabytes" to have excellent coverage of the internals and messy details of search/indexes.
http://www.amazon.com/exec/obidos/ASIN/1558605703/tropoA http://www.cs.mu.oz.au/mg/
Cheers, Karl
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
