Lucene creates an inverted index and uses it to search. Frequency is encoded in the .frq files: http://lucene.apache.org/java/docs/fileformats.html
-Yonik

On Wed, Jul 2, 2008 at 10:04 PM, blazingwolf7 <[EMAIL PROTECTED]> wrote:
>
> Hmmm, I don't think I get it. How is it tracked during index time? I indexed
> my file earlier. Later I will open the index and perform a search. Shouldn't
> the frequency of each term in each document found be calculated during
> the searching process?
>
>
> Yonik Seeley wrote:
>>
>> The frequency is tracked at index time. It's simply a read at query
>> time. See TermDocs.
>> If you really want to understand more about the code internals of
>> Lucene, I'd suggest stepping through more example queries with a
>> debugger.
>>
>> -Yonik
>>
>> On Wed, Jul 2, 2008 at 8:49 PM, blazingwolf7 <[EMAIL PROTECTED]>
>> wrote:
>>>
>>> Thanks, I am clear on that now. But does anyone know where the frequency
>>> of the term for each document is calculated? I mean, which class and
>>> which method might it be in?
>>> Thanks
>>>
>>>
>>> Uwe Schindler wrote:
>>>>
>>>> A VInt is how integers are stored in the index file in a compressed,
>>>> variable-length manner.
>>>>
>>>> Read here: http://lucene.apache.org/java/2_3_2/fileformats.html#VInt
>>>>
>>>> -----
>>>> Uwe Schindler
>>>> H.-H.-Meier-Allee 63, D-28213 Bremen
>>>> http://www.thetaphi.de
>>>> eMail: [EMAIL PROTECTED]
>>>>
>>>>> -----Original Message-----
>>>>> From: blazingwolf7 [mailto:[EMAIL PROTECTED]
>>>>> Sent: Wednesday, July 02, 2008 11:47 AM
>>>>> To: java-dev@lucene.apache.org
>>>>> Subject: readVInt, what is it for?
>>>>>
>>>>>
>>>>> Hi,
>>>>>
>>>>> I am fairly new to Lucene and am currently going through its source
>>>>> code. I am trying to determine how Lucene calculates the frequency
>>>>> of a term in each document where it is found.
>>>>>
>>>>> I encountered a method named readVInt() in the IndexInput class. It
>>>>> seems that every time this method is called, it is able to generate
>>>>> the document number and the frequency of the term in each document.
>>>>>
>>>>> I am wondering how it works, and I have failed to find any information
>>>>> on it on the Internet. Could anyone explain it to me? Thanks
>>>>> --
>>>>> View this message in context:
>>>>> http://www.nabble.com/readVInt%2C-what-is-it-for--tp18233802p18233802.html
>>>>> Sent from the Lucene - Java Developer mailing list archive at
>>>>> Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
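
To make the VInt and .frq points from this thread concrete, here is a minimal sketch of how a run of VInts can be turned back into (document number, frequency) pairs. It assumes the layout described in the 2.3.2 file formats document linked above: each VInt byte carries 7 data bits, low-order group first, with the high bit meaning "another byte follows"; and each .frq entry is a DocDelta whose upper bits are the gap from the previous document number and whose lowest bit, when set, means the frequency is 1 (otherwise the frequency follows as its own VInt). The class name VIntSketch and the method decodeTermFreqs are hypothetical and not part of Lucene; Lucene itself reads these values through IndexInput.readVInt() and exposes them via TermDocs. Skip data and any later format changes are ignored here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class, not part of Lucene.
public class VIntSketch {
    private final byte[] buf;
    private int pos = 0;

    public VIntSketch(byte[] buf) {
        this.buf = buf;
    }

    // Reads one VInt: 7 data bits per byte, low-order group first.
    // A set high bit means at least one more byte follows.
    public int readVInt() {
        byte b = buf[pos++];
        int value = b & 0x7F;
        for (int shift = 7; (b & 0x80) != 0; shift += 7) {
            b = buf[pos++];
            value |= (b & 0x7F) << shift;
        }
        return value;
    }

    // Decodes docFreq entries of {docNumber, freq} from .frq-style bytes.
    // Assumes the DocDelta / Freq layout of the 2.3.2 file formats document.
    public static List<int[]> decodeTermFreqs(byte[] data, int docFreq) {
        VIntSketch in = new VIntSketch(data);
        List<int[]> postings = new ArrayList<>();
        int doc = 0;
        for (int i = 0; i < docFreq; i++) {
            int docDelta = in.readVInt();
            doc += docDelta >>> 1;                 // gap from the previous doc number
            int freq = (docDelta & 1) != 0
                    ? 1                            // odd DocDelta: frequency is one
                    : in.readVInt();               // even DocDelta: frequency follows
            postings.add(new int[] {doc, freq});
        }
        return postings;
    }

    public static void main(String[] args) {
        // A term occurring once in document 7 and three times in document 11
        // is stored as the VInts 15, 8, 3.
        byte[] frq = {15, 8, 3};
        for (int[] p : decodeTermFreqs(frq, 2)) {
            System.out.println("doc=" + p[0] + " freq=" + p[1]);
        }
    }
}
```

This is why the frequency is "simply a read at query time": the counting happened when the document was indexed, and the search side only decodes the stored numbers.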