Hi,
I've been going through the source code, attempting to find the exact point in time
where the score for each document is calculated and the methods that do this.
I've ended up very confused! Methods such as IndexReader.docFreq(Term t), which are then
used by Query.scorer(...), are declared
Hi again!
I should really reword my question as follows:
On which criteria are relevant documents chosen for a particular query and,
once retrieved, how are these documents ranked?
The techniques by which this is done will then determine what type of IR model Lucene
implements.
Thanks
messages regarding their contents. You might want
to take a look at the archives
(http://www.mail-archive.com/lucene-dev@jakarta.apache.org/) or, better yet,
the source code ;-)
Regards,
--Daniel
-Original Message-
From: Melissa Mifsud [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, 6
Hi,
Does anyone know the significance of the files that are generated by Lucene? I know
they are essentially the term index, however I need to have a full understanding of
them.
Also, they look encrypted... can anyone confirm this?
Melissa
Hi!
Can anyone tell me what kind of indexer Lucene is? Statistical, Probabilistic,
Boolean, Extended Boolean?
I can't seem to find the answer in any documentation or article and it's really
important that I know the type before I use Lucene in my application!
Thanks!
Melissa
Hi,
Is it necessary to strip the HTML tags from HTML documents BEFORE telling Lucene to
index them? Does Lucene do this or will it index the tags too?!
Melissa