Document Scoring

2002-04-04 Thread Melissa Mifsud
Hi, I've been going throught the source code, attempting to find the exact point in time where the score for each document is calculated and the methods that do this. I've ended up very confused! Methods such as IndexReader.docFreq(Tem t) which are then used by Query.scorer(...) are declared

What type of indexer is Lucene? Question reworded.

2002-03-07 Thread Melissa Mifsud
Hi again! I should really reword my question as follows: On which criteria are relevant documents chosen given a particular query and once retrieved, how are these documents ranked? The techniques by which this is done will then determine what type of IR model Lucene implements. Thanks

Re: Lucene-created files

2002-03-07 Thread Melissa Mifsud
messages regarding their contents. You might want to take a look at the archives (http://www.mail-archive.com/lucene-dev@jakarta.apache.org/) or, better yet, the source code ;-) Regards, --Daniel -Original Message- From: Melissa Mifsud [mailto:[EMAIL PROTECTED]] Sent: quarta-feira, 6 de

Lucene-created files

2002-03-06 Thread Melissa Mifsud
Hi, Does anyone know the significance of the files that are generated by Lucene? I know they are essentially the term index, however I need to have a full understanding of them. Also, they look encrypted... can anyone confirm this? Melissa

What type of indexer is Lucene?

2002-03-05 Thread Melissa Mifsud
Hi! Can anyone tell me what kind of indexer Lucene is? Statistical, Probabilistic, Boolean, Extended Boolean? I can't seem to find the answer in any documentation or article and it's really important that I know the type before I use Lucene in for application! Thanks! Melissa

Indexing HTML with Lucene

2002-03-05 Thread Melissa Mifsud
Hi, Is it necessary to strip the HTML tags from HTML documents BEFORE telling Lucene to index them? Does Lucene do this or will it index the tags too?! Melissa