Hi,
I need to search a bunch of documents.Each document needs to be searched
only once. That means once I build the index and search it, I have no need
for that index and the document again.
The number of documents to be searched in the above process can be very
large. If this process needs to
Wouldn't field boosting (the new capability added as of
http://www.mail-archive.com/lucene-dev@jakarta.apache.org/msg01727.html)
be a simpler solution? I would just set the boost for the 'keywords'
field to something higher than one depending on your requirements. As
for the value of the booster,
I had the same problem as Brian. But since you have to rewrite the query
anyway to do a query in two different fields it makes no difference if you
use term or field boosting. Performance is the same.
For new applications I'd say field boosting is a little simpler because you
save on some
Content-Transfer-Encoding: 7bit
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2600.
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2600.
X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N
X-RCPT-TO: [EMAIL PROTECTED]
Status: U
X-UIDL: 332315228
I had the
Mailing Lists Account wrote:
I need to search a bunch of documents.Each document needs to be searched
only once. That means once I build the index and search it, I have no need
for that index and the document again.
This does not sound like the problem that Lucene is designed to solve.
i am just starting to use lucene, and it it very impressive! I hope to try
Dmitri's new term vectors when he gets them in, in order to do vector model
research, in particular LSA. i will port my existing code to use lucene
framework, and make it available when it is ready.
I am appending a