Re: document boost not showing up in Explanation

2004-12-28 Thread Paul Elschot
On Tuesday 28 December 2004 08:37, Erik Hatcher wrote: On Dec 27, 2004, at 9:54 PM, Vikas Gupta wrote: I am using lucene-1.4.1.jar(with nutch). For some reason, the effect of document boost is not showing up in the search results. Also, why is it not a part of the Explanation It

time of indexer

2004-12-28 Thread Daniel Cortes
Hi to everybody, and merry christmas for all(and specially people who that me today are working instead of stay with the family). I don't understand because my search in the index give this bad results: I index 112 php files how a txt. with this machine Pentium 4 2,4GHz 512 RAM running during

Re: time of indexer

2004-12-28 Thread Nader Henein
Download Luke, it makes life easy when you inspect the index, so you an actually look at what you've indexed, as opposed to what you may think you indexed. Nader Daniel Cortes wrote: Hi to everybody, and merry christmas for all(and specially people who that me today are working instead of

Re: how often to optimize?

2004-12-28 Thread aurora
Are not optimized indices causing you any problems (e.g. slow searches, high number of open file handles)? If no, then you don't even need to optimize until those issues become... issues. OK I have changed the process to not doing optimize() at all. So far so good. The number of files hover

Re: how often to optimize?

2004-12-28 Thread Otis Gospodnetic
Correct. The self-maintenance you are referring to is Lucene's periodic segment merging. The frequency of that can be controlled through IndexWriter's mergeFactor. Otis --- aurora [EMAIL PROTECTED] wrote: Are not optimized indices causing you any problems (e.g. slow searches, high number

QueryParser, default operator

2004-12-28 Thread Paul
Hi, the following code QueryParser qp = new QueryParser(itemContent, analyzer); qp.setOperator(org.apache.lucene.queryParser.QueryParser.DEFAULT_OPERATOR_AND); Query query = qp.parse(line, itemContent, analyzer); doesn't produce the expected result because a query foo bar results in:

how to get most frequented terms from hits

2004-12-28 Thread Miro Max
Hello, is ist possible to get most frequented terms from hits? thx miro ___ Gesendet von Yahoo! Mail - Jetzt mit 250MB Speicher kostenlos - Hier anmelden: http://mail.yahoo.de

Re: Translation

2004-12-28 Thread Gimmy Pegoraro
Hi, I'm the author of the following italian document posted on this mailing list from Tun Lin the 3th December 2003. Sorry for the huge delay of this reply, but I've just come back here after a very long time. That document is referred to Lucy, a Java application I developed using Lucene and

Asking Questions in a Search

2004-12-28 Thread aneesha
Hi Is it possible to do something like this with lucene: http://www.verity.com/products/response/index.html Thanks - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: Asking Questions in a Search

2004-12-28 Thread Chuck Williams
Verity acquired Native Minds -- Verity Response appears to be that technology. It is not search technology at all -- rather it is a programmed question-answer script knowledge base. IMO, there are much better commercial solutions to this problem; e.g., see www.inquira.com, which integrates

Re: Word co-occurrences counts

2004-12-28 Thread Andrew Cunningham
Thanks Doug, This appears to works like a charm. Doug Cutting wrote: Doug Cutting wrote: You could use a custom Similarity implementation for this query, where tf() is the identity function, idf() returns 1.0, etc., so that the final score is the occurance count. You'll need to divide by