Ranking Terms

2005-02-25 Thread Daniel Cortes
Hi everybody, I need to found some documentation about the algorithms that lucene use internally in the indexation and how it works with weights and frequencies of the terms.This information will be used to know tastes of my users and to relate users with the same interest and restlessness.:D I

Bug 23650 (aka docs out of order)?

2005-02-25 Thread petite_abeille
Re: http://issues.apache.org/bugzilla/show_bug.cgi?id=23650 Hello, I'm pretty confident that I'm misusing Lucene one way or another... and of course it was just a question of time before I ran into this docs out of order exception: java.lang.IllegalStateException: docs out of order at

Re: 1.4.x TermInfosWriter.indexInterval not public static ?

2005-02-25 Thread Doug Cutting
Kevin A. Burton wrote: Whats the desired pattern of using of TermInfosWriter.indexInterval ? There isn't one. It is not a part of the public API. It is an unsupported internal feature. Do I have to compile my own version of Lucene to change this? Yes. The last API was public static final but

Search performance with one index vs. many indexes

2005-02-25 Thread Jochen Franke
Topic: Search performance with large numbers of indexes vs. one large index Hello, we are experiencing a performance problem when using large numbers of indexes. We have an application with about 6 Mio. Documents one index of about 7 GB probably 10 to 15 million different words in that index. The

help with boolean expression

2005-02-25 Thread Omar Didi
I have a problem understanding how would lucene iterpret this boolean expression : A AND B OR C . it neither return the same count as when I enter (A AND B) OR C nor A AND (B OR C). if anyone knows how it is interpreted i would be thankful. thanks

Re: help with boolean expression

2005-02-25 Thread Erik Hatcher
On Feb 25, 2005, at 4:19 PM, Omar Didi wrote: I have a problem understanding how would lucene iterpret this boolean expression : A AND B OR C . it neither return the same count as when I enter (A AND B) OR C nor A AND (B OR C). if anyone knows how it is interpreted i would be thankful. Output

Re: Not entire document being indexed?

2005-02-25 Thread Andrzej Bialecki
[EMAIL PROTECTED] wrote: Anyone else has any ideas why wouldn't the whole documents be indexed as described below? Or perhaps someone can enlighten me on how to use Luke to find out if the whole document was indexed or not. I have not used Luke in such capacity before so not sure what to do or

Re: Not entire document being indexed?

2005-02-25 Thread [EMAIL PROTECTED]
Thanks Andrzej and Pasha for your prompt replies and suggestions. I will try everything you have suggested and report back on the findings! regards -pedja Pasha Bizhan said the following on 2/25/2005 6:32 PM: Hi, whole document was indexed or not. Luke can help you to give an answer the

Re: 1.4.x TermInfosWriter.indexInterval not public static ?

2005-02-25 Thread Chris Hostetter
: Whats the desired pattern of using of TermInfosWriter.indexInterval ? : : There isn't one. It is not a part of the public API. It is an : unsupported internal feature. : It was never public. It used to be static and final, but is now an : instance variable. : The place to put

Sorting date stored in milliseconds time

2005-02-25 Thread Ben
Hi I store my date in milliseconds, how can I do a sort on it? SortField has INT, FLOAT and STRING. Do I need to create a new sort class, to sort the long value? Thanks Ben - To unsubscribe, e-mail: [EMAIL PROTECTED] For