Re: Scoring

2006-06-13 Thread karl wettin
On Wed, 2006-06-14 at 05:03 +0200, karl wettin wrote: > > > Feel free to let me know what I got wrong. Immediate reflections (given I got it all right): Scorer both match the index and calculate the score. I would like to see a new class, Matcher, that either cont

Re: Scoring

2006-06-13 Thread karl wettin
On Wed, 2006-06-07 at 08:27 -0400, Grant Ingersoll wrote: > I have started something in my sandbox that goes in the xdocs directory > that is going to cover the scoring and how it works (something parallel > in spirit to the file formats documentation). Adding in sequence > diagrams and whateve

[jira] Created: (LUCENENET-5) CJK Tokenizer in NLS fails to stop at end of input buffer.

2006-06-13 Thread Ben Tregenna (JIRA)
CJK Tokenizer in NLS fails to stop at end of input buffer. -- Key: LUCENENET-5 URL: http://issues.apache.org/jira/browse/LUCENENET-5 Project: Lucene.NET Type: Bug Environment: lucene.net.nls.1.3.2.2 on .NET 1.

Re: Java 1.5 was [jira] Updated: (LUCENE-600) ParallelWriter companion to ParallelReader

2006-06-13 Thread Grant Ingersoll
In addition to performance, productivity and functionality benefits, my main argument for 1.5 is that it is used by the vast majority of lucene community members. I am not so sure about this. Perhaps we should take a poll on the user list? Not even sure how that would be managed or counted

Refactored FuzzyTermEnum

2006-06-13 Thread Bob Carpenter
I refactored the org.apache.lucene.search.FuzzyTermEnum edit distance implementation. It now only uses a single pair of arrays, and those never get resized. That required turning the order of text/target around in the loops. You'll see that with the pair of arrays method, they get re-used hand

Re: Fwd: How to combine results from several indices

2006-06-13 Thread Chuck Williams
You can try that approach, but I think you will find it more difficult. E.g., all of the primitive query classes are written specifically to use doc-ids. So, you either need to do you searches separately on each subindex and then write your own routine to join the results, or you would need to re

Re: Java 1.5 was [jira] Updated: (LUCENE-600) ParallelWriter companion to ParallelReader

2006-06-13 Thread Chuck Williams
I think the last discussion ended with the main counter-argument being lack of support by gjc. Current top of GJC News: > *June 6, 2006* RMS approved the plan to use the Eclipse compiler as > the new gcj front end. Work is being done on the |gcj-eclipse| branch; > it can already build libgcj. Thi

Java 1.5 was [jira] Updated: (LUCENE-600) ParallelWriter companion to ParallelReader

2006-06-13 Thread Grant Ingersoll
Well, we have our first Java 1.5 patch... Now that we have had a week or two to digest the comments, do we want to reopen the discussion? Chuck Williams (JIRA) wrote: [ http://issues.apache.org/jira/browse/LUCENE-600?page=all ] Chuck Williams updated LUCENE-600: -

Re: Fwd: How to combine results from several indices

2006-06-13 Thread wu fox
thank you very much Chuck.But I still wondered is there any way that I can revise ParallelReader so that it do not need the same doc id .Can IndexReader comebine different doc according some mapping rules ?for example I can override Document method that combine docs from indices acoording to same