[jira] Commented: (LUCENE-584) Decouple Filter from BitSet

2007-07-26 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515630 ] Paul Elschot commented on LUCENE-584: - Mark, The exhausted flag is only in the iterator/Matcher, not in the

[EMAIL PROTECTED]: Project lucene-java (in module lucene-java) failed

2007-07-26 Thread Jason van Zyl
To whom it may engage... This is an automated request, but not an unsolicited one. For more information please visit http://gump.apache.org/nagged.html, and/or contact the folk at [EMAIL PROTECTED] Project lucene-java has an issue affecting its community integration. This issue affects

Re: [jira] Commented: (LUCENE-584) Decouple Filter from BitSet

2007-07-26 Thread eks dev
Mark Harwood commented on LUCENE-584: Hi Mark, we used to use Filters a lot... and concluded, Matcher is great! It just takes some time to get it into your head, let me try to help you get there :) I saw BitSetMatcher etc and appreciate the motivation behind the design for alternative

RE: Token termBuffer issues

2007-07-26 Thread Steven Parkes
First I create a single large file that has one doc per line from Wikipedia content, using this alg Anybody disagree that the 1-line-per-doc format is better (at least for Wikipedia)? If so, I'll get rid of the intermediate one-file-per-doc step.

Re: Token termBuffer issues

2007-07-26 Thread Yonik Seeley
On 7/26/07, Steven Parkes [EMAIL PROTECTED] wrote: First I create a single large file that has one doc per line from Wikipedia content, using this alg Anybody disagree that the 1-line-per-doc format is better (at least for Wikipedia)? If so, I'll get rid of the intermediate

[jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

2007-07-26 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515840 ] Michael Busch commented on LUCENE-965: -- Can we store the document length (with multiple fields) and average

[jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

2007-07-26 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515850 ] Doug Cutting commented on LUCENE-965: - Did I miss something? What I meant is that the loops added by this

Last attempt

2007-07-26 Thread karl wettin
Some time ago I tried to introduce LUCENE-581, a new consumer top layer, the core changes required by LUCENE-550, my InstantiatedIndex. I would still like to see this as a part of the core. It is completely backwards compatible but contains a few small changes that seem to be controversial,

[jira] Updated: (LUCENE-743) IndexReader.reopen()

2007-07-26 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-743: - Fix Version/s: 2.3 IndexReader.reopen() Key: LUCENE-743

[jira] Updated: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span queries

2007-07-26 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-794: --- Attachment: spanhighlighter10.patch Requested changes have been made. Only relevant file now is

Re: [VOTE] Migrate Lucene to JDK 1.5 for 3.0 release

2007-07-26 Thread Grant Ingersoll
On Jul 26, 2007, at 8:56 PM, Michael Busch wrote: Grant Ingersoll wrote: 2. Release 2.4 so all of Mike M's goodness is available to 1.4 users within the next 2-4 weeks using our new release mechanism (i.e. code Hi Grant, 2-4 weeks seems quite soon considering that 2.2 is very new and that

Re: [VOTE] Migrate Lucene to JDK 1.5 for 3.0 release

2007-07-26 Thread Michael Busch
Grant Ingersoll wrote: 2. Release 2.4 so all of Mike M's goodness is available to 1.4 users within the next 2-4 weeks using our new release mechanism (i.e. code Hi Grant, 2-4 weeks seems quite soon considering that 2.2 is very new and that there are a lot of open issues targeted for 2.3. For

Re: [VOTE] Migrate Lucene to JDK 1.5 for 3.0 release

2007-07-26 Thread Mike Klaas
On 26-Jul-07, at 5:36 PM, Grant Ingersoll wrote: I propose we take the following path for migrating Lucene Java to JDK 1.5: 1. Put in any new deprecations we want, cleanups, etc. 2. Release 2.4 so all of Mike M's goodness is available to 1.4 users within the next 2-4 weeks using our new

[jira] Assigned: (LUCENE-865) SpellChecker not working because of stale IndexSearcher

2007-07-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic reassigned LUCENE-865: --- Assignee: Otis Gospodnetic SpellChecker not working because of stale IndexSearcher

[VOTE] Migrate Lucene to JDK 1.5 for 3.0 release

2007-07-26 Thread Grant Ingersoll
I propose we take the following path for migrating Lucene Java to JDK 1.5: 1. Put in any new deprecations we want, cleanups, etc. 2. Release 2.4 so all of Mike M's goodness is available to 1.4 users within the next 2-4 weeks using our new release mechanism (i.e. code freeze, branch,

Re: The JDK 1.5 Can o' Worms

2007-07-26 Thread Grant Ingersoll
On Jul 25, 2007, at 9:58 PM, Bill Janssen wrote: Frankly, I am amazed at the pace of GCJ, but it is hard to imagine that with so much free code out there doing what they need, that it could take much longer to get 1.5 support. Just guessing, but could it be that the features in 1.5 and 1.6

[jira] Updated: (LUCENE-831) Complete overhaul of FieldCache API/Implementation

2007-07-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated LUCENE-831: Comment: was deleted Complete overhaul of FieldCache API/Implementation

[jira] Commented: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span queries

2007-07-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515911 ] Otis Gospodnetic commented on LUCENE-794: - Mark, wow, long list of files up there. I can't tell which ones

[jira] Commented: (LUCENE-898) contrib/javascript is not packaged into releases

2007-07-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515908 ] Otis Gospodnetic commented on LUCENE-898: - I've emailed Kelvin and asked for his thoughts. Maybe he'll

[jira] Resolved: (LUCENE-781) NPE in MultiReader.isCurrent() and getVersion()

2007-07-26 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch resolved LUCENE-781. -- Resolution: Fixed Committed. MultiSegmentReader.isOptimized() now always returns false.

[jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

2007-07-26 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515854 ] Michael Busch commented on LUCENE-965: -- I guess I would not be in favor of a special term, I would rather see

[jira] Updated: (LUCENE-967) Add tokenize documents only task to contrib/benchmark

2007-07-26 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-967: -- Lucene Fields: [New, Patch Available] (was: [New]) Add tokenize documents only task

[jira] Updated: (LUCENE-967) Add tokenize documents only task to contrib/benchmark

2007-07-26 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-967: -- Attachment: LUCENE-967.patch Attached patch that adds ReadTokensTask.java. I also

[jira] Commented: (LUCENE-781) NPE in MultiReader.isCurrent() and getVersion()

2007-07-26 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515797 ] Doug Cutting commented on LUCENE-781: - I looked into the javadocs built with this patch and I can't see the

[jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

2007-07-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515773 ] Yonik Seeley commented on LUCENE-965: - It does seem like calculating the average field length at index time

[jira] Commented: (LUCENE-966) A faster JFlex-based replacement for StandardAnalyzer

2007-07-26 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515768 ] Mark Miller commented on LUCENE-966: Great patch! And a very quick turnaround to boot! I am seeing a HUGE speed

Re: [jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

2007-07-26 Thread crspan
Hello Hui: Thank you for contributing your axiomatic retrieval function to Lucene. Can not wait for the test drive :) Would you please report your setting for your experiment on Collection Function MAP [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] NumRR ROBUST04

[jira] Created: (LUCENE-966) A faster JFlex-based replacement for StandardAnalyzer

2007-07-26 Thread Stanislaw Osinski (JIRA)
A faster JFlex-based replacement for StandardAnalyzer - Key: LUCENE-966 URL: https://issues.apache.org/jira/browse/LUCENE-966 Project: Lucene - Java Issue Type: Improvement

Re: Last attempt

2007-07-26 Thread Grant Ingersoll
Hi Karl, I have seen this and have always thought I should spend some time on it, but then didn't get to it. That isn't to say it isn't useful. I think one thing I wonder about is if there is a way it could be a standalone contrib package or maybe there is a way to separate out the

[jira] Commented: (LUCENE-966) A faster JFlex-based replacement for StandardAnalyzer

2007-07-26 Thread Paul Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515882 ] Paul Smith commented on LUCENE-966: --- We did pretty much the same thing here at Aconex. The tokenization

[jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

2007-07-26 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515846 ] Grant Ingersoll commented on LUCENE-965: I guess I would not be in favor of a special term, I would rather

[jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

2007-07-26 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515803 ] Doug Cutting commented on LUCENE-965: - It does seem like calculating the average field length at index time

[jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

2007-07-26 Thread Charlie Zhao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515783 ] Charlie Zhao commented on LUCENE-965: - Hello Hui: Thank you for contributing your axiomatic retrieval function

[jira] Commented: (LUCENE-966) A faster JFlex-based replacement for StandardAnalyzer

2007-07-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515726 ] Yonik Seeley commented on LUCENE-966: - Thanks Staszek, very nice! If FastTokenizer is compatible with

[jira] Updated: (LUCENE-966) A faster JFlex-based replacement for StandardAnalyzer

2007-07-26 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski updated LUCENE-966: - Attachment: AnalyzerBenchmark.java Here is a very simple benchmark I used to test the

[jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene

2007-07-26 Thread Charlie Zhao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515833 ] Charlie Zhao commented on LUCENE-965: - Document Length and Average Document Length are sort of speed

[jira] Created: (LUCENE-967) Add tokenize documents only task to contrib/benchmark

2007-07-26 Thread Michael McCandless (JIRA)
Add tokenize documents only task to contrib/benchmark --- Key: LUCENE-967 URL: https://issues.apache.org/jira/browse/LUCENE-967 Project: Lucene - Java Issue Type: Improvement

[EMAIL PROTECTED]: Project lucene-java (in module lucene-java) failed

2007-07-26 Thread Jason van Zyl
To whom it may engage... This is an automated request, but not an unsolicited one. For more information please visit http://gump.apache.org/nagged.html, and/or contact the folk at [EMAIL PROTECTED] Project lucene-java has an issue affecting its community integration. This issue affects

[jira] Updated: (LUCENE-836) Benchmarks Enhancements (precision/recall, TREC, Wikipedia)

2007-07-26 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-836: --- Attachment: lucene-836.benchmark.quality.patch A ready to commit patch for search quality