[jira] Commented: (LUCENE-834) Payload Queries

2007-04-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491386 ] Grant Ingersoll commented on LUCENE-834: I applied and committed this patch > Payload Queries >

[jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491387 ] Grant Ingersoll commented on LUCENE-848: Hi Steven, Do you know what version of Xerces and xml-apis these ar

[jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491389 ] Grant Ingersoll commented on LUCENE-848: I'm getting: Getting: http://download.wikimedia.org/enwiki/latest/

[jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491393 ] Steven Parkes commented on LUCENE-848: -- Both jars are from xerces-2.9.0. > Add supported for Wikipedia English

[jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491394 ] Doron Cohen commented on LUCENE-848: I haven't tried this patch yet - hesitated/thinking it must take very long t

[jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491396 ] Steven Parkes commented on LUCENE-848: -- Yeah, it takes a while to download. I added the jars since that's what

[jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491397 ] Grant Ingersoll commented on LUCENE-848: +1 Not a big deal to go get the files via an ANT task. Of course,

[jira] Commented: (LUCENE-869) Make FSIndexInput and FSIndexOutput inner classes of FSDirectory

2007-04-24 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491401 ] Doug Cutting commented on LUCENE-869: - Heh. These classes precede the addition of nested classes to Java. They'

[jira] Updated: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Parkes updated LUCENE-848: - Attachment: LUCENE-848.txt Here's the patch with the README. By the way, there's also a .rsync-f

Re: [jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Grant Ingersoll
Is there a way to pick a specific day, versus "latest". How long does Wikipedia archive? Always using the latest makes comparisons more difficult. I wonder if licensing terms would allow us to host a specific date of the version on Lucene zones. Of course, that may not be a good idea ba

RE: [jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Steven Parkes
They don't seem to keep things around too long. There were more files available when I downloaded earlier this month, but they're already gone. Wikipedia is supposed to only contain stuff covered by the GNU Free Documentation License so saving it should be okay. In fact, one of the other files you

Re: [jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Grant Ingersoll
Well, there would be issues if we tried to do precision/recall/ quality of results benchmarks on it, I think. I can ask on legal- discuss. This is the license I found: http://en.wikipedia.org/wiki/ Wikipedia:Text_of_the_GFDL found via http://en.wikipedia.org/wiki/ Wikipedia:Database_download

RE: [jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-24 Thread Steven Parkes
You're right. Shows you my mind's been on performance, rather than scoring. My interpretation of the license was that it was pretty broad and we could host a fixed copy if we wanted to. But I'm not a lawyer, so ... -Original Message- From: Grant Ingersoll [mailto:[EMAIL PROTECTED] Sent:

[jira] Assigned: (LUCENE-789) Custom similarity is ignored when using MultiSearcher

2007-04-24 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen reassigned LUCENE-789: -- Assignee: Doron Cohen > Custom similarity is ignored when using MultiSearcher > ---

[jira] Updated: (LUCENE-789) Custom similarity is ignored when using MultiSearcher

2007-04-24 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-789: --- Attachment: 789_patch.txt Thanks for the test case, Alexey! Problem was in MultiSearcher.CachedDfSou

[jira] Updated: (LUCENE-789) Custom similarity is ignored when using MultiSearcher

2007-04-24 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen updated LUCENE-789: --- Lucene Fields: [Patch Available] (was: [New]) > Custom similarity is ignored when using MultiSearche