Now, a lost data problem with trunk too

2010-09-14 Thread karl.wright
Hi folks, It looks like the handle leak may be real - Simon Willnauer has been looking at it and could not find an explanation for the behavior I have been seeing. But before we got too far on that problem, I encountered what appears to be an even more serious problem. Specifically, I'm

Re: Now, a lost data problem with trunk too

2010-09-14 Thread Simon Willnauer
On Tue, Sep 14, 2010 at 10:37 AM, karl.wri...@nokia.com wrote: Hi folks, It looks like the handle leak may be real - Simon Willnauer has been looking at it and could not find an explanation for the behavior I have been seeing.   But before we got too far on that problem, I encountered what

Re: Fwd: Trunk file handle leak?

2010-09-14 Thread Simon Willnauer
An update on this, the error was on my side doubly incrementing the searcher reference. no problem on trunk! simon On Fri, Sep 10, 2010 at 10:04 PM, Simon Rosenthal simon.rosent...@yahoo.com wrote: Karl: I reported something very similar a few months back and opened a Jira issue - see

[jira] Commented: (SOLR-2106) Spelling Checking for Multiple Fields

2010-09-14 Thread JAYABAALAN V (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909157#action_12909157 ] JAYABAALAN V commented on SOLR-2106: what is procedure to download the SOLR-2010.patch

[jira] Commented: (SOLR-1900) move Solr to flex APIs

2010-09-14 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909160#action_12909160 ] Michael McCandless commented on SOLR-1900: -- I think it makes sense to move append

RE: Now, a lost data problem with trunk too

2010-09-14 Thread karl.wright
Yes. Of course. My oversight. So I did the obvious thing and searched for the value field directly, and it is there: str name=idPOI|DEU:205:20187477:1014564|brandenburger tor/strstr name=languageger/strstr name=latitude52.39935/strstr name=longitude13.04793/strstr name=referencebrandenburger

Re: [jira] Commented: (SOLR-2106) Spelling Checking for Multiple Fields

2010-09-14 Thread Erick Erickson
See: http://wiki.apache.org/solr/HowToContribute#Working_With_Patches http://wiki.apache.org/solr/HowToContribute#Working_With_PatchesErick On Tue, Sep 14, 2010 at 5:11 AM, JAYABAALAN V (JIRA) j...@apache.orgwrote: [

[jira] Commented: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909214#action_12909214 ] Robert Muir commented on LUCENE-2504: - {quote} Java (Oracle) really needs to do

[jira] Commented: (SOLR-1682) Implement CollapseComponent

2010-09-14 Thread Varun Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909223#action_12909223 ] Varun Gupta commented on SOLR-1682: --- Is there any workaround to use Highlight and Facet

[jira] Commented: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909230#action_12909230 ] Simon Willnauer commented on LUCENE-2504: - bq. I think we all owe it to ourselves

Re: Whither ORP?

2010-09-14 Thread Grant Ingersoll
On Sep 13, 2010, at 12:33 PM, Itamar Syn-Hershko wrote: With the proper two-way open-source development process (taking and then giving) I think it can become an important part of open-IR technologies, just like what Lucene did to the search engines world. What ORP has to offer is of

Re: Whither ORP?

2010-09-14 Thread Grant Ingersoll
I think the biggest hurdle we have in front of us is curating a data set that we can redistribute. I'm in the process of uploading all the ASF public mail archives as of Sept. 13 to Amazon S3. I also have some tools (thanks to Chris Rhodes) for processing this into Solr XML. I think this

[jira] Commented: (LUCENE-2643) StringHelper#stringDifference is wrong about supplementary chars

2010-09-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909253#action_12909253 ] Robert Muir commented on LUCENE-2643: - My vote would be to drop it if we arent using

[jira] Updated: (LUCENE-2643) StringHelper#stringDifference is wrong about supplementary chars

2010-09-14 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2643: Attachment: LUCENE-2643.patch here is a patch StringHelper#stringDifference is wrong

[jira] Commented: (LUCENE-2643) StringHelper#stringDifference is wrong about supplementary chars

2010-09-14 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909256#action_12909256 ] Simon Willnauer commented on LUCENE-2643: - bq. since its unused, its not obvious

[jira] Commented: (LUCENE-2643) StringHelper#stringDifference is wrong about supplementary chars

2010-09-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909260#action_12909260 ] Robert Muir commented on LUCENE-2643: - drop in trunk and mark deprecated in 3.x?

Re: Whither ORP?

2010-09-14 Thread Dan Cardin
Hello, This is a great start! I am interested in helping with the development of a crowd sourcing application. The next step would be creating a set of requirements for the web app. Would the ORP wiki be a good place to store the requirements? --Dan On Tue, Sep 14, 2010 at 9:51 AM, Grant

Re: Whither ORP?

2010-09-14 Thread Robert Muir
On Tue, Sep 14, 2010 at 10:22 AM, Dan Cardin dcardin2...@gmail.com wrote: Hello, This is a great start! I am interested in helping with the development of a crowd sourcing application. The next step would be creating a set of requirements for the web app. Would the ORP wiki be a good place

[jira] Commented: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909272#action_12909272 ] Yonik Seeley commented on LUCENE-2504: -- Looks like we're not using the correct

Re: Whither ORP?

2010-09-14 Thread Simon Willnauer
On Tue, Sep 14, 2010 at 4:30 PM, Robert Muir rcm...@gmail.com wrote: On Tue, Sep 14, 2010 at 10:22 AM, Dan Cardin dcardin2...@gmail.com wrote: Hello, This is a great start! I am interested in helping with the development of a crowd sourcing application. The next step would be creating a set

[jira] Updated: (LUCENE-2630) make the build more friendly to apache harmony

2010-09-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2630: Attachment: LUCENE-2630.patch The harmony developers applied the UTF-8 fix (HARMONY-6640), so we

Re: exceptions from solr/contrib/dataimporthandler and solr/contrib/extraction

2010-09-14 Thread Grant Ingersoll
On Sep 13, 2010, at 1:59 PM, Lance Norskog wrote: What I want you to do is, I want you to find the guys who are putting all the bugs in the code, and I want you to FIRE THEM! He who is without bugs in their code, may be the first to fire. -Grant

[jira] Updated: (LUCENE-2630) make the build more friendly to apache harmony

2010-09-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2630: Attachment: LUCENE-2630_charutils.patch make the build more friendly to apache harmony

[jira] Commented: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909312#action_12909312 ] Michael McCandless commented on LUCENE-2504: bq. I'm now getting many

[jira] Updated: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated LUCENE-2504: - Attachment: LUCENE-2504.patch Attaching a draft patch that seems to fix the issue (the ones I

[jira] Commented: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909337#action_12909337 ] Michael McCandless commented on LUCENE-2504: {quote} I think we all owe it to

[jira] Reopened: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reopened LUCENE-2504: bq. Yes, but FieldValueHitQueue has it's own list of comparators that never get

[jira] Updated: (LUCENE-2575) Concurrent byte and int block implementations

2010-09-14 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-2575: - Attachment: LUCENE-2575.patch Term frequency is recorded and returned. There are

Re: Whither ORP?

2010-09-14 Thread Itamar Syn-Hershko
On 14/9/2010 4:22 PM, Dan Cardin wrote: Hello, This is a great start! I am interested in helping with the development of a crowd sourcing application. The next step would be creating a set of requirements for the web app. Would the ORP wiki be a good place to store the requirements? --Dan

Re: Whither ORP?

2010-09-14 Thread Itamar Syn-Hershko
On 14/9/2010 3:44 PM, Grant Ingersoll wrote: If you can, putting them up as a patch would be useful. That way, we can show some progress. I will, but first it needs to be workable. It is 80% done, but still not that usable. I expect to be able to work on it again in a month or so. Or

[jira] Commented: (LUCENE-2630) make the build more friendly to apache harmony

2010-09-14 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909347#action_12909347 ] Simon Willnauer commented on LUCENE-2630: - bq. Here's the patch for

Re: Whither ORP?

2010-09-14 Thread Dan Cardin
Hello, I will begin documenting some basic requirements for a crowd sourcing web app. I will use some of the work done by Itamar as a basis for the requirements. --Dan On Tue, Sep 14, 2010 at 1:18 PM, Itamar Syn-Hershko ita...@code972.comwrote: On 14/9/2010 3:44 PM, Grant Ingersoll wrote:

[jira] Updated: (LUCENE-2630) make the build more friendly to apache harmony

2010-09-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2630: Attachment: LUCENE-2630_intl.patch here's a patch for the internationalization differences, since

[jira] Commented: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909407#action_12909407 ] Yonik Seeley commented on LUCENE-2504: -- bq. The open question is whether this hotspot

[jira] Commented: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909456#action_12909456 ] Yonik Seeley commented on LUCENE-2504: -- OK, I've committed the fix to always use the

[jira] Created: (SOLR-2120) Facet Field Value truncation

2010-09-14 Thread Niall O'Connor (JIRA)
Facet Field Value truncation Key: SOLR-2120 URL: https://issues.apache.org/jira/browse/SOLR-2120 Project: Solr Issue Type: Bug Components: search Affects Versions: 1.4.1 Reporter: Niall

[jira] Created: (LUCENE-2644) LowerCaseTokenizer Does Not Behave As One Might Expect (or Desire)--Given Its Name

2010-09-14 Thread Scott Gonyea (JIRA)
LowerCaseTokenizer Does Not Behave As One Might Expect (or Desire)--Given Its Name -- Key: LUCENE-2644 URL: https://issues.apache.org/jira/browse/LUCENE-2644 Project:

[jira] Updated: (LUCENE-2644) LowerCaseTokenizer Does Not Behave As One Might Expect (or Desire)--Given Its Name

2010-09-14 Thread Scott Gonyea (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Scott Gonyea updated LUCENE-2644: - Attachment: LowerCaseTokenizer.patch This patch will retain original functionality, while

[jira] Updated: (LUCENE-2504) sorting performance regression

2010-09-14 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated LUCENE-2504: - Attachment: LUCENE-2504_SortMissingLast.patch This was a simple attempt to try and simplify the

[jira] Updated: (LUCENE-2575) Concurrent byte and int block implementations

2010-09-14 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-2575: - Attachment: LUCENE-2575.patch Added a unit test for payloads, term vectors, and doc

[jira] Resolved: (SOLR-1194) Query Analyzer not Invoking for Custom FiledType - When we use Custom QParser Plugin

2010-09-14 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man resolved SOLR-1194. Resolution: Invalid This sounds like a bug in your custom QParser -- the QParser is what calls the

[jira] Commented: (SOLR-2119) IndexSchema should log warning if analyzer is declared with charfilter/tokenizer/tokenfiler out of order

2010-09-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909511#action_12909511 ] Robert Muir commented on SOLR-2119: --- {quote} There seems to be a segment of hte user

[jira] Created: (SOLR-2121) distributed highlighting using q.alt=*:* causes NPE in finishStages

2010-09-14 Thread Hoss Man (JIRA)
distributed highlighting using q.alt=*:* causes NPE in finishStages --- Key: SOLR-2121 URL: https://issues.apache.org/jira/browse/SOLR-2121 Project: Solr Issue Type: Bug

[jira] Commented: (SOLR-2121) distributed highlighting using q.alt=*:* causes NPE in finishStages

2010-09-14 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909514#action_12909514 ] Hoss Man commented on SOLR-2121: Marc Sturlese posted his fix but it's not entirely obvious

Current trunk example woes...

2010-09-14 Thread Erick Erickson
If I check out the current trunk, and from solr do an ant clean example all is well, even up to starting Solr. But trying to hit anything on the site gives a response in the browser starting with: org.apache.solr.common.SolrException: Plugin init failure for [schema.xml] fieldType:Error loading

Obsolete instructions for Velocity ResponseWriter on the Wiki

2010-09-14 Thread Erick Erickson
For trunk, the instructions here: http://wiki.apache.org/solr/VelocityResponseWriter about starting up VRW/Solaritas are obsolete I think. It looks like all this has been folded into core. I'll go up and add some notes for trunk/1.5 unless someone objects. Erick

Build failed in Hudson: Lucene-3.x #115

2010-09-14 Thread Apache Hudson Server
See https://hudson.apache.org/hudson/job/Lucene-3.x/115/changes Changes: [rmuir] LUCENE-2630: look for the correct exception according to javadoc contract [gsingers] SOLR-1568: move DistanceUtils up a package [gsingers] SOLR-1568: backport to 3.x [rmuir] LUCENE-2630: allow lucene to be built

/trunk sortMissingLast=true status?

2010-09-14 Thread Ryan McKinley
Testing with r997128: I have a field defined as: fieldType name=bytes class=solr.TrieLongField sortMissingLast=true precisionStep=0 omitNorms=true positionIncrementGap=0/ When I call ?sort=bytes desc, everything works as expected, the biggest thigns are first. When I call ?sort=bytes asc, the

Re: Current trunk example woes...

2010-09-14 Thread Yonik Seeley
On Tue, Sep 14, 2010 at 8:16 PM, Erick Erickson erickerick...@gmail.com wrote: If I check out the current trunk, and from solr do an ant clean example all is well, even up to starting Solr. But trying to hit anything on the site gives a response in the browser starting with:

Re: /trunk sortMissingLast=true status?

2010-09-14 Thread Yonik Seeley
On Tue, Sep 14, 2010 at 9:40 PM, Ryan McKinley ryan...@gmail.com wrote: Testing with r997128: I have a field defined as: fieldType name=bytes  class=solr.TrieLongField sortMissingLast=true precisionStep=0 omitNorms=true positionIncrementGap=0/ SortMissingLast/SortMissingFirst is currently

[jira] Created: (LUCENE-2645) False assertion of 0 position delta in StandardPostingsWriterImpl

2010-09-14 Thread David Smiley (JIRA)
False assertion of 0 position delta in StandardPostingsWriterImpl -- Key: LUCENE-2645 URL: https://issues.apache.org/jira/browse/LUCENE-2645 Project: Lucene - Java Issue Type:

[jira] Commented: (LUCENE-2611) IntelliJ IDEA setup

2010-09-14 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909579#action_12909579 ] Steven Rowe commented on LUCENE-2611: - Once Robert's latest patch on SOLR-2002 gets

[jira] Commented: (LUCENE-2575) Concurrent byte and int block implementations

2010-09-14 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12909580#action_12909580 ] Jason Rutherglen commented on LUCENE-2575: -- For the posting skip list we need to

Build failed in Hudson: Solr-3.x #104

2010-09-14 Thread Apache Hudson Server
See https://hudson.apache.org/hudson/job/Solr-3.x/104/changes Changes: [rmuir] LUCENE-2630: fix intl test bugs that rely on cldr version [rmuir] LUCENE-2630: look for the correct exception according to javadoc contract [gsingers] SOLR-1568: move DistanceUtils up a package [gsingers]