Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Robert Muir
no, i'm not doing any caching but as mentioned it did require some work to become almost completely i/o bound due to the nature of my wacky queries, example removing O(n) behavior from fuzzy and regexp. probably the os cache is not helping much because indexes are very large. I'm very happy being

[jira] Issue Comment Edited: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653214#action_12653214 ] thetaphi edited comment on LUCENE-1470 at 12/3/08 11:48 PM:

[jira] Reopened: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler reopened LUCENE-1470: --- Lucene Fields: [New, Patch Available] (was: [Patch Available, New]) > Add TrieRangeQuery to

[jira] Updated: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1470: -- Attachment: fixbuild-LUCENE-1470.patch Hi Mike, the last Hudson build failed. It seems it did

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread John Wang
Thanks Robert, definitely interested! We are too, looking into SSDs for performance. 2.4 allows you to create extend QueryParser and create your own "leaf" queries. I am surprised you are mostly IO bound. Lucene does a good job caching. Do you do some sort of caching yourself? If your index is not

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Robert Muir
yeah i am using read-only. i will admit to subclassing queryparser and having customized query/scorer for several. all queries contain fuzzy queries so this was necessary. "high" throughput i guess is a matter of opinion. in attempting to profile high-throughput, again customized query/scorer mad

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread John Wang
Thanks Eks for the "education". 1) If you think Lucene is good enough for you, then great. I think there is room for improvement, and wanted to share on some work we did to the rest of the community thru open source. If you are happy to take a snapshot of lucene and build on top of it, then good f

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread John Wang
Thanks Robert for sharing. Good to hear it is working for what you need it to do. 3) Especially with ReadOnlyIndexReaders, you should not be blocked while indexing. Especially if you have multicore machines. 4) do you stay with sub-second responses with high thru-put? -John On Wed, Dec 3, 2008 a

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Robert Muir
On Thu, Dec 4, 2008 at 1:24 AM, John Wang <[EMAIL PROTECTED]> wrote: > Nice! > Some questions: > > 1) one index? > no, but two individual ones today were around 100M docs > 2) how big is your document? e.g. how many terms etc. > last one built has over 4M terms > 3) are you serving(searching) th

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread eks dev
John, sorry I have to comment, but I feel here some substantial missconceptions abot Open Source 1) "e.g. >30 million documents indexed and searched in realtime., and I really had to do some tweaking." So what? What I or anyone else has to do with it? "some tweaking" is definitely better than

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread John Wang
Nice! Some questions: 1) one index? 2) how big is your document? e.g. how many terms etc. 3) are you serving(searching) the docs in realtime? 4) search speed? I'd love to learn more about your architecture. -John On Wed, Dec 3, 2008 at 10:13 PM, Robert Muir <[EMAIL PROTECTED]> wrote: > sorry

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Robert Muir
sorry gotta speak up on this. i indexed 300m docs today. I'm using an out of box jar. yeah i have some special subclasses but if i thought any of this stuff was general enough to be useful to others i'd submit it. I'm just happy to have something scalable that i can customize to my peculiarities.

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread robert engels
My two cents... I think the committers do a great job of managing the product. I feel the single biggest failure when it comes to producing quality software is lack of vision, and/or enforcement of this vision. If every "wisher" or "submitter" had their code committed - even if it is "go

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread John Wang
Grant: I am sorry that I disagree with some points: 1) "I think it's a sign that Lucene is pretty stable." - While lucene is a great project, especially with 2.x releases, great improvements are made, but do we really have a clear picture on how lucene is being used and deployed. While luc

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Andi Vajda (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653139#action_12653139 ] Andi Vajda commented on LUCENE-1390: Yep, I'm leaning that way too. > add ISOLati

Build failed in Hudson: Lucene-trunk #665

2008-12-03 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Lucene-trunk/665/changes Changes: [mikemccand] LUCENE-1457: fix possible overflow bugs during binary search [mikemccand] LUCENE-1470: add TrieRangeQuery, a much more efficient implementation of RangeQuery at the expense of added space consumed in th

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653132#action_12653132 ] Mark Miller commented on LUCENE-1390: - Not to be wishy washy, but deprecating is looki

[jira] Updated: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Andi Vajda (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andi Vajda updated LUCENE-1390: --- Attachment: (was: ISOLatinAccentFilter.java) > add ISOLatinAccentFilter and deprecate ISOLatin1A

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Andi Vajda (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653123#action_12653123 ] Andi Vajda commented on LUCENE-1390: Mark, I attached a new version of the patch with

[jira] Updated: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Andi Vajda (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andi Vajda updated LUCENE-1390: --- Attachment: ASCIIFoldingFilter.patch This latest version supercedes the previous one and moves all s

[jira] Commented: (LUCENE-1473) Implement standard Serialization across Lucene versions

2008-12-03 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653109#action_12653109 ] Yonik Seeley commented on LUCENE-1473: -- bq. The contrib section of Lucene contains an

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653098#action_12653098 ] Uwe Schindler commented on LUCENE-1470: --- bq. I think if you call ExtendedFieldCache.

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653099#action_12653099 ] Mark Miller commented on LUCENE-1390: - bq. does ISOLatin1AccentFilter really need to b

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653087#action_12653087 ] Uwe Schindler commented on LUCENE-1470: --- This is messy and not easy to understand fo

[jira] Commented: (LUCENE-1448) add getFinalOffset() to TokenStream

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653079#action_12653079 ] Michael McCandless commented on LUCENE-1448: OK, me too. I'll move forward wi

[jira] Commented: (LUCENE-1476) BitVector implement DocIdSet

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653080#action_12653080 ] Jason Rutherglen commented on LUCENE-1476: -- BitVector does not implement the meth

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653078#action_12653078 ] Michael McCandless commented on LUCENE-1470: I think if you call ExtendedField

[jira] Commented: (LUCENE-1476) BitVector implement DocIdSet

2008-12-03 Thread robert engels (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653076#action_12653076 ] robert engels commented on LUCENE-1476: --- BitSet is already random access, DocIdSet i

[jira] Commented: (LUCENE-1476) BitVector implement DocIdSet

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653075#action_12653075 ] Jason Rutherglen commented on LUCENE-1476: -- Looks like we need a new abstract cla

[jira] Commented: (LUCENE-1476) BitVector implement DocIdSet

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653070#action_12653070 ] Michael McCandless commented on LUCENE-1476: But, SegmentReader needs random a

[jira] Updated: (LUCENE-1476) BitVector implement DocIdSet

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1476: - Attachment: LUCENE-1476.patch LUCENE-1476.patch BitVector extends DocIdSet. TestBitV

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653066#action_12653066 ] Uwe Schindler commented on LUCENE-1470: --- The question is, how to plugin the whole Fi

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653064#action_12653064 ] Robert Muir commented on LUCENE-1390: - does ISOLatin1AccentFilter really need to be de

[jira] Commented: (LUCENE-1448) add getFinalOffset() to TokenStream

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653065#action_12653065 ] Mark Miller commented on LUCENE-1448: - bq. Another option is to "define" the API such

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653061#action_12653061 ] Mark Miller commented on LUCENE-1390: - Everything looks pretty good to me. If you can

[jira] Resolved: (LUCENE-1457) There are a few binary search implmentations in lucene that suffer from a now well known overflow bug

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1457. Resolution: Fixed Fix Version/s: 2.9 Committed revision 723149. I fixed th

[jira] Commented: (LUCENE-1473) Implement standard Serialization across Lucene versions

2008-12-03 Thread robert engels (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653058#action_12653058 ] robert engels commented on LUCENE-1473: --- Even better. Thanks Mark. > Implement stan

[jira] Commented: (LUCENE-1473) Implement standard Serialization across Lucene versions

2008-12-03 Thread Mark Harwood (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653057#action_12653057 ] Mark Harwood commented on LUCENE-1473: -- The contrib section of Lucene contains an XML

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653049#action_12653049 ] Steven Rowe commented on LUCENE-1390: - bq. What is the likelyhood that a forced upgrad

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Andi Vajda (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653045#action_12653045 ] Andi Vajda commented on LUCENE-1390: This class includes all of ISOLatin1AccentFilte

[jira] Updated: (LUCENE-1316) Avoidable synchronization bottleneck in MatchAlldocsQuery$MatchAllScorer

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1316: --- Fix Version/s: 2.9 > Avoidable synchronization bottleneck in MatchAlldocsQuery$Match

[jira] Updated: (LUCENE-1472) DateTools.stringToDate() can cause lock contention under load

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1472: --- Fix Version/s: 2.9 > DateTools.stringToDate() can cause lock contention under load >

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653038#action_12653038 ] Michael McCandless commented on LUCENE-1470: Uwe, can't you use ExtendedFieldC

[jira] Issue Comment Edited: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653031#action_12653031 ] [EMAIL PROTECTED] edited comment on LUCENE-1390 at 12/3/08 3:00 PM:

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653031#action_12653031 ] Mark Miller commented on LUCENE-1390: - In regards to deprecating ISOLatin1AccentFilter

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Grant Ingersoll
On Dec 3, 2008, at 2:27 PM, Jason Rutherglen (JIRA) wrote: Hoss wrote: "sort of mythical "Lucene powerhouse" Lucene seems to run itself quite differently than other open source Java projects. Perhaps it would be good to spell out the reasons for the reluctance to move ahead with features

[jira] Updated: (LUCENE-1476) BitVector implement DocIdSet

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1476: - Description: BitVector can implement DocIdSet. This is for making SegmentReader.

[jira] Assigned: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller reassigned LUCENE-1390: --- Assignee: Mark Miller > add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter > -

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653016#action_12653016 ] Mark Miller commented on LUCENE-1390: - Hey guys, not sure how soon I can bring some ti

[jira] Updated: (LUCENE-1473) Implement standard Serialization across Lucene versions

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1473: - Description: To maintain serialization compatibility between Lucene versions, serialVers

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653010#action_12653010 ] Robert Muir commented on LUCENE-1390: - thanks guys, just as a comment to whoever is li

[jira] Updated: (LUCENE-1473) Implement standard Serialization across Lucene versions

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1473: - Remaining Estimate: 8h Original Estimate: 8h Summary: Implement stand

[jira] Created: (LUCENE-1477) Pluggable SegmentReader.deletedDocs

2008-12-03 Thread Jason Rutherglen (JIRA)
Pluggable SegmentReader.deletedDocs --- Key: LUCENE-1477 URL: https://issues.apache.org/jira/browse/LUCENE-1477 Project: Lucene - Java Issue Type: Improvement Components: Index Affects Versions:

[jira] Created: (LUCENE-1476) DeletedDocs implement DocIdSet

2008-12-03 Thread Jason Rutherglen (JIRA)
DeletedDocs implement DocIdSet -- Key: LUCENE-1476 URL: https://issues.apache.org/jira/browse/LUCENE-1476 Project: Lucene - Java Issue Type: Improvement Components: Index Affects Versions: 2.4

[jira] Updated: (LUCENE-1476) SegmentReader.DeletedDocs implement DocIdSet

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1476: - Description: SegmentReader.DeletedDocs can implement DocIdSet. Then it can be exposed a

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653001#action_12653001 ] Jason Rutherglen commented on LUCENE-1473: -- The discussion has evolved out of sco

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652965#action_12652965 ] Doug Cutting commented on LUCENE-1473: -- > I'm not sure why you and Doug and focusing

RE: [jira] Resolved: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler
Thanks for committing! As soon as there is a way to simply plug in other field caches, I will implement a FieldCache implementation similar to LongFieldCache, that can handle trie-encoded fields. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: [EMAIL PROTECT

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread robert engels (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652962#action_12652962 ] robert engels commented on LUCENE-1473: --- The reason the XML is not needed, is becaus

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652945#action_12652945 ] Jason Rutherglen commented on LUCENE-1473: -- Robert: > using XML for queries if yo

[jira] Updated: (LUCENE-1475) Expose sub-IndexReaders from MultiReader or MultiSegmentReader

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1475: - Component/s: Index Priority: Minor (was: Major) Affects Versio

[jira] Created: (LUCENE-1475) Expose sub-IndexReaders from MultiReader or MultiSegmentReader

2008-12-03 Thread Jason Rutherglen (JIRA)
Expose sub-IndexReaders from MultiReader or MultiSegmentReader -- Key: LUCENE-1475 URL: https://issues.apache.org/jira/browse/LUCENE-1475 Project: Lucene - Java Issue Type: Improvem

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread robert engels (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652940#action_12652940 ] robert engels commented on LUCENE-1473: --- Jason, you are only partially correct. SOL

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652939#action_12652939 ] Doug Cutting commented on LUCENE-1473: -- The documentation should probably be fixed to

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652932#action_12652932 ] Doug Cutting commented on LUCENE-1473: -- > Doesn't Hadoop handle versioning inside of

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread John Wang
You are right, we can always transmit the string form and re-parse on the other-end. Our problem is that we took this (serialization nature) for granted, and once something is deployed over a cluster, it would be difficult to do partial roll-outs in this case. But I guess there is no immediate reme

[jira] Resolved: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1470. Resolution: Fixed Fix Version/s: 2.9 Lucene Fields: [New, Patch Availabl

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652921#action_12652921 ] Jason Rutherglen commented on LUCENE-1473: -- Mark: "There will be complaints no ma

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652920#action_12652920 ] Jason Rutherglen commented on LUCENE-1473: -- "In regards to Doug's comment about a

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652917#action_12652917 ] Michael McCandless commented on LUCENE-1470: bq. But as the step from 8 to 4 b

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652914#action_12652914 ] Jason Rutherglen commented on LUCENE-1473: -- "This is a hard problem." I disagree

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread John Wang
Doug: My apologies if I came off seeming angry and/or trying to lobby to be a committer. Neither is the case. I am expressing a concern with how patches are being handled with this project, and providing my view point on how this can be better managed. Of course my concern can be eit

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Andi Vajda (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652911#action_12652911 ] Andi Vajda commented on LUCENE-1390: Great, I'll include Robert's change and try to co

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652906#action_12652906 ] Uwe Schindler commented on LUCENE-1470: --- bq. As a baseline, how large is an index wi

[jira] Updated: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1470: -- Attachment: LUCENE-1470.patch I am sorry, one JavaDoc bug in the new static method, has a @lin

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652899#action_12652899 ] Mark Miller commented on LUCENE-1473: - bq. The "implements Serializeable" was added to

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652898#action_12652898 ] Steven Rowe commented on LUCENE-1390: - bq. Steven, I can amend the patch but you said

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652897#action_12652897 ] Michael McCandless commented on LUCENE-1470: Those results are nice, thanks Uw

[jira] Updated: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1470: -- Attachment: LUCENE-1470.patch New Patch, I think this is really ready to commit. Includes Mike

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread robert engels (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652892#action_12652892 ] robert engels commented on LUCENE-1473: --- In regards to Doug's comment about an alter

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652888#action_12652888 ] Doug Cutting commented on LUCENE-1473: -- bq. If it is not meant to be serialized, why

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652882#action_12652882 ] Doug Cutting commented on LUCENE-1473: -- > But, what's now being asked for (expected)

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652878#action_12652878 ] Jason Rutherglen commented on LUCENE-1473: -- > the performance improvement should

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Doug Cutting
John Wang wrote: If you guys need help, maybe you guys should expand your committer list? Committers are added when they've contributed a series of high-quality patches that have been committed, and demonstrated their ability to be easy to work with. Displaying anger is not a good way to bec

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Andi Vajda (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652875#action_12652875 ] Andi Vajda commented on LUCENE-1390: Ah, I see now what you're asking for. Sorry abou

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652872#action_12652872 ] Hoss Man commented on LUCENE-1473: -- For the record: i have limited understanding of java

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652865#action_12652865 ] Paul Elschot commented on LUCENE-1470: -- Uwe, thanks very much for the size and perfor

Re: [jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread Michael McCandless
John Wang wrote: It would have been avoided if the same compiler was used for the release, I took the same compiler (Sun JDK 1.6.0_06) and used the "serialver" tool to compute the SUID for Term.java, and on 2.3.2 it reports "554776219862331599L" for 2.4.0 and "43509097181257L" for 2

[jira] Reopened: (LUCENE-1465) NearSpansOrdered.getPayload does not return the payload from the minimum match span

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reopened LUCENE-1465: Lucene Fields: [New, Patch Available] (was: [Patch Available, New]) Let's backpor

[jira] Updated: (LUCENE-1465) NearSpansOrdered.getPayload does not return the payload from the minimum match span

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1465: --- Lucene Fields: [New, Patch Available] (was: [Patch Available, New]) Fix Version/

Re: [jira] Commented: (LUCENE-1474) Incorrect SegmentInfo.delCount when IndexReader.flush() is used

2008-12-03 Thread Michael McCandless
I agree -- I'll reopen & add 2.4.1 fix version. Can you back port the fix? Mike Mark Miller wrote: The broken getPayloads from nearspanordered should prob make this list. It makes the spans getpayloads stuff pretty much unusable as is (unless you happen to only use nearspanordered with

[jira] Resolved: (LUCENE-1474) Incorrect SegmentInfo.delCount when IndexReader.flush() is used

2008-12-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1474. Resolution: Fixed Committed revision 722964 on 2.4 branch. > Incorrect SegmentInf

[jira] Commented: (LUCENE-1473) Implement Externalizable in main top level searcher classes

2008-12-03 Thread robert engels (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652843#action_12652843 ] robert engels commented on LUCENE-1473: --- I don't see why you can't just break compat

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652834#action_12652834 ] Robert Muir commented on LUCENE-1390: - with regards to transliteration the bgn/pcgn st

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652831#action_12652831 ] Robert Muir commented on LUCENE-1390: - sean... from your link: On 16th May 1992 the La

[jira] Issue Comment Edited: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652791#action_12652791 ] thetaphi edited comment on LUCENE-1470 at 12/3/08 7:20 AM:

[jira] Commented: (LUCENE-1390) add ISOLatinAccentFilter and deprecate ISOLatin1AccentFilter

2008-12-03 Thread Sean Timm (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652808#action_12652808 ] Sean Timm commented on LUCENE-1390: --- >From my brief reading, it seems that "ae" would be

[jira] Resolved: (LUCENE-1246) Missing a null check in BooleanQuery.toString(String)

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller resolved LUCENE-1246. - Resolution: Fixed Lucene Fields: [New, Patch Available] (was: [Patch Available, New]) Th

[jira] Updated: (LUCENE-1246) Missing a null check in BooleanQuery.toString(String)

2008-12-03 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1246: Lucene Fields: [New, Patch Available] (was: [Patch Available, New]) Affects Version/s:

[jira] Commented: (LUCENE-1470) Add TrieRangeQuery to contrib

2008-12-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652791#action_12652791 ] Uwe Schindler commented on LUCENE-1470: --- Just one note: The queries in the comment b

  1   2   >