[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720529#action_12720529 ] Michael Busch commented on LUCENE-1693: --- I don't think we mention subclassing of Tok

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720530#action_12720530 ] Michael Busch commented on LUCENE-1693: --- But I'll definitely buy Uwe a beer if he co

Re: New Token API was Re: Payloads and TrieRangeQuery

2009-06-17 Thread Michael Busch
On 6/15/09 10:10 AM, Grant Ingersoll wrote: But, as Michael M reminded me, it is complex, so please accept my apologies. No worries, Grant! I was not really offended, but rather confused... Thanks for clarifying. Michael

[jira] Issue Comment Edited: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720534#action_12720534 ] Uwe Schindler edited comment on LUCENE-1693 at 6/17/09 12:39 AM: ---

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720534#action_12720534 ] Uwe Schindler commented on LUCENE-1693: --- Hi Michael, in principle your test is inval

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720538#action_12720538 ] Michael Busch commented on LUCENE-1693: --- OK, what about this sentence in Token.java:

[jira] Updated: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-1630: --- Attachment: LUCENE-1630.patch Fixed most of your comments Mike. I also noticed I did not document C

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720550#action_12720550 ] Uwe Schindler commented on LUCENE-1693: --- OK, I have a solution: I write a wrapper cl

Re: Lucene 2.9 Again

2009-06-17 Thread Simon Willnauer
On Tue, Jun 16, 2009 at 11:47 PM, Yonik Seeley wrote: > On Tue, Jun 16, 2009 at 5:38 PM, Simon > Willnauer wrote: >> I was thinking of adding a patch for >> https://issues.apache.org/jira/browse/LUCENE-1085 > > That's *way* too big of an issue and it breaks back compat in Solr (to > change from Sol

Re: madvise(ptr, len, MADV_SEQUENTIAL)

2009-06-17 Thread Michael McCandless
I think readahead would be less interesting to Lucene; while we definitely want a certain amount of readahead (to "amortize" the seeking), too much readahead means evicting things from the IO cache. OSs already do a fair job (I think) of some amount of readahead, though if we do gain posix_

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
I would love to see function queries consolidated between Solr and Lucene! I think it's a prime example of duplicated and then diverged sources between Lucene and Solr... And it's fabulous that you are "volunteering", Simon ;) We have precious few volunteers that stride both communities well eno

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
On Tue, Jun 16, 2009 at 6:06 PM, Michael Busch wrote: > Cool, seems like Mark is volunteering to be the 2.9 release manager ;) Yay! Mike - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
On Tue, Jun 16, 2009 at 7:16 PM, Yonik Seeley wrote: > On Tue, Jun 16, 2009 at 6:37 PM, Mark Miller wrote: >> I've looked at the release todo wiki and I am still having nightmares. > > Indeed - it's gotten 5 times longer since the last time I did Lucene or Solr. > There are parts that aren't strict

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720562#action_12720562 ] Michael Busch commented on LUCENE-1693: --- For caching: I guess you would have to impl

Re: Lucene 2.9 Again

2009-06-17 Thread Simon Willnauer
On Wed, Jun 17, 2009 at 10:42 AM, Michael McCandless wrote: > I would love to see function queries consolidated between Solr and > Lucene!  I think it's a prime example of duplicated and then diverged > sources between Lucene and Solr... > > And it's fabulous that you are "volunteering", Simon ;)  

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720568#action_12720568 ] Uwe Schindler commented on LUCENE-1693: --- bq. I think you should try it out and see i

[jira] Commented: (LUCENE-1516) Integrate IndexReader with IndexWriter

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720569#action_12720569 ] Michael McCandless commented on LUCENE-1516: {quote} Currently we check the in

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720571#action_12720571 ] Michael Busch commented on LUCENE-1693: --- {quote} I am working on that, I have a meet

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720570#action_12720570 ] Michael McCandless commented on LUCENE-1692: Robert, you should probably also

[jira] Updated: (LUCENE-1505) Change contrib/spatial to use trie's NumericUtils, and remove NumberUtils

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1505: --- Fix Version/s: 2.9 Summary: Change contrib/spatial to use trie's NumericUti

[jira] Commented: (LUCENE-1505) Remove NumberUtils from spatial contrib

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720572#action_12720572 ] Michael McCandless commented on LUCENE-1505: LUCENE-1496 is "won't fix" becaus

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720574#action_12720574 ] Michael McCandless commented on LUCENE-1673: Note that LUCENE-1505 is open for

[jira] Commented: (LUCENE-1699) Field tokenStream should be usable with stored fields.

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720578#action_12720578 ] Michael McCandless commented on LUCENE-1699: Patch looks good: * Can you m

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720593#action_12720593 ] Michael McCandless commented on LUCENE-1673: bq. Want a convenience method for

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720607#action_12720607 ] Michael McCandless commented on LUCENE-1630: {quote} bq. Can we make Collector

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720612#action_12720612 ] Shai Erera commented on LUCENE-1630: Ok I will change acceptsDocsOutOfOrder on Collect

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720619#action_12720619 ] Earwin Burrfoot commented on LUCENE-1630: - I wasn't following the issue closely, s

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720623#action_12720623 ] Michael McCandless commented on LUCENE-1630: Still working through the patch..

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720622#action_12720622 ] Shai Erera commented on LUCENE-1630: It isn't and that's what I expressed in the javad

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720629#action_12720629 ] Shai Erera commented on LUCENE-1630: bq. You forgot to fill in the "?" in CHANGES I g

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720632#action_12720632 ] Shai Erera commented on LUCENE-1630: {quote} You forgot to fill in the "?" in CHAN

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Attached is a new patch, that implements the last idea: - There

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Sorry, small bug in cloning inside next(): the POSToken-test was

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: (was: LUCENE-1693.patch) > AttributeSource/TokenStream API improvements >

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
On Tue, Jun 16, 2009 at 6:06 PM, Michael Busch wrote: > How soon is soon? Code freeze in 2-3 weeks or so maybe? Then 7-10 days > testing, so 2.9 should be out mid July? Sounds reasonable? This schedule might be tight for me... I'm "on vacation" for the week starting Jun 29. Hopefully I can most

RE: Lucene 2.9 Again

2009-06-17 Thread Uwe Schindler
I also lean toward a little bit later; maybe we need more discussions about NumericField and NumericSortField, especially between the two factions Mike vs. Yonik :-) After finishing the TokenStream simplification and optimization, I will now resume rewriting the javadocs for trie and hopefully I

Re: Lucene 2.9 Again

2009-06-17 Thread Grant Ingersoll
On Jun 16, 2009, at 7:16 PM, Yonik Seeley wrote: On Tue, Jun 16, 2009 at 6:37 PM, Mark Miller wrote: There are parts that aren't strictly part of the release process IMO - things like maven seem optional. -1. Maven support is not optional. +1 for more automation. For the record, once set

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Uwe Schindler wrote: Maybe Mark helps me and I can do it alone the next time, if I have to? :-) Tag team effort? It will be my first release too, so that would be great! -- - Mark http://www.lucidimagination.com

RE: Lucene 2.9 Again

2009-06-17 Thread Uwe Schindler
> On Jun 16, 2009, at 7:16 PM, Yonik Seeley wrote: > > > On Tue, Jun 16, 2009 at 6:37 PM, Mark Miller > > wrote: > > There are parts that aren't strictly part of the release process IMO - > > things like maven seem optional. > > -1. Maven support is not optional. > > +1 for more automation. Fo

Re: Lucene 2.9 Again

2009-06-17 Thread Grant Ingersoll
On Jun 17, 2009, at 4:42 AM, Michael McCandless wrote: I would love to see function queries consolidated between Solr and Lucene! I think it's a prime example of duplicated and then diverged sources between Lucene and Solr... The primary reason it's diverged is it gets a lot of attention on

RE: Lucene 2.9 Again

2009-06-17 Thread Uwe Schindler
> Uwe Schindler wrote: > > Maybe Mark helps me and I can do > > it alone the next time, if I have to? :-) > > > Tag team effort? It will be my first release too, so that would be great! Ah ok, I interpreted your mail differently yesterday (but it was 1 or 2 am in Germany...). Uwe --

Re: New Token API was Re: Payloads and TrieRangeQuery

2009-06-17 Thread Grant Ingersoll
On Jun 15, 2009, at 2:11 PM, Grant Ingersoll wrote: More questions: 1. What about Highlighter and MoreLikeThis? They have not been converted. Also, what are they going to do if the attributes they need are not available? Caveat emptor? 2. Same for TermVectors. What if the user specif

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
I have a special gift for not being clear. I was just saying "be prepared, your turn is coming ;)" But I haven't done a release myself - we don't release that often despite discussion that we should release more often every year or so. I did notice though, that Mike did the release right after

[jira] Updated: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-1630: --- Attachment: LUCENE-1630.patch * Collector's acceptDocsOutOfOrder is abstract - this was a really goo

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720676#action_12720676 ] Uwe Schindler commented on LUCENE-1693: --- Hi Michael, I did not do any performance te

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
I agree. I'm picturing some hopefully-not-that-distant future when we have a queries "module" and analysis "module" that live quite separately from Lucene & Solr's "core", and committers from both Solr and Lucene would work on it. Mike On Wed, Jun 17, 2009 at 9:01 AM, Grant Ingersoll wrote: > >

Re: Lucene 2.9 Again

2009-06-17 Thread Yonik Seeley
On Wed, Jun 17, 2009 at 8:57 AM, Grant Ingersoll wrote: > On Jun 16, 2009, at 7:16 PM, Yonik Seeley wrote: >> There are parts that aren't strictly part of the release process IMO - >> things like maven seem optional. > > -1.  Maven support is not optional. I can't always follow Lucene closely, but

[jira] Updated: (LUCENE-1673) Move TrieRange to core

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1673: -- Attachment: LUCENE-1673.patch Here some intermediate update... > Move TrieRange to core > ---

Re: Lucene 2.9 Again

2009-06-17 Thread Grant Ingersoll
On Jun 17, 2009, at 10:11 AM, Yonik Seeley wrote: On Wed, Jun 17, 2009 at 8:57 AM, Grant Ingersoll wrote: On Jun 16, 2009, at 7:16 PM, Yonik Seeley wrote: There are parts that aren't strictly part of the release process IMO - things like maven seem optional. -1. Maven support is not op

Re: Lucene 2.9 Again

2009-06-17 Thread Grant Ingersoll
Let's not forget Nutch... Also, for that matter, Mahout uses Lucene's Analysis and Core (in fact, I just committed MAHOUT-126 which allows one to create Vectors from a Lucene index!), although those are just as consumers, I doubt there is a need for Mahout committers to change Lucene. O

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720692#action_12720692 ] Shai Erera commented on LUCENE-1693: You can run tokenize.alg which invokes the ReadTo

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720696#action_12720696 ] Robert Muir commented on LUCENE-1692: - michael, ok. I know additional tests here (agai

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
I'm happy to hear that :) I suggested 2-3 weeks to prevent the schedule from being even tighter, as it sounded like you guys wanted to get 2.9 out really really soon. I'm really busy the rest of June and will have much more time for Lucene in July. So if we could wait until end of July before

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
On 6/17/09 6:23 AM, Mark Miller wrote: I have a special gift for not being clear. I was just saying "be prepared, your turn is coming ;)" But I haven't done a release myself - we don't release that often despite discussion that we should release more often every year or so. I did notice thoug

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Michael Busch wrote: One?!? I did 2.2, 2.3, 2.3.1, 2.3.2! What can you do ... there was no new guy to relieve you :) -- - Mark http://www.lucidimagination.com

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Michael Busch wrote: wanted to get 2.9 out really really soon. really, really is probably not totally accurate. I just know how things can get drawn out. Even still, we have 30 some issues to resolve. If we don't make a drive though, when will 2.9 come out? Next fall at the earliest? Later? S

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
We should just not put more items in the 2.9 list anymore (except bug fixes of course) and then fix the 30 issues and don't rush them too much. If it takes until end of July I think that's acceptable. A good quality of the release should be highest priority in my opinion. Michael On 6/17/09

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
That means the release frequency should not exceed the new-committer frequency. :) On 6/17/09 10:09 AM, Mark Miller wrote: Michael Busch wrote: One?!? I did 2.2, 2.3, 2.3.1, 2.3.2! What can you do ... there was no new guy to relieve you :) ---

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Michael Busch wrote: We should just not put more items in the 2.9 list anymore (except bug fixes of course) and then fix the 30 issues and don't rush them too much. If it takes until end of July I think that's acceptable. A good quality of the release should be highest priority in my opinion.

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
+1 Michael On 6/17/09 10:32 AM, Mark Miller wrote: Michael Busch wrote: We should just not put more items in the 2.9 list anymore (except bug fixes of course) and then fix the 30 issues and don't rush them too much. If it takes until end of July I think that's acceptable. A good quality of

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720794#action_12720794 ] Michael Busch commented on LUCENE-1693: --- I'm looking at TokenStream.next(): {code:j

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720809#action_12720809 ] Uwe Schindler commented on LUCENE-1693: --- The code is almost identical to before, the

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720811#action_12720811 ] Uwe Schindler commented on LUCENE-1693: --- Ah I understand the problem: As I told, if

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720820#action_12720820 ] Michael Busch commented on LUCENE-1693: --- {quote} Ah I understand the problem: As I t

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720822#action_12720822 ] Uwe Schindler commented on LUCENE-1693: --- I could change the calling chain: increment

[jira] Updated: (LUCENE-1625) openReaderPassed not populated in CheckIndex.Status.SegmentInfoStatus

2009-06-17 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Smith updated LUCENE-1625: -- Attachment: CheckIndex.patch Attached patch for exposing all collected stats (created with svn diff >

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720846#action_12720846 ] Uwe Schindler commented on LUCENE-1693: --- I have a solution to build in some shortcut

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720849#action_12720849 ] Mark Miller commented on LUCENE-1693: - Should I wait to put in the Highlighter update

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720854#action_12720854 ] Uwe Schindler commented on LUCENE-1693: --- bq. Should I wait to put in the Highlighter

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Michael Busch wrote: Everyone who is unhappy with the release TODO's, go back in your mail archive to the 2.2 release and check how many tedious little changes we made to improve the release quality. And besides the maven stuff, there is not really more to do compared to pre-2.2, it's just d

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720887#action_12720887 ] Michael Busch commented on LUCENE-1693: --- I'm not convinced yet that we will be able

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Here my solution: The three default methods are now optimized to

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-1693: -- Attachment: TestCompatibility.java Slightly changes tool yields on 2.4 and identically on trun

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720913#action_12720913 ] Michael Busch commented on LUCENE-1693: --- You can probably fix CachingTokenFilter and

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720919#action_12720919 ] Michael Busch commented on LUCENE-1693: --- Btw: SinkTokenizer in my patch has a small

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720926#action_12720926 ] Uwe Schindler commented on LUCENE-1693: --- Exactly: The problem is in SinkTokenizer. w

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720952#action_12720952 ] Uwe Schindler commented on LUCENE-1693: --- The second test does not work, because it a

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Small updates, before I go to sleep. This patch removes the incr

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720984#action_12720984 ] Michael Busch commented on LUCENE-1693: --- Go to bed, I'll review later... in meetings

[jira] Updated: (LUCENE-1313) Near Realtime Search

2009-06-17 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1313: - Attachment: LUCENE-1313.patch The patch is cleaned up. A static variable IndexWriter.GLO

[jira] Created: (LUCENE-1700) LogMergePolicy.findMergesToExpungeDeletes need to get deletes from the SegmentReader

2009-06-17 Thread Jason Rutherglen (JIRA)
LogMergePolicy.findMergesToExpungeDeletes need to get deletes from the SegmentReader Key: LUCENE-1700 URL: https://issues.apache.org/jira/browse/LUCENE-1700 Project:

[jira] Commented: (LUCENE-1700) LogMergePolicy.findMergesToExpungeDeletes need to get deletes from the SegmentReader

2009-06-17 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721024#action_12721024 ] Jason Rutherglen commented on LUCENE-1700: -- Taking a step back, maybe we can solv

[jira] Updated: (LUCENE-1595) Split DocMaker into ContentSource and DocMaker

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1595: Attachment: LUCENE-1595.patch Added to changes a bit Removed modification to core Document class u

[jira] Updated: (LUCENE-1695) Update the Highlighter to use the new TokenStream API

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1695: Attachment: LUCENE-1695.patch Pretty much done, all tests pass. It breaks back compat, but frankly

[jira] Commented: (LUCENE-1628) Persian Analyzer

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721044#action_12721044 ] Mark Miller commented on LUCENE-1628: - bq. mark, on the same topic: if possible, at so

[jira] Commented: (LUCENE-1628) Persian Analyzer

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721048#action_12721048 ] Mark Miller commented on LUCENE-1628: - Looks pretty good. Not sure if we should update

[jira] Commented: (LUCENE-1599) SpanRegexQuery and SpanNearQuery is not working with MultiSearcher

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721052#action_12721052 ] Mark Miller commented on LUCENE-1599: - Well yuck. SpanNearQuery does this clone call

[jira] Commented: (LUCENE-1696) Added New Token API impl for ASCIIFoldingFilter

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721054#action_12721054 ] Mark Miller commented on LUCENE-1696: - Patch looks good! I'll just hold off till the t

[jira] Commented: (LUCENE-1628) Persian Analyzer

2009-06-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721057#action_12721057 ] Robert Muir commented on LUCENE-1628: - mark: thanks for the followup on the licenses!

[jira] Commented: (LUCENE-1628) Persian Analyzer

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721061#action_12721061 ] Mark Miller commented on LUCENE-1628: - Okay, fair enough. I figured you'd know better

[jira] Commented: (LUCENE-1595) Split DocMaker into ContentSource and DocMaker

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721078#action_12721078 ] Shai Erera commented on LUCENE-1595: bq. I still want to run some tests with the wikip

Synchronizing Lucene indexes across 2 application servers

2009-06-17 Thread mitu2009
I have a web application which uses Lucene for search functionality. Lucene search requests are served by web services sitting on 2 application servers (IIS 7). The 2 application servers are load balanced using "netscaler". Both these servers have a batch job running which updates search indexes on

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: (was: LUCENE-1693.patch) > AttributeSource/TokenStream API improvements >

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Sorry, last patch was invalid (did not compile), I forgot to

[jira] Updated: (LUCENE-1692) Contrib analyzers need tests

2009-06-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1692: Attachment: LUCENE-1692.txt adds tests for thaianalyzer token offsets and types, both of which hav