[jira] Updated: (LUCENE-1692) Contrib analyzers need tests

2009-06-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1692: Attachment: LUCENE-1692.txt adds tests for thaianalyzer token offsets and types, both of which hav

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Sorry, last patch was invalid (did not compile), I forgot to to

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: (was: LUCENE-1693.patch) > AttributeSource/TokenStream API improvements >

Synchronizing Lucene indexes across 2 application servers

2009-06-17 Thread mitu2009
I've a web application which uses Lucene for search functionality. Lucene search requests are served by web services sitting on 2 application servers (IIS 7).The 2 application servers are Load balanced using "netscaler". Both these servers have a batch job running which updates search indexes on

[jira] Commented: (LUCENE-1595) Split DocMaker into ContentSource and DocMaker

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721078#action_12721078 ] Shai Erera commented on LUCENE-1595: bq. I still want to run some tests with the wikip

[jira] Commented: (LUCENE-1628) Persian Analyzer

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721061#action_12721061 ] Mark Miller commented on LUCENE-1628: - Okay, fair enough. I figured you'd know better

[jira] Commented: (LUCENE-1628) Persian Analyzer

2009-06-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721057#action_12721057 ] Robert Muir commented on LUCENE-1628: - mark: thanks for the followup on the licenses!

[jira] Commented: (LUCENE-1696) Added New Token API impl for ASCIIFoldingFilter

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721054#action_12721054 ] Mark Miller commented on LUCENE-1696: - Patch looks good! I'll just hold off till the t

[jira] Commented: (LUCENE-1599) SpanRegexQuery and SpanNearQuery is not working with MultiSearcher

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721052#action_12721052 ] Mark Miller commented on LUCENE-1599: - Well yuck. SpanNearQuery does this clone call

[jira] Commented: (LUCENE-1628) Persian Analyzer

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721048#action_12721048 ] Mark Miller commented on LUCENE-1628: - Looks pretty good. Not sure if we should update

[jira] Commented: (LUCENE-1628) Persian Analyzer

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721044#action_12721044 ] Mark Miller commented on LUCENE-1628: - bq. mark, on the same topic: if possible, at so

[jira] Updated: (LUCENE-1695) Update the Highlighter to use the new TokenStream API

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1695: Attachment: LUCENE-1695.patch Pretty much done, all tests pass. It breaks back compat, but frankly

[jira] Updated: (LUCENE-1595) Split DocMaker into ContentSource and DocMaker

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1595: Attachment: LUCENE-1595.patch Added to changes a bit Removed modification to core Document class u

[jira] Commented: (LUCENE-1700) LogMergePolicy.findMergesToExpungeDeletes need to get deletes from the SegmentReader

2009-06-17 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721024#action_12721024 ] Jason Rutherglen commented on LUCENE-1700: -- Taking a step back, maybe we can solv

[jira] Created: (LUCENE-1700) LogMergePolicy.findMergesToExpungeDeletes need to get deletes from the SegmentReader

2009-06-17 Thread Jason Rutherglen (JIRA)
LogMergePolicy.findMergesToExpungeDeletes need to get deletes from the SegmentReader Key: LUCENE-1700 URL: https://issues.apache.org/jira/browse/LUCENE-1700 Project:

[jira] Updated: (LUCENE-1313) Near Realtime Search

2009-06-17 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1313: - Attachment: LUCENE-1313.patch The patch is cleaned up. A static variable IndexWriter.GLO

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720984#action_12720984 ] Michael Busch commented on LUCENE-1693: --- Go to bed, I'll review later... in meetings

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Small updates, before I go to sleep. This patch removes the incr

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720952#action_12720952 ] Uwe Schindler commented on LUCENE-1693: --- The second test does not work, because it a

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720926#action_12720926 ] Uwe Schindler commented on LUCENE-1693: --- Exactly: The problem is in SinkTokenizer. w

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720919#action_12720919 ] Michael Busch commented on LUCENE-1693: --- Btw: SinkTokenizer in my patch has a small

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720913#action_12720913 ] Michael Busch commented on LUCENE-1693: --- You can probably fix CachingTokenFilter and

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-1693: -- Attachment: TestCompatibility.java Slightly changes tool yields on 2.4 and identically on trun

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Here my solution: The three default methods are now optimized to

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720887#action_12720887 ] Michael Busch commented on LUCENE-1693: --- I'm not convinced yet that we will be able

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Michael Busch wrote: Everyone who is unhappy with the release TODO's, go back in your mail archive to the 2.2 release and check how many tedious little changes we made to improve the release quality. And besides the maven stuff, there is not really more to do compared to pre-2.2, it's just d

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720854#action_12720854 ] Uwe Schindler commented on LUCENE-1693: --- bq. Should I wait to put in the Highlighter

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720849#action_12720849 ] Mark Miller commented on LUCENE-1693: - Should I wait to put in the Highlighter update

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720846#action_12720846 ] Uwe Schindler commented on LUCENE-1693: --- I have a solution to build in some shortcut

[jira] Updated: (LUCENE-1625) openReaderPassed not populated in CheckIndex.Status.SegmentInfoStatus

2009-06-17 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Smith updated LUCENE-1625: -- Attachment: CheckIndex.patch Attached patch for exposing all collected stats (created with svn diff >

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720822#action_12720822 ] Uwe Schindler commented on LUCENE-1693: --- I could change the calling chain: increment

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720820#action_12720820 ] Michael Busch commented on LUCENE-1693: --- {quote} Ah I understand the problem: As I t

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720811#action_12720811 ] Uwe Schindler commented on LUCENE-1693: --- Ah I understand the problem: As I told, if

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720809#action_12720809 ] Uwe Schindler commented on LUCENE-1693: --- The code is almost identical to before, the

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720794#action_12720794 ] Michael Busch commented on LUCENE-1693: --- I'm looking at TokenStream.next(): {code:j

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
+1 Michael On 6/17/09 10:32 AM, Mark Miller wrote: Michael Busch wrote: We should just not put more items in the 2.9 list anymore (except bug fixes of course) and then fix the 30 issues and don't rush them too much. If it takes until end of July I think that's acceptable. A good quality of

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Michael Busch wrote: We should just not put more items in the 2.9 list anymore (except bug fixes of course) and then fix the 30 issues and don't rush them too much. If it takes until end of July I think that's acceptable. A good quality of the release should be highest priority in my opinion.

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
That means the release frequency should not exceed the new-committer frequency. :) On 6/17/09 10:09 AM, Mark Miller wrote: Michael Busch wrote: One?!? I did 2.2, 2.3, 2.3.1, 2.3.2! What can you do ... there was no new guy to relieve you :) ---

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
We should just not put more items in the 2.9 list anymore (except bug fixes of course) and then fix the 30 issues and don't rush them too much. If it takes until end of July I think that's acceptable. A good quality of the release should be highest priority in my opinion. Michael On 6/17/09

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Michael Busch wrote: wanted to get 2.9 out really really soon. really, really is probably not totally accurate. I just know how things can get drawn out. Even still, we have 30 some issues to resolve. If we don't make a drive though, when will 2.9 come out? Next fall at the earliest? Later? S

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Michael Busch wrote: One?!? I did 2.2, 2.3, 2.3.1, 2.3.2! What can you do ... there was no new guy to relieve you :) -- - Mark http://www.lucidimagination.com - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
On 6/17/09 6:23 AM, Mark Miller wrote: I have a special gift in not being clear. I was just saying "be prepared, your turn is coming ;) " But I havn't done a release myself - we don't release that often despite discussion that we should release more often every year or so. I did notice thoug

Re: Lucene 2.9 Again

2009-06-17 Thread Michael Busch
I'm happy to hear that :) I suggested 2-3 weeks to prevent the schedule from being even tighter, as it sounded like you guys wanted to get 2.9 out really really soon. I'm really busy the rest of June and will have much more time for Lucene in July. So if we could wait until end of July before

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720696#action_12720696 ] Robert Muir commented on LUCENE-1692: - michael, ok. I know additional tests here (agai

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720692#action_12720692 ] Shai Erera commented on LUCENE-1693: You can run tokenize.alg which invokes the ReadTo

Re: Lucene 2.9 Again

2009-06-17 Thread Grant Ingersoll
Let's not forget Nutch... Also, for that matter, Mahout uses Lucene's Analysis and Core (in fact, I just committed MAHOUT-126 which allows one to create Vectors from a Lucene index!), although those are just as consumers, I doubt there is a need for Mahout committers to change Lucene. O

Re: Lucene 2.9 Again

2009-06-17 Thread Grant Ingersoll
On Jun 17, 2009, at 10:11 AM, Yonik Seeley wrote: On Wed, Jun 17, 2009 at 8:57 AM, Grant Ingersoll wrote: On Jun 16, 2009, at 7:16 PM, Yonik Seeley wrote: There are parts that aren't strictly part of the release process IMO - things like maven seem optional. -1. Maven support is not op

[jira] Updated: (LUCENE-1673) Move TrieRange to core

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1673: -- Attachment: LUCENE-1673.patch Here some intermediate update... > Move TrieRange to core > ---

Re: Lucene 2.9 Again

2009-06-17 Thread Yonik Seeley
On Wed, Jun 17, 2009 at 8:57 AM, Grant Ingersoll wrote: > On Jun 16, 2009, at 7:16 PM, Yonik Seeley wrote: >> There are parts that aren't strictly part of the release process IMO - >> things like maven seem optional. > > -1.  Maven support is not optional. I can't always follow Lucene closely, but

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
I agree. I'm picturing some hopefully-not-that-distant future when we have a queries "module" and analysis "module" that live quite separately from Lucene & Solr's "core", and committers from both Solr and Lucene would work on it. Mike On Wed, Jun 17, 2009 at 9:01 AM, Grant Ingersoll wrote: > >

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720676#action_12720676 ] Uwe Schindler commented on LUCENE-1693: --- Hi Michael, I did not do any performance te

[jira] Updated: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-1630: --- Attachment: LUCENE-1630.patch * Collector's acceptDocsOutOfOrder is abstract - this was a really goo

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
I have a special gift in not being clear. I was just saying "be prepared, your turn is coming ;) " But I havn't done a release myself - we don't release that often despite discussion that we should release more often every year or so. I did notice though, that Mike did the release right after

Re: New Token API was Re: Payloads and TrieRangeQuery

2009-06-17 Thread Grant Ingersoll
On Jun 15, 2009, at 2:11 PM, Grant Ingersoll wrote: More questions: 1. What about Highlighter and MoreLikeThis? They have not been converted. Also, what are they going to do if the attributes they need are not available? Caveat emptor? 2. Same for TermVectors. What if the user specif

RE: Lucene 2.9 Again

2009-06-17 Thread Uwe Schindler
> Uwe Schindler wrote: > > Maybe Mark helps me and I can do > > it alone the next time, if I have to? :-) > > > Tag team effort ? It will be my first release to, so that would be great ! Ah ok, I interpreted your mail different yesterday (but it was 1 or 2 am in Germany...). Uwe --

Re: Lucene 2.9 Again

2009-06-17 Thread Grant Ingersoll
On Jun 17, 2009, at 4:42 AM, Michael McCandless wrote: I would love to see function queries consolidated between Solr and Lucene! I think it's a prime example of duplicated and then diverged sources between Lucene and Solr... The primary reason it's diverged is it gets a lot of attention on

RE: Lucene 2.9 Again

2009-06-17 Thread Uwe Schindler
> On Jun 16, 2009, at 7:16 PM, Yonik Seeley wrote: > > > On Tue, Jun 16, 2009 at 6:37 PM, Mark Miller > > wrote: > > There are parts that aren't strictly part of the release process IMO - > > things like maven seem optional. > > -1. Maven support is not optional. > > +1 for more automation. Fo

Re: Lucene 2.9 Again

2009-06-17 Thread Mark Miller
Uwe Schindler wrote: Maybe Mark helps me and I can do it alone the next time, if I have to? :-) Tag team effort ? It will be my first release to, so that would be great ! -- - Mark http://www.lucidimagination.com - To

Re: Lucene 2.9 Again

2009-06-17 Thread Grant Ingersoll
On Jun 16, 2009, at 7:16 PM, Yonik Seeley wrote: On Tue, Jun 16, 2009 at 6:37 PM, Mark Miller wrote: There are parts that aren't strictly part of the release process IMO - things like maven seem optional. -1. Maven support is not optional. +1 for more automation. For the record, once set

RE: Lucene 2.9 Again

2009-06-17 Thread Uwe Schindler
I tend also to a little bit later; maybe we need more discussions about NumericField and NumericSortField, especially between the two fractions Mike vs. Yonik :-) After finishing the TokenStream simplification and optimization, I will now again start rewriting of javadocs for trie and hopefully I

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
On Tue, Jun 16, 2009 at 6:06 PM, Michael Busch wrote: > How soon is soon? Code freeze in 2-3 weeks or so maybe? Then 7-10 days > testing, so 2.9 should be out mid July? Sounds reasonable? This schedule might be tight for me... I'm "on vacation" for the week starting Jun 29. Hopefully I can most

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: (was: LUCENE-1693.patch) > AttributeSource/TokenStream API improvements >

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Sorry, small bug in cloning inside next(): the POSToken-test was

[jira] Updated: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1693: -- Attachment: LUCENE-1693.patch Attached is a new patch, that implements the last idea: - There

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720632#action_12720632 ] Shai Erera commented on LUCENE-1630: {quote} You forgot to fill in the "?" in CHAN

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720629#action_12720629 ] Shai Erera commented on LUCENE-1630: bq. You forgot to fill in the "?" in CHANGES I g

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720622#action_12720622 ] Shai Erera commented on LUCENE-1630: It isn't and that's what I expressed in the javad

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720623#action_12720623 ] Michael McCandless commented on LUCENE-1630: Still working through the patch..

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720619#action_12720619 ] Earwin Burrfoot commented on LUCENE-1630: - I wasn't following the issue closely, s

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720612#action_12720612 ] Shai Erera commented on LUCENE-1630: Ok I will change acceptsDocsOutOfOrder on Collect

[jira] Commented: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720607#action_12720607 ] Michael McCandless commented on LUCENE-1630: {quote} bq. Can we make Collector

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720593#action_12720593 ] Michael McCandless commented on LUCENE-1673: bq. Want a convenience method for

[jira] Commented: (LUCENE-1699) Field tokenStream should be usable with stored fields.

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720578#action_12720578 ] Michael McCandless commented on LUCENE-1699: Patch looks good: * Can you m

[jira] Commented: (LUCENE-1673) Move TrieRange to core

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720574#action_12720574 ] Michael McCandless commented on LUCENE-1673: Note that LUCENE-1505 is open for

[jira] Commented: (LUCENE-1505) Remove NumberUtils from spatial contrib

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720572#action_12720572 ] Michael McCandless commented on LUCENE-1505: LUCENE-1496 is "won't fix" becaus

[jira] Updated: (LUCENE-1505) Change contrib/spatial to use trie's NumericUtils, and remove NumberUtils

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1505: --- Fix Version/s: 2.9 Summary: Change contrib/spatial to use trie's NumericUti

[jira] Commented: (LUCENE-1692) Contrib analyzers need tests

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720570#action_12720570 ] Michael McCandless commented on LUCENE-1692: Robert, you should probably also

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720571#action_12720571 ] Michael Busch commented on LUCENE-1693: --- {quote} I am working on that, I have a meet

[jira] Commented: (LUCENE-1516) Integrate IndexReader with IndexWriter

2009-06-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720569#action_12720569 ] Michael McCandless commented on LUCENE-1516: {quote} Currently we check the in

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720568#action_12720568 ] Uwe Schindler commented on LUCENE-1693: --- bq. I think you should try it out and see i

Re: Lucene 2.9 Again

2009-06-17 Thread Simon Willnauer
On Wed, Jun 17, 2009 at 10:42 AM, Michael McCandless wrote: > I would love to see function queries consolidated between Solr and > Lucene!  I think it's a prime example of duplicated and then diverged > sources between Lucene and Solr... > > And it's fabulous that you are "volunteering", Simon ;)  

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720562#action_12720562 ] Michael Busch commented on LUCENE-1693: --- For caching: I guess you would have to impl

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
On Tue, Jun 16, 2009 at 7:16 PM, Yonik Seeley wrote: > On Tue, Jun 16, 2009 at 6:37 PM, Mark Miller wrote: >> I've looked at the release todo wiki and I am still having nightmares. > > Indeed - it's gotten 5 times longer since the last time I did Lucene or Solr. > There are parts that aren't strict

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
On Tue, Jun 16, 2009 at 6:06 PM, Michael Busch wrote: > Cool, seems like Mark is volunteering to be the 2.9 release manager ;) Yay! Mike - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e

Re: Lucene 2.9 Again

2009-06-17 Thread Michael McCandless
I would love to see function queries consolidated between Solr and Lucene! I think it's a prime example of duplicated and then diverged sources between Lucene and Solr... And it's fabulous that you are "volunteering", Simon ;) We have precious few volunteers that stride both communities well eno

Re: madvise(ptr, len, MADV_SEQUENTIAL)

2009-06-17 Thread Michael McCandless
I think readahead would be less interesting to Lucene; while we definitely want a certain amount of readahead (to "amortize" the seeking), too much readahead starts means evicting things from the IO cache. OSs already do a fair job (I think) of some amount of readahead, though if we do gain posix_

Re: Lucene 2.9 Again

2009-06-17 Thread Simon Willnauer
On Tue, Jun 16, 2009 at 11:47 PM, Yonik Seeley wrote: > On Tue, Jun 16, 2009 at 5:38 PM, Simon > Willnauer wrote: >> I was thinking of adding a patch for >> https://issues.apache.org/jira/browse/LUCENE-1085 > > That's *way* too big of an issue and it breaks back compat in Solr (to > change from Sol

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720550#action_12720550 ] Uwe Schindler commented on LUCENE-1693: --- OK, I have a solution: I write a wrapper cl

[jira] Updated: (LUCENE-1630) Mating Collector and Scorer on doc Id orderness

2009-06-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-1630: --- Attachment: LUCENE-1630.patch Fixed most of your comments Mike. I also noticed I did not document C

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720538#action_12720538 ] Michael Busch commented on LUCENE-1693: --- OK, what about this sentence in Token.java:

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720534#action_12720534 ] Uwe Schindler commented on LUCENE-1693: --- Hi Michael, in principle your test is inval

[jira] Issue Comment Edited: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720534#action_12720534 ] Uwe Schindler edited comment on LUCENE-1693 at 6/17/09 12:39 AM: ---

Re: New Token API was Re: Payloads and TrieRangeQuery

2009-06-17 Thread Michael Busch
On 6/15/09 10:10 AM, Grant Ingersoll wrote: But, as Michael M reminded me, it is complex, so please accept my apologies. No worries, Grant! I was not really offended, but rather confused... Thanks for clarifying. Michael

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720530#action_12720530 ] Michael Busch commented on LUCENE-1693: --- But I'll definitely buy Uwe a beer if he co

[jira] Commented: (LUCENE-1693) AttributeSource/TokenStream API improvements

2009-06-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720529#action_12720529 ] Michael Busch commented on LUCENE-1693: --- I don't think we mention subclassing of Tok