[jira] Commented: (LUCENE-1516) Integrate IndexReader with IndexWriter

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695306#action_12695306 ] Michael McCandless commented on LUCENE-1516: Good catch! I'll fix. > Integra

Re: IndexWriter.addIndexesNoOptimize(IndexReader[] readers)

2009-04-03 Thread Michael McCandless
Makes sense. Wanna make a patch? We'd then deprecate addIndexes(IndexReader[]). Mike On Thu, Apr 2, 2009 at 9:16 PM, Jason Rutherglen wrote: > This seems like something that's tenable?  It would be useful for merging > ram indexes to disk where if a directory is passed, the directory may be >

[jira] Commented: (LUCENE-1584) Callback for intercepting merging segments in IndexWriter

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695318#action_12695318 ] Michael McCandless commented on LUCENE-1584: I think this can be achieved, tod

Re: Lucene filter

2009-04-03 Thread Michael McCandless
Could you re-ask this on java-user, instead? Thanks. Mike On Thu, Apr 2, 2009 at 6:24 PM, addman wrote: > > How do you create a Lucene Filter to check if a field has a value?  It is > part for a ChainedFilter that I am creating. > -- > View this message in context: > http://www.nabble.com/Luce

Re: Future projects

2009-04-03 Thread Michael McCandless
On Thu, Apr 2, 2009 at 5:56 PM, Jason Rutherglen wrote: >> I think I need to understand better why delete by Query isn't > viable in your situation... > > The delete by query is a separate problem which I haven't fully > explored yet. Oh, I had thought we were tugging on this thread in order to e

Re: Future projects

2009-04-03 Thread Michael McCandless
On Thu, Apr 2, 2009 at 6:55 PM, John Wang wrote: > Just to clarify, Approach 1 and approach 2 are both currently performing ok > currently for us. OK that's very good to know. Mike - To unsubscribe, e-mail: java-dev-unsubscr...

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695338#action_12695338 ] Shai Erera commented on LUCENE-1575: I've been thinking about TimeLimitedCollector and

[jira] Updated: (LUCENE-1582) Make TrieRange completely independent from Document/Field with TokenStream of prefix encoded values

2009-04-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1582: -- Attachment: LUCENE-1582.patch > Make TrieRange completely independent from Document/Field with

[jira] Commented: (LUCENE-1582) Make TrieRange completely independent from Document/Field with TokenStream of prefix encoded values

2009-04-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695341#action_12695341 ] Uwe Schindler commented on LUCENE-1582: --- A first version of the patch: - JavaDocs no

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695358#action_12695358 ] Michael McCandless commented on LUCENE-1575: bq. If you agree with any of the

[jira] Updated: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1575: --- Attachment: LUCENE-1575.patch OK, I attached a new patch with some minor changes:

[jira] Commented: (LUCENE-1582) Make TrieRange completely independent from Document/Field with TokenStream of prefix encoded values

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695362#action_12695362 ] Michael McCandless commented on LUCENE-1582: bq. Maybe this should be provides

[jira] Commented: (LUCENE-1582) Make TrieRange completely independent from Document/Field with TokenStream of prefix encoded values

2009-04-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695364#action_12695364 ] Uwe Schindler commented on LUCENE-1582: --- bq. Hmm, we should do some perf tests to se

[jira] Commented: (LUCENE-1341) BoostingNearQuery class (prototype)

2009-04-03 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695370#action_12695370 ] Grant Ingersoll commented on LUCENE-1341: - Hi Peter, This looks good, I think it

[jira] Commented: (LUCENE-1582) Make TrieRange completely independent from Document/Field with TokenStream of prefix encoded values

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695400#action_12695400 ] Michael McCandless commented on LUCENE-1582: bq. I can add a setValue() method

[jira] Commented: (LUCENE-1539) Improve Benchmark

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695462#action_12695462 ] Michael McCandless commented on LUCENE-1539: This patch looks good -- some que

[jira] Assigned: (LUCENE-1539) Improve Benchmark

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-1539: -- Assignee: Michael McCandless > Improve Benchmark > - > >

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695478#action_12695478 ] Shai Erera commented on LUCENE-1575: bq. I like "TimeLimitingCollector", or maybe "Tim

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695513#action_12695513 ] Michael McCandless commented on LUCENE-1575: bq. I like TimeLimitingCollector

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695523#action_12695523 ] Shai Erera commented on LUCENE-1575: bq. Can't the scoring collector impls in TopField

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695525#action_12695525 ] Shai Erera commented on LUCENE-1575: BTW Mike - I think the accidental changes to Sear

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695537#action_12695537 ] Michael McCandless commented on LUCENE-1575: I ran a "first do no harm" perf

Re: Future projects

2009-04-03 Thread John Wang
By default bobo DOES use a flavor of the field cache data structure with some addition information for performance. (e.g. minDocid,maxDocid,freq per term) Bobo is architected as a platform where clients can write their own "FacetHandlers" in which each FacetHandler manages its own view of memory st

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695541#action_12695541 ] Michael McCandless commented on LUCENE-1575: {quote} > Can't the scoring colle

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695543#action_12695543 ] Michael McCandless commented on LUCENE-1575: bq. BTW Mike - I think the accide

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695553#action_12695553 ] Jason Rutherglen commented on LUCENE-1575: -- Something related to time limiting co

[jira] Updated: (LUCENE-1582) Make TrieRange completely independent from Document/Field with TokenStream of prefix encoded values

2009-04-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1582: -- Attachment: LUCENE-1582.patch > Make TrieRange completely independent from Document/Field with

[jira] Commented: (LUCENE-1582) Make TrieRange completely independent from Document/Field with TokenStream of prefix encoded values

2009-04-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695554#action_12695554 ] Uwe Schindler commented on LUCENE-1582: --- Updated patch: - supports a setValue() to r

[jira] Commented: (LUCENE-1584) Callback for intercepting merging segments in IndexWriter

2009-04-03 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695557#action_12695557 ] Jason Rutherglen commented on LUCENE-1584: -- I would like to move away from our cu

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695575#action_12695575 ] Shai Erera commented on LUCENE-1575: How do I run such a test? Is there an algorithm

[jira] Commented: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-04-03 Thread Jeremy Volkman (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695577#action_12695577 ] Jeremy Volkman commented on LUCENE-1483: I'm trying to create a FieldValueHitQueue

[jira] Commented: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-04-03 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695586#action_12695586 ] Shai Erera commented on LUCENE-1483: Hi Jeremy This will be taken care of in 1575 by

[jira] Commented: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-04-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695587#action_12695587 ] Uwe Schindler commented on LUCENE-1483: --- This will be changed as part of LUCENE-1575

Re: Future projects

2009-04-03 Thread Jason Rutherglen
> meaning in Bobo you'd like to manage your own memory resident field caches, and merge them whenever IW has merged a segment? Seems like you don't need genealogy for that. Agreed, there is no need for full genealogy. > CSF isn't really designed yet. How come it can't be used with Bobo's field ca

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695595#action_12695595 ] Shai Erera commented on LUCENE-1575: BTW, I can change FieldValueHitQueue like I chang

Re: Future projects

2009-04-03 Thread Jason Rutherglen
> I think the realtime reader'd just store the maxDocID it's allowed to search, and we would likely keep using the RAM format now used. Sounds pretty good. Are there any other gotchas in the design? On Thu, Apr 2, 2009 at 1:40 AM, Michael McCandless < luc...@mikemccandless.com> wrote: > On Wed

Re: Future projects

2009-04-03 Thread Michael Busch
On 4/3/09 3:35 AM, Michael McCandless wrote: It seems like we've been talking about CSF for 2 years and there isn't a patch for it? If I had more time I'd take a look. What is the status of it? I think Michael is looking into it? I'd really like to get it into 2.9. We should do it in co

Re: Future projects

2009-04-03 Thread Jason Rutherglen
I looked at the IndexWriter code in regards to creating a realtime reader, with the many flexible indexing classes I'm unsure of how one would get a frozenish IndexInput of the byte slices, given the byte slices are attached to different threads? On Fri, Apr 3, 2009 at 2:42 PM, Jason Rutherglen w

[jira] Commented: (LUCENE-1231) Column-stride fields (aka per-document Payloads)

2009-04-03 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695666#action_12695666 ] Michael Busch commented on LUCENE-1231: --- For the search side we need an API similar

[jira] Created: (LUCENE-1585) Allow to control how payloads are merged

2009-04-03 Thread Michael Busch (JIRA)
Allow to control how payloads are merged Key: LUCENE-1585 URL: https://issues.apache.org/jira/browse/LUCENE-1585 Project: Lucene - Java Issue Type: New Feature Components: Index

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695674#action_12695674 ] Shai Erera commented on LUCENE-1575: Mike - about your comments on the new Searcher an

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-03 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695677#action_12695677 ] Shai Erera commented on LUCENE-1575: Oh wait .. I should have tried to implement it be