[jira] Updated: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1575: --- Attachment: sortCollate5.py sortBench5.py I'm attaching the Python s

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695699#action_12695699 ] Michael McCandless commented on LUCENE-1575: bq. adds one collector.setScorer(

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695700#action_12695700 ] Michael McCandless commented on LUCENE-1575: {quote} BTW, I can change FieldVa

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695701#action_12695701 ] Michael McCandless commented on LUCENE-1575: {quote} How problematic is this b

Re: Future projects

2009-04-04 Thread Michael McCandless
On Fri, Apr 3, 2009 at 3:16 PM, John Wang wrote: > By default bobo DOES use a flavor of the field cache data structure with > some addition information for performance. (e.g. minDocid,maxDocid,freq per > term) > Bobo is architected as a platform where clients can write their own > "FacetHandlers"

Re: Future projects

2009-04-04 Thread Michael McCandless
On Fri, Apr 3, 2009 at 5:32 PM, Jason Rutherglen wrote: >> meaning in Bobo you'd like to manage your own memory resident > field caches, and merge them whenever IW has merged a segment? > Seems like you don't need genealogy for that. > > Agreed, there is no need for full genealogy. OK >> CSF isn

Re: Future projects

2009-04-04 Thread Michael McCandless
On Fri, Apr 3, 2009 at 5:42 PM, Jason Rutherglen wrote: >> I think the realtime reader'd just store the maxDocID it's allowed to >> search, and we would likely keep using the RAM format now used. > > Sounds pretty good.  Are there any other gotchas in the design? Yes: the flushing process becomes

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695711#action_12695711 ] Shai Erera commented on LUCENE-1575: There are no super.XXX calls. The two FVHQ implem

Re: Future projects

2009-04-04 Thread Michael McCandless
On Fri, Apr 3, 2009 at 7:11 PM, Michael Busch wrote: > Yeah me too. I think eventually we want this to be a Codec, but we probably > don't want to wait until all the flexible indexing work is done. > So maybe we should just not worry too much about a perfectly integrated API > yet and release it

Re: Future projects

2009-04-04 Thread Michael McCandless
On Fri, Apr 3, 2009 at 8:01 PM, Jason Rutherglen wrote: > I looked at the IndexWriter code in regards to creating a realtime reader, > with the many flexible indexing classes I'm unsure of how one would get a > frozenish IndexInput of the byte slices, given the byte slices are attached > to differ

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695723#action_12695723 ] Shai Erera commented on LUCENE-1575: bq. OK let's add both and call it out in CHANGES.

[jira] Commented: (LUCENE-1231) Column-stride fields (aka per-document Payloads)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695726#action_12695726 ] Michael McCandless commented on LUCENE-1231: {quote} Eventually we need more f

[jira] Commented: (LUCENE-1585) Allow to control how payloads are merged

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695728#action_12695728 ] Michael McCandless commented on LUCENE-1585: I agree: extensibility to the Seg

[jira] Commented: (LUCENE-1584) Callback for intercepting merging segments in IndexWriter

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695742#action_12695742 ] Michael McCandless commented on LUCENE-1584: I'd like to step back and underst

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695743#action_12695743 ] Michael McCandless commented on LUCENE-1575: {quote} There are no super.XXX ca

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695745#action_12695745 ] Michael McCandless commented on LUCENE-1575: Odd -- inlining super.collect in

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695746#action_12695746 ] Michael McCandless commented on LUCENE-1575: bq. We should also perf. test sor

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695750#action_12695750 ] Michael McCandless commented on LUCENE-1575: Shai can you post your latest pat

[jira] Created: (LUCENE-1586) add IndexReader.getUniqueTermCount

2009-04-04 Thread Michael McCandless (JIRA)
add IndexReader.getUniqueTermCount -- Key: LUCENE-1586 URL: https://issues.apache.org/jira/browse/LUCENE-1586 Project: Lucene - Java Issue Type: Improvement Components: Index Reporter: Mi

[jira] Updated: (LUCENE-1586) add IndexReader.getUniqueTermCount

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1586: --- Attachment: LUCENE-1586.patch Attached patch. I plan to commit in a day or two...

[jira] Updated: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-1575: --- Attachment: LUCENE-1575.7.patch - Changed TermScorer.score() method to not call Similarity.decodeNor

[jira] Commented: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695782#action_12695782 ] Michael McCandless commented on LUCENE-1575: OK thanks. Numbers w/ new patch:

[jira] Updated: (LUCENE-1575) Refactoring Lucene collectors (HitCollector and extensions)

2009-04-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1575: --- Attachment: LUCENE-1575.patch Attached patch; only differences are: * Under contr

[jira] Commented: (LUCENE-1586) add IndexReader.getUniqueTermCount

2009-04-04 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695787#action_12695787 ] Uwe Schindler commented on LUCENE-1586: --- Hi Mike, why not just use getSequentialSubR

[jira] Commented: (LUCENE-1586) add IndexReader.getUniqueTermCount

2009-04-04 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12695788#action_12695788 ] Uwe Schindler commented on LUCENE-1586: --- Sorry, that cannot work, because the segmen