ported lucandra: lucene index on HBase

2010-03-25 Thread Thomas Koch
Hi, Lucandra stores a Lucene index on Cassandra: http://blog.sematext.com/2010/02/09/lucandra-a-cassandra-based-lucene-backend As the author of Lucandra writes: "I’m sure something similar could be built on HBase." So here it is: http://github.com/thkoch2001/lucehbase This is only a first

[jira] Commented: (LUCENE-2215) paging collector

2010-03-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849639#action_12849639 ] Michael McCandless commented on LUCENE-2215: This is a neat collector! I like

Re: Baby steps towards making Lucene's scoring more flexible...

2010-03-25 Thread Michael McCandless
On Mon, Mar 22, 2010 at 12:45 PM, Marvin Humphrey mar...@rectangular.com wrote: On Thu, Mar 18, 2010 at 05:16:23AM -0500, Michael McCandless wrote: Also, will Lucy store the original stats? These? * Total number of tokens in the field. * Number of unique terms in the field. * Doc

[jira] Updated: (LUCENE-2345) Make it possible to subclass SegmentReader

2010-03-25 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Smith updated LUCENE-2345: Attachment: LUCENE-2345_3.0.patch Here's a patch against 3.0 that provides the SegmentReaderFactory

[jira] Commented: (LUCENE-2345) Make it possible to subclass SegmentReader

2010-03-25 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849728#action_12849728 ] Shai Erera commented on LUCENE-2345: bq. The IndexWriter now has a getter and setter

[jira] Commented: (LUCENE-2345) Make it possible to subclass SegmentReader

2010-03-25 Thread Tim Smith (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849731#action_12849731 ] Tim Smith commented on LUCENE-2345: that was my plan Make it possible to subclass

Re: Baby steps towards making Lucene's scoring more flexible...

2010-03-25 Thread Marvin Humphrey
On Thu, Mar 25, 2010 at 06:24:34AM -0400, Michael McCandless wrote: Also, will Lucy store the original stats? These? * Total number of tokens in the field. * Number of unique terms in the field. * Doc boost. * Field boost. Also sum(tf). Robert can generate more :) Hmm,

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-03-25 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849806#action_12849806 ] Jason Rutherglen commented on LUCENE-2324: Michael, I'm guessing this patch

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-03-25 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849808#action_12849808 ] Jason Rutherglen commented on LUCENE-2324: Actually, I just browsed the patch

[jira] Updated: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-03-25 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2324: Attachment: (was: lucene-2324-no-pooling.patch) Per thread DocumentsWriters that write

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-03-25 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849819#action_12849819 ] Michael Busch commented on LUCENE-2324: Hey Jason, Disregard my patch here. I

[jira] Commented: (LUCENE-2215) paging collector

2010-03-25 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849843#action_12849843 ] Grant Ingersoll commented on LUCENE-2215: Mike, don't you think, though, that

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-03-25 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849844#action_12849844 ] Jason Rutherglen commented on LUCENE-2324: Michael, I'm working on a patch and

[jira] Commented: (LUCENE-2215) paging collector

2010-03-25 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849851#action_12849851 ] Uwe Schindler commented on LUCENE-2215: Hey, and I want to fix the NaN thing in

[jira] Commented: (LUCENE-2215) paging collector

2010-03-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849863#action_12849863 ] Michael McCandless commented on LUCENE-2215: bq. ...through a fairly simple

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-03-25 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849899#action_12849899 ] Michael Busch commented on LUCENE-2324: Awesome! Per thread DocumentsWriters that

[jira] Created: (LUCENE-2346) Explore other in-memory postinglist formats for realtime search

2010-03-25 Thread Michael Busch (JIRA)
Explore other in-memory postinglist formats for realtime search --- Key: LUCENE-2346 URL: https://issues.apache.org/jira/browse/LUCENE-2346 Project: Lucene - Java Issue Type:

[jira] Created: (LUCENE-2347) Dump WordNet to SOLR Synonym format

2010-03-25 Thread Bill Bell (JIRA)
Dump WordNet to SOLR Synonym format --- Key: LUCENE-2347 URL: https://issues.apache.org/jira/browse/LUCENE-2347 Project: Lucene - Java Issue Type: New Feature Components: contrib/* Affects

[jira] Updated: (LUCENE-2347) Dump WordNet to SOLR Synonym format

2010-03-25 Thread Bill Bell (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bill Bell updated LUCENE-2347: Attachment: Syns2Solr.java Dump WordNet to SOLR Synonym format

[jira] Commented: (LUCENE-2215) paging collector

2010-03-25 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849961#action_12849961 ] Grant Ingersoll commented on LUCENE-2215: Yeah, but one could make the argument,

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-03-25 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849965#action_12849965 ] Jason Rutherglen commented on LUCENE-2324: I'm a little confused in the

[jira] Commented: (LUCENE-2215) paging collector

2010-03-25 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12850002#action_12850002 ] Shai Erera commented on LUCENE-2215: bq. since I think it's safe to say most

[jira] Created: (LUCENE-2348) DuplicateFilter incorrectly handles multiple calls to getDocIdSet for segment readers

2010-03-25 Thread Trejkaz (JIRA)
DuplicateFilter incorrectly handles multiple calls to getDocIdSet for segment readers - Key: LUCENE-2348 URL: https://issues.apache.org/jira/browse/LUCENE-2348

[jira] Updated: (LUCENE-2348) DuplicateFilter incorrectly handles multiple calls to getDocIdSet for segment readers

2010-03-25 Thread Trejkaz (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Trejkaz updated LUCENE-2348: Component/s: (was: Search) contrib/* Changing to contrib, only just realised it was

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-25 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12850012#action_12850012 ] Robert Muir commented on LUCENE-2323: Committed 927696 (and 927697 for the solr