Re: Best Practices for getting Strings from a position range

2007-08-07 Thread Grant Ingersoll
Hi Peter, Give https://issues.apache.org/jira/browse/LUCENE-975 a try. It provides a TermVectorMapper that loads by position. Still not what ideally what you want, but I haven't had time to scope that one out yet., -Grant On Jul 24, 2007, at 6:02 PM, Peter Keegan wrote: Hi Grant, No

[jira] Updated: (LUCENE-975) Position based TermVectorMapper

2007-08-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated LUCENE-975: --- Attachment: LUCENE-975.patch First attempt at a position based TermVectorMapper. I believe i

[jira] Created: (LUCENE-975) Position based TermVectorMapper

2007-08-07 Thread Grant Ingersoll (JIRA)
Position based TermVectorMapper --- Key: LUCENE-975 URL: https://issues.apache.org/jira/browse/LUCENE-975 Project: Lucene - Java Issue Type: New Feature Components: Store Reporter: Grant Inge

[jira] Commented: (LUCENE-550) InstantiatedIndex - faster but memory consuming index

2007-08-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518291 ] Grant Ingersoll commented on LUCENE-550: Hey Karl, I started to look at this, but there are a few stoppers a

[jira] Updated: (LUCENE-974) Remove Author tags from code

2007-08-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated LUCENE-974: --- Attachment: LUCENE-974.patch Remove all @author tags > Remove Author tags from code > --

Re: Deprecating IndexModifier

2007-08-07 Thread Grant Ingersoll
+1 On Aug 7, 2007, at 3:37 PM, Ning Li wrote: With the plan towards 3.0 release laid out, I think it's a good time to deprecate IndexModifier and eventually remove IndexModifier. The only method in IndexModifier which is not implemented in IndexWriter is "deleteDocument(int doc)". This is bec

[jira] Assigned: (LUCENE-550) InstantiatedIndex - faster but memory consuming index

2007-08-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned LUCENE-550: -- Assignee: Grant Ingersoll (was: Karl Wettin) > InstantiatedIndex - faster but memory c

TREC Collection, NIST and Lucene

2007-08-07 Thread Grant Ingersoll
DISCLAIMER: Just to be clear, what follows is my personal opinion and in no way, shape or form reflects an official position from the Lucene project: So, now that we have all this great stuff for running TREC experiments in contrib/benchmark, I am wondering if people think it would be use

[jira] Created: (LUCENE-974) Remove Author tags from code

2007-08-07 Thread Grant Ingersoll (JIRA)
Remove Author tags from code Key: LUCENE-974 URL: https://issues.apache.org/jira/browse/LUCENE-974 Project: Lucene - Java Issue Type: Wish Reporter: Grant Ingersoll Assignee: Grant Ingerso

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518263 ] Michael McCandless commented on LUCENE-847: --- > I'm thinking maybe a MergePolicy#useCompoundDocStore( Segmen

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518251 ] Steven Parkes commented on LUCENE-847: -- Ah. I understand better now. I have to admit, I haven't kept up to date

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518237 ] Michael McCandless commented on LUCENE-847: --- > Looking at IW, with the new DocsWriter stuff, it looks like

Deprecating IndexModifier

2007-08-07 Thread Ning Li
With the plan towards 3.0 release laid out, I think it's a good time to deprecate IndexModifier and eventually remove IndexModifier. The only method in IndexModifier which is not implemented in IndexWriter is "deleteDocument(int doc)". This is because of the concern that document ids are changing

Hudson build is back to normal: Lucene-Nightly #179

2007-08-07 Thread hudson
See http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/179/changes - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518222 ] Steven Parkes commented on LUCENE-847: -- On a related note, Mike, there a few FIXME's in IW related to useCompoun

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518210 ] Steven Parkes commented on LUCENE-847: -- Is the separate IndexMerger interface really necessary? I wrestled

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518186 ] Michael McCandless commented on LUCENE-847: --- > > I think we ideally would like concurrency to be fully inde

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518184 ] Michael McCandless commented on LUCENE-847: --- Some more feedback: - Is the separate IndexMerger interfac

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518165 ] Steven Parkes commented on LUCENE-847: -- I think we ideally would like concurrency to be fully in

[jira] Commented: (LUCENE-847) Factor merge policy out of IndexWriter

2007-08-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518162 ] Michael McCandless commented on LUCENE-847: --- This looks great Steve! More specific feeedback soon, but ...

[jira] Updated: (LUCENE-973) Token of "" returns in CJK

2007-08-07 Thread Toru Matsuzawa (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Toru Matsuzawa updated LUCENE-973: -- Attachment: CJKTokenizer20070807.patch patch attached. > Token of "" returns in CJK > ---

[jira] Created: (LUCENE-973) Token of "" returns in CJK

2007-08-07 Thread Toru Matsuzawa (JIRA)
Token of "" returns in CJK --- Key: LUCENE-973 URL: https://issues.apache.org/jira/browse/LUCENE-973 Project: Lucene - Java Issue Type: Bug Components: Analysis Affects Versions: 2.3 Reporte