[jira] Commented: (LUCENE-743) IndexReader.reopen()

2007-08-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520476 ] Michael Busch commented on LUCENE-743: -- > IndexReader a, b, c = ... > MultiReader ab = new MultiReader({a, b

[jira] Commented: (LUCENE-983) Enable IndexReader to merge tail segments on demand, in RAM, when opening

2007-08-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520480 ] Michael Busch commented on LUCENE-983: -- I like this idea. Merging small segments in memory is probably fast, and

[jira] Commented: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-17 Thread Ian Boston (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520499 ] Ian Boston commented on LUCENE-871: --- My pleasure, for once to give something back to Lucene :) FYI, this is used fo

Re: TermVectorsWriter and DocumentsWriter

2007-08-17 Thread Michael McCandless
Hi Grant, > I am wondering if TermVectorsWriter is still used? It doesn't seem > to be, at least not any of its methods (some of the constants still > are, either that or my IDE is not properly finding method calls or I > am too bleary-eyed at the moment). It seems to be replaced by the > writeV

Re: TermVectorsWriter and DocumentsWriter

2007-08-17 Thread karl wettin
17 aug 2007 kl. 11.51 skrev Michael McCandless: One thing I have been wondering is whether it really is necessary to sort the term vectors before writing to the index it is necessary for backwards compatibility. But, with the new TermVectorMapper, if an application is going to sort by freq

Re: TermVectorsWriter and DocumentsWriter

2007-08-17 Thread Doug Cutting
Michael McCandless wrote: One thing I have been wondering is whether it really is necessary to sort the term vectors before writing to the index Terms in vectors are prefix-compressed. So not sorting would make indexes bigger, and slower to read & write. http://lucene.apache.org/java/do

[jira] Resolved: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-871. --- Resolution: Fixed Lucene Fields: [New, Patch Available] (was: [Patch Available

[jira] Created: (LUCENE-984) remove TermVectorsWriter (it's no longer used)

2007-08-17 Thread Michael McCandless (JIRA)
remove TermVectorsWriter (it's no longer used) -- Key: LUCENE-984 URL: https://issues.apache.org/jira/browse/LUCENE-984 Project: Lucene - Java Issue Type: Bug Components: Index Affect

[jira] Commented: (LUCENE-845) If you "flush by RAM usage" then IndexWriter may over-merge

2007-08-17 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520611 ] Yonik Seeley commented on LUCENE-845: - Merging small segments in the reader seems like a cool idea on it's own. B

Re: TermVectorsWriter and DocumentsWriter

2007-08-17 Thread Michael McCandless
"Doug Cutting" <[EMAIL PROTECTED]> wrote: > Michael McCandless wrote: > > One thing I have been wondering is whether it really is necessary to > > sort the term vectors before writing to the index > > Terms in vectors are prefix-compressed. So not sorting would make > indexes bigger, and slo

[jira] Created: (LUCENE-985) AAIOB thrown when length of termText is longer than 16384 characters

2007-08-17 Thread Michael McCandless (JIRA)
AAIOB thrown when length of termText is longer than 16384 characters Key: LUCENE-985 URL: https://issues.apache.org/jira/browse/LUCENE-985 Project: Lucene - Java Issue Type

[jira] Updated: (LUCENE-985) AIOOB thrown when length of termText is longer than 16384 characters

2007-08-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-985: -- Summary: AIOOB thrown when length of termText is longer than 16384 characters (was: AA

[jira] Commented: (LUCENE-845) If you "flush by RAM usage" then IndexWriter may over-merge

2007-08-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520649 ] Michael McCandless commented on LUCENE-845: --- > Merging small segments in the reader seems like a cool idea

[jira] Commented: (LUCENE-845) If you "flush by RAM usage" then IndexWriter may over-merge

2007-08-17 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520655 ] Yonik Seeley commented on LUCENE-845: - > But if writer flushes frequently and reader re-opens less frequently > t

[jira] Commented: (LUCENE-550) InstantiatedIndex - faster but memory consuming index

2007-08-17 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520659 ] Karl Wettin commented on LUCENE-550: I just found a bug that I can not explain. While scoring this one specific

[jira] Commented: (LUCENE-845) If you "flush by RAM usage" then IndexWriter may over-merge

2007-08-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520667 ] Michael McCandless commented on LUCENE-845: --- Agreed. OK, I think this is a dead end: it adds complexity an

[jira] Closed: (LUCENE-983) Enable IndexReader to merge tail segments on demand, in RAM, when opening

2007-08-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless closed LUCENE-983. - Resolution: Won't Fix On further discussion in LUCENE-845 it doesn't look like this idea

[jira] Commented: (LUCENE-985) AIOOB thrown when length of termText is longer than 16384 characters

2007-08-17 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520693 ] Hoss Man commented on LUCENE-985: - As a clarification point for people who stumble upon this issue years from now af

[jira] Updated: (LUCENE-550) InstantiatedIndex - faster but memory consuming index

2007-08-17 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated LUCENE-550: --- Attachment: LUCENE-550_20070817_no_core_changes.txt Previously mentioned problems deloused. The phras

[jira] Updated: (LUCENE-985) AIOOB thrown when length of termText is longer than 16384 characters (ArrayIndexOutOfBoundsException)

2007-08-17 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated LUCENE-985: Description: DocumentsWriter has a max term length of 16384; if you cross that you get an unfriendly ArrayI

[jira] Commented: (LUCENE-985) AIOOB thrown when length of termText is longer than 16384 characters (ArrayIndexOutOfBoundsException)

2007-08-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520727 ] Michael McCandless commented on LUCENE-985: --- > As a clarification point for people who stumble upon this is

[jira] Commented: (LUCENE-985) AIOOB thrown when length of termText is longer than 16384 characters (ArrayIndexOutOfBoundsException)

2007-08-17 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12520741 ] Karl Wettin commented on LUCENE-985: I doubt anyone will have a problem with the limit. And if they hit the exce

[jira] Updated: (LUCENE-985) AIOOB thrown when length of termText is longer than 16384 characters (ArrayIndexOutOfBoundsException)

2007-08-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-985: -- Attachment: LUCENE-985.patch > I doubt anyone will have a problem with the limit. And i