[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650882#action_12650882 ] Earwin Burrfoot commented on LUCENE-1461: - Somewhat off topic, but nonetheless, my

Re: [jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread eks dev
I think it is configurable, do not know if default or not From: Jason Rutherglen <[EMAIL PROTECTED]> To: java-dev@lucene.apache.org Sent: Tuesday, 25 November, 2008 23:39:15 Subject: Re: [jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650855#action_12650855 ] Marvin Humphrey commented on LUCENE-1458: - > Well ...that cat command can be deadl

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650854#action_12650854 ] Marvin Humphrey commented on LUCENE-1458: - > I think, just like we are pushing for

[jira] Updated: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Tim Sturge (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Sturge updated LUCENE-1461: --- Attachment: LUCENE-1461b.patch Here's the first cleanup Changes: RangeMultiFilter now FieldCacheRa

[jira] Issue Comment Edited: (LUCENE-1453) When reopen returns a new IndexReader, both IndexReaders may now control the lifecycle of the underlying Directory which is managed by reference counting

2008-11-25 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650800#action_12650800 ] [EMAIL PROTECTED] edited comment on LUCENE-1453 at 11/25/08 3:49 PM: ---

Re: TrieRangeQuery for contrib?

2008-11-25 Thread Andrzej Bialecki
Uwe Schindler wrote: - Inside Luke, the values of such "Trie" fields are not human readable (because of the encoding). Even when stored, the current implementation uses the special encoding to store the field. For displaying the field you have to use the decoder from the TrieUtils class. But thi

[jira] Commented: (LUCENE-1453) When reopen returns a new IndexReader, both IndexReaders may now control the lifecycle of the underlying Directory which is managed by reference counting

2008-11-25 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650800#action_12650800 ] Mark Miller commented on LUCENE-1453: - I think this bug should be called it in changes

[jira] Resolved: (LUCENE-1465) NearSpansOrdered.getPayload does not return the payload from the minimum match span

2008-11-25 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller resolved LUCENE-1465. - Resolution: Fixed Lucene Fields: [New, Patch Available] (was: [New]) Thanks Jonathan and

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Tim Sturge (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650777#action_12650777 ] Tim Sturge commented on LUCENE-1461: Progress report: Having written some javadocs, I

Re: [jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Jason Rutherglen
> simple default could be B-Tree with prefix compression, it never disappoints and is relatively simple to implement. Berkeley DB (java edition) uses this Just to clarify, are you saying BDB Java performs prefix compression by default? On Sat, Nov 22, 2008 at 6:38 AM, eks dev <[EMAIL PROTECTED]>

[jira] Resolved: (LUCENE-1467) Consolidate Solr's and Lucene's OpenBitSet classes

2008-11-25 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch resolved LUCENE-1467. --- Resolution: Fixed Committed revision 720609. > Consolidate Solr's and Lucene's OpenBitSet c

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650722#action_12650722 ] Marvin Humphrey commented on LUCENE-1458: - > How about the caller provides a codec

RE: TrieRangeQuery for contrib?

2008-11-25 Thread Uwe Schindler
Hi Mike, > > - Inside Luke, the values of such "Trie" fields are not human readable > > (because of the encoding). Even when stored, the current > > implementation uses > > the special encoding to store the field. For displaying the field > > you have > > to use the decoder from the TrieUtils clas

Re: TrieRangeQuery for contrib?

2008-11-25 Thread Michael McCandless
Uwe Schindler wrote: Hi Mike, hi Paul, Mike: You are right, the algorithm has one advantage and one disadvantage: - There is not only a logarithmic bound, there is a hard upper bound. In my algorithm with 8 precisions (so from 1 to 8 bytes length of keys) the maximum numbers of terms to

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650697#action_12650697 ] Paul Elschot commented on LUCENE-1461: -- {quote}Are we settled on RangeMultiFilter as

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650694#action_12650694 ] Michael McCandless commented on LUCENE-1461: OK that makes sense -- let's leav

RE: TrieRangeQuery for contrib?

2008-11-25 Thread Uwe Schindler
Hi Mike, hi Paul, Mike: You are right, the algorithm has one advantage and one disadvantage: - There is not only a logarithmic bound, there is a hard upper bound. In my algorithm with 8 precisions (so from 1 to 8 bytes length of keys) the maximum numbers of terms to be visited is limited to 3825

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Tim Sturge (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650685#action_12650685 ] Tim Sturge commented on LUCENE-1461: Paul, Thanks for the updates. I'll see about to

[jira] Commented: (LUCENE-1467) Consolidate Solr's and Lucene's OpenBitSet classes

2008-11-25 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650669#action_12650669 ] Michael Busch commented on LUCENE-1467: --- They return -1. next(int) has a comment whi

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650647#action_12650647 ] Marvin Humphrey commented on LUCENE-1458: - >> We're trying to fake up an array of

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650644#action_12650644 ] Michael McCandless commented on LUCENE-1458: {quote} I was hoping to avoid for

Re: TrieRangeQuery for contrib? was: [jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Michael McCandless
I would love to see this find it's way into Lucene! It puts a logarithmic bound (I think?) on the number of terms whose docs must be visited to implement the range query, at the expense of an increase in index size for those fields that need range search. For large indexes this should make an i

Re: Mark Miller as core Lucene committer

2008-11-25 Thread Grant Ingersoll
On Nov 24, 2008, at 10:19 PM, Mark Miller wrote: Shes been trying to trick me into going outdoors for months now. Fresh air is good for your search skills, you should listen to her... - To unsubscribe, e-mail: [EMAIL PROTEC

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650619#action_12650619 ] Marvin Humphrey commented on LUCENE-1458: - > I would think this "openReader" metho

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650599#action_12650599 ] Michael McCandless commented on LUCENE-1458: {quote} The equivalent to segmen

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650585#action_12650585 ] Marvin Humphrey commented on LUCENE-1458: - > I think you "just" have to have "inde

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650579#action_12650579 ] Paul Elschot commented on LUCENE-1461: -- Uwe, As it is already under APL 2.0, TrieRan

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650566#action_12650566 ] Uwe Schindler commented on LUCENE-1461: --- I understood this in the same way. This is

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650562#action_12650562 ] Paul Elschot commented on LUCENE-1461: -- {quote}Someday, I think RangeFilter and Range

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650558#action_12650558 ] Michael McCandless commented on LUCENE-1458: {quote} > Be careful: it's the se

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650546#action_12650546 ] Michael McCandless commented on LUCENE-1458: {quote} If IndexReader were an or

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650542#action_12650542 ] Michael McCandless commented on LUCENE-1458: {quote} How do you plan on deali

TrieRangeQuery for contrib? was: [jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Uwe Schindler
I do not want to attach this to the issue report, just for completeness: I implemented (based on RangeFilter) another approach for faster RangeQueries, based on longs stored in index in a special format. The idea behind this is to store the longs in different precision in index and partition the

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650534#action_12650534 ] Michael McCandless commented on LUCENE-1458: {quote} Just the index terms (i.e

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650530#action_12650530 ] Michael McCandless commented on LUCENE-1461: Should we absorb RangeMultiFilter

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650527#action_12650527 ] Michael McCandless commented on LUCENE-1461: In fact, could you add a test tha

[jira] Commented: (LUCENE-1467) Consolidate Solr's and Lucene's OpenBitSet classes

2008-11-25 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650517#action_12650517 ] Michael McCandless commented on LUCENE-1467: Michael, what do nextDoc() and ne

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650496#action_12650496 ] Paul Elschot commented on LUCENE-1461: -- Tim, If there is code that depends on some p

[jira] Updated: (LUCENE-1461) Cached filter for a single term field

2008-11-25 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Elschot updated LUCENE-1461: - Attachment: LUCENE-1461a.patch Here's a patch for the latest RangeMultiFilter. I've changed the