Committed revision 735928.

2009-01-19 Thread patrick o'leary
Committed revision 735928. Adding myself to contrib committers list / testing karma Thanks Patrick scootie:site pjaol$ svn diff docs/*.html Index: docs/whoweare.html === --- docs/whoweare.html (revision 735927) +++ docs/whoweare.

[jira] Closed: (LUCENE-1519) Change Primitive Data Types from int to long in class SegmentMerger.java

2009-01-19 Thread Deepak (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deepak closed LUCENE-1519. -- No problem > Change Primitive Data Types from int to long in class SegmentMerger.java > -

[jira] Updated: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-01-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1483: Description: This issue changes how an IndexSearcher searches over multiple segments. The current

[jira] Updated: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-01-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1483: Description: This issue changes how an IndexSearcher searches over multiple segments. The current

[jira] Updated: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-01-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1483: Description: This issue changes how an IndexSearcher searches over multiple segments. The current

[jira] Updated: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-01-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1483: Description: This issue changes how an IndexSearcher searches over multiple segments. The current

[jira] Updated: (LUCENE-1522) another highlighter

2009-01-19 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-1522: --- Description: I've written this highlighter for my project to support bi-gram token stream.

[jira] Commented: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-01-19 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12665252#action_12665252 ] Michael Busch commented on LUCENE-1483: --- Mark and Mike, this issue and the patch ar

Re: Filesystem based bitset

2009-01-19 Thread Paul Elschot
On Monday 19 January 2009 11:32:17 Michael McCandless wrote: > > Paul Elschot wrote: > > > Since this started by thinking out loud, I'd like to continue doing > > that. > > I've been thinking about how to add a decent skipTo() to something > > that > > compresses better than an (Open)BitSet,

[jira] Updated: (LUCENE-1522) another highlighter

2009-01-19 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-1522: --- Attachment: LUCENE-1522.patch To apply this patch, LUCENE-1448 also needs to be applied. {cod

[jira] Created: (LUCENE-1522) another highlighter

2009-01-19 Thread Koji Sekiguchi (JIRA)
another highlighter --- Key: LUCENE-1522 URL: https://issues.apache.org/jira/browse/LUCENE-1522 Project: Lucene - Java Issue Type: Improvement Components: contrib/highlighter Reporter: Koji Sekiguchi

Re: Question on Lucene search

2009-01-19 Thread Grant Ingersoll
Please ask your question on java-u...@lucene.apache.org. Thanks, Grant On Jan 19, 2009, at 1:20 AM, fell wrote: Hi all, I am new to Lucene and I need to know the following: In case I have indexed some data using Lucene and it contains the fields: Location, City, Country. Suppose the dat

Using full norms (Was: Bubbling up newer records)

2009-01-19 Thread Jiří Kuhn
Hello, > > Michael McCandless wrote: > > The upcoming Lucene in Action revision (now available online through Manning's MEAP) has a basic example of this (boosting by recency) in the Advanced Search chapter, using function queries. > I have never used function queries before, but it was very easy

Using full norms (Was: Bubbling up newer records)

2009-01-19 Thread Jiří Kuhn
Hello, > > Michael McCandless wrote: > > The upcoming Lucene in Action revision (now available online through Manning's MEAP) has a basic example of this (boosting by recency) in the Advanced Search chapter, using function queries. > I have never used function queries before, but it was very easy

[jira] Commented: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-01-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12665102#action_12665102 ] Michael McCandless commented on LUCENE-1483: I'm working on another iteration

Re: Filesystem based bitset

2009-01-19 Thread Michael McCandless
Paul Elschot wrote: Since this started by thinking out loud, I'd like to continue doing that. I've been thinking about how to add a decent skipTo() to something that compresses better than an (Open)BitSet, and this turns out to be an integer set implemented as a B plus tree (all leafs on th

Re: [jira] Commented: (LUCENE-1483) Change IndexSearcher multisegment searches to search each individual segment using a single HitCollector

2009-01-19 Thread Michael McCandless
I'm also seeing decent gains (~13%) for sort-by-relevance (ie the default sort) term queries w/ large number (~97K and ~386K) of hits on 10 & 36 segment indices. So I agree, LUCENE-1483 is not just about speeding up sort-by-field queries. It seems to give good speedups all around, and of

Re: Filesystem based bitset

2009-01-19 Thread eks dev
Hi Paul, not really an answer to your questions, I just thought you may find it useful as a confirmation that this packing of integers into a (B or some other) tree is a good one. I have seen integer set distributions that can profit hugely from the tree organization on top. Have a look at: http