Re: Storing phrases in index

2008-04-10 Thread Chris Hostetter
: To: java-dev@lucene.apache.org : Subject: Storing phrases in index http://people.apache.org/~hossman/#java-dev Please Use "[EMAIL PROTECTED]" Not "[EMAIL PROTECTED]" Your question is better suited for the [EMAIL PROTECTED] mailing list ... not the [EMAIL PROTECTED] list. java-dev is for disc

Re: Pooling of posting objects in DocumentsWriter

2008-04-10 Thread Marvin Humphrey
On Apr 10, 2008, at 2:37 AM, Michael McCandless wrote: IMO, the abstract base Posting class should not track text. It should include only one datum: a document number. This keeps it in line with the simplest IR definition for a "posting": one document matching one term. But how do you t

Re: Storing phrases in index

2008-04-10 Thread palexv
Please, can you give me a simple example of how to use ShingleFilter? Thanks a lot! Mathieu Lecarme wrote: > > palexv a écrit : >> Thanks! >> Can you help me to get ShingleFilter class. It is absent in version >> 2.3.1. >> How can I get it? >> > It's in the SVN version. You can backport it,

Re: Flexible indexing design

2008-04-10 Thread Michael McCandless
Marvin Humphrey <[EMAIL PROTECTED]> wrote: > On Apr 9, 2008, at 6:35 AM, Michael Busch wrote: > > > > We also need to come up with a good solution for the dictionary, because a > term with frq/prx postings needs to store two (or three for skiplist) file > pointers in the dictionary, whereas e. g. a

Re: Flexible indexing design (was Re: Pooling of posting objects in DocumentsWriter)

2008-04-10 Thread Michael McCandless
Michael Busch <[EMAIL PROTECTED]> wrote: > > I agree we would have an abstract base Posting class that just tracks > > the term text. > > > > Then, DocumentsWriter manages inverting each field, maintaining the > > per-field hash of term Text -> abstract Posting instances, exposing > > the methods

Re: Pooling of posting objects in DocumentsWriter

2008-04-10 Thread Michael McCandless
Marvin Humphrey <[EMAIL PROTECTED]> wrote: > > On Apr 8, 2008, at 10:25 AM, Michael McCandless wrote: > > > I've actually been working on factoring DocumentsWriter, as a first > > step towards flexible indexing. > > > > The way I handled this in KS was to turn Posting into a class akin to > TermB

Re: Storing phrases in index

2008-04-10 Thread Mathieu Lecarme
palexv a écrit : Thanks! Can you help me to get ShingleFilter class. It is absent in version 2.3.1. How can I get it? It's in the SVN version. You can backport it, are building your own, with a Stack. M. - To unsubscribe,

Re: Storing phrases in index

2008-04-10 Thread palexv
Thanks! Can you help me to get ShingleFilter class. It is absent in version 2.3.1. How can I get it? Mathieu Lecarme wrote: > > palexv a écrit : >> Hello all. >> I have a question to advanced in lucene. >> I have a set of phrases which I need to store in index. >> Is there is a way of storing