Re: 答复: 答复: Lucene in large database contexts

Antonello Provenzano Fri, 10 Aug 2007 03:25:22 -0700

Kai,

The context I'm going to work with requires a continuous addition of
documents to the indexes, since it's user-driven content, and this
would require the content to be always up-to-date.
This is the problem I'm facing, since I cannot rebuild a 1Gb (at
least) index every time a user inserts a new entry into the database.


I know Digg, for instance, is using Lucene as search engine: since the
amount of data they're dealing with is much higher than mine, I would
like to understand the way they used to implement this kind of
solution.

Thank you again.
Antonello


On 8/10/07, Kai Hu <[EMAIL PROTECTED]> wrote:
> Antonello,
>         You are right,I think lucene indexsearcher will search the old 
> information if IndexWriter was not closed(I think lucene release the Lock 
> here),so I only add a few documents every time from buffer to implement index 
> "real time".
>
> kai
>
>
> 发件人: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] 代表 Antonello Provenzano
> 发送时间: 2007年8月10日 星期五 17:59
> 收件人: [email protected]
> 主题: Re: 答复: Lucene in large database contexts
>
> Kai,
>
> Thanks. The problem I see it's that although I can add a Document
> through IndexWriter or IndexModifier, this won't be searchable until
> the index is closed and, possibly, optimized, since the score of the
> document in the index context must be re-calculated on the basis of
> the whole context.
>
> Is this assumption true? or am I completely wrong?
>
> Cheers.
> Antonello
>
>
> On 8/10/07, Kai Hu <[EMAIL PROTECTED]> wrote:
> > Hi, Antonello
> >         You can use IndexWriter.addDocument(Document document) to add 
> > single document,same to update,delete operation.
> >
> > kai
> >
> > -----邮件原件-----
> > 发件人: Antonello Provenzano [mailto:[EMAIL PROTECTED]
> > 发送时间: 2007年8月10日 星期五 17:09
> > 收件人: [email protected]
> > 主题: Lucene in large database contexts
> >
> > Hi There!
> >
> > I've been working for a while on the implementation of a website
> > oriented to contents that would contain millions of entries, most of
> > them indexable (such as descriptions, texts, names, etc.).
> > The ideal solution to make them searchable would be to use Lucene as
> > index and search engine.
> >
> > The reason I'm posting the mailing list is the following: since all
> > the entries will be stored in a database (most likely MySQL InnoDB or
> > Oracle), what's the best technique to implement a system that indexes
> > in "real time" (eg. when an entry is inserted into the databsse) the
> > content and make it searchable? Based on my understanding of Lucene,
> > such this thing is not possible, since the index must be re-created to
> > be able to search the indexed contents. Is this true?
> >
> > Eventually, could anyone point me to a working example about how to
> > implement such a similar context?
> >
> >
> > Thank you for the support.
> > Antonello
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
> >
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>

Re: 答复: 答复: Lucene in large database contexts

Reply via email to