Re: Control over Lucene Index

Grant Ingersoll Wed, 03 May 2006 05:30:07 -0700

You could write a "dummy" Analyzer that provides the tokens from your external process. As for statistics, what kind are you interested in? I suppose you can store them in a field along with the document, or you can set the boost values for the field/document, but that may be a bit simple for your needs.
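A rough, library-independent sketch of the "dummy" Analyzer idea might look like the following. The names TokenSource, PrecomputedTokenSource, DummyAnalyzer, and setTokens are hypothetical stand-ins invented for illustration, loosely modelled on a pull-style token stream; they are not Lucene's actual classes or signatures, which vary between versions. The point is only that the analyzer ignores the raw text and replays tokens produced elsewhere.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-in for a pull-style token stream; this is NOT Lucene's class. */
interface TokenSource {
    /** Returns the next token, or null once the stream is exhausted. */
    String next();
}

/** Replays tokens computed elsewhere; does no parsing or analysis of its own. */
final class PrecomputedTokenSource implements TokenSource {
    private final Iterator<String> tokens;

    PrecomputedTokenSource(List<String> tokens) {
        this.tokens = tokens.iterator();
    }

    @Override
    public String next() {
        return tokens.hasNext() ? tokens.next() : null;
    }
}

/** Hypothetical "dummy" analyzer: ignores the raw field text and hands back external tokens. */
final class DummyAnalyzer {
    private final Map<String, List<String>> tokensByField = new HashMap<>();

    /** Registers the external indexer's tokens for one field of the document about to be added. */
    void setTokens(String fieldName, List<String> tokens) {
        tokensByField.put(fieldName, new ArrayList<>(tokens));
    }

    /** The text argument is deliberately unused: the tokens were produced outside. */
    TokenSource tokenStream(String fieldName, String ignoredText) {
        List<String> tokens =
                tokensByField.getOrDefault(fieldName, Collections.<String>emptyList());
        return new PrecomputedTokenSource(tokens);
    }

    /** Helper for the demo: pulls every token out of a source. */
    static List<String> drain(TokenSource source) {
        List<String> out = new ArrayList<>();
        for (String t = source.next(); t != null; t = source.next()) {
            out.add(t);
        }
        return out;
    }
}

class DummyAnalyzerDemo {
    public static void main(String[] args) {
        DummyAnalyzer analyzer = new DummyAnalyzer();
        // Tokens as an external indexer might emit them (illustrative values only).
        analyzer.setTokens("body", Arrays.asList("distribut", "retriev", "index"));
        System.out.println(
                DummyAnalyzer.drain(analyzer.tokenStream("body", "raw text, never parsed")));
    }
}
```

In a real setup the same shape would be expressed through whatever Analyzer and TokenStream contract your Lucene version defines, with the external tokens registered before each document is added.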


Ralf Bierig wrote:

Hi,
in the context of a distributed information retrieval project, we would like to use Lucene for its indexing capabilities but not for retrieval. In particular, we would like to populate a Lucene index with the tokens and statistics already computed by an external indexer, thereby bypassing the document-based parsing, analysis, and ingestion into the index which characterises Lucene's standard workflow. Is this possible? That is, is it possible to feed precomputed statistics into a Lucene index? And is it possible to have control over which statistics are associated with each document (as we will not use Lucene for retrieval, we are not interested in complying with the statistics it needs to perform a search)?
Any help greatly appreciated, many thanks.

Cheers,


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

--

Grant Ingersoll
Sr. Software Engineer
Center for Natural Language Processing
Syracuse University
School of Information Studies
335 Hinds Hall
Syracuse, NY 13244
http://www.cnlp.org
Voice: 315-443-5484
Fax: 315-443-6886

