> I think it is not good idea to use lucene as storage, it is just index.

I strongly disagree with this position.

To qualify my disagreement: yes, you should not use Lucene as your
primary storage for your data in your organization.

But, for a particular application, taking content from your primary
storage system (SQL database, filesystem files, etc) and - in the
context of an end-user application - both indexing and storing the
content is a good solution. The stored content in Lucene if
effectively a cache.

Advantages:
- faster (don't have to make additional queries to find content in
primary storage system)
- less system dependencies (if the primary system is down...)
- no longer hitting primary storage system (which are usually already
busy doing other things and also tend to be expensive)
- simpler

Disadvantages
- larger index
- might be slower, if the index is significantly larger
- updating issues

thanks,

Glen
http://zzzoot.blogspot.com/

2008/9/19 Dragan Jotanovic <[EMAIL PROTECTED]>:
> I think it is not good idea to use lucene as storage, it is just index.
> You could probably implement this using flat files and lucene.
> Your simDocId would be stored field which you can retrieve from the index 
> after search, and it could also contain the information where on the disk is 
> document located.
>
>
> -----Original Message-----
> From: xh sun [mailto:[EMAIL PROTECTED]
> Sent: Friday, September 19, 2008 9:44 AM
> To: java-user@lucene.apache.org
> Subject: Re: could I implement this scenario?
>
> I store the data in flatfiles and db. I want to implement it using Lucene 
> only, but if it fails, maybe I shall create a temporary table for each query.
>
>
>
> ----- Original Message ----
> From: mathieu <[EMAIL PROTECTED]>
> To: java-user@lucene.apache.org
> Sent: Friday, September 19, 2008 4:34:13 PM
> Subject: Re: could I implement this scenario?
>
>
> Lucene is just an index. Where do you wont to store your data? in a db,
> flatfiles, document with an url, in lucene?
>
> M.
>
> On Fri, 19 Sep 2008 16:25:27 +0800 (CST), xh sun
> <[EMAIL PROTECTED]> wrote:
>> Thank you. Mathieu.
>>
>> But the hits don't include the document doc02  in my example, how to
>> display  doc02?  I don't want to search by docid. Thanks.
>>
>>
>>
>> ----- Original Message ----
>> From: mathieu <[EMAIL PROTECTED]>
>> To: java-user@lucene.apache.org
>> Sent: Friday, September 19, 2008 4:14:34 PM
>> Subject: Re: could I implement this scenario?
>>
>>
>>
>> Yes. You can store data in lucene index and don't search on it : your
>> simdocid.
>>
>> M.
>>
>> On Fri, 19 Sep 2008 16:00:20 +0800 (CST), xh sun
>> <[EMAIL PROTECTED]> wrote:
>>> Hi all,
>>>
>>> How can I implemented this scenario in lucene?
>>>
>>> suppose every document has three fields: docid, doctext and simdocid.
>>> docid is the id of the document, doctext is the content of the document,
>>> dimdocid is the docid of a similar document with this document.
>>> example:
>>> docid  doctext                    simdocid
>>> doc01  ************************   doc04
>>> doc02  ************************   doc03
>>> doc03  ************************   doc02
>>> doc04  ************************   doc03
>>> doc05  ************************   doc04
>>> doc06  ************************   doc02
>>>
>>> During query, the index will be searched basing on field doctext. If the
>>> hits include four documents doc01,doc03,doc04, doc05, I want to display
>> the
>>> corresponding similar documents only, that is, the three documents
>>> doc04,doc02,doc03.
>>>
>>> Appreicate your help very much.
>>>
>>> BR,
>>> Shawn
>>>
>>>
>>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [EMAIL PROTECTED]
>> For additional commands, e-mail: [EMAIL PROTECTED]
>>
>>
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>



-- 

-

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to