Re: Index: mixing the structure of persistence

Orion Letizi Mon, 26 Nov 2007 10:56:15 -0800

Because Terracotta treats the Lucene indexes as cluster-wide shared objects,
updates to those indexes are made automatically available across the entire
cluster.


On one machine, a plain Lucene index will give you the in-memory caching and
spill-to-disk behavior you are looking for.  Terracotta will give you that
behavior across a cluster of machines.  I wouldn't compare the performance
of a plain Lucene index on one machine with the performance of a clustered
Lucene index on one machine.  Use Terracotta if you need the index available
on multiple machines; use plain Lucene if you want high-performance on a
single machine.

That said, if you need help setting up and tuning a  Terracotta-clustered
Lucene index for your use-case, you can visit the Terracotta forums: 

  http://forums.terracotta.org/

Cheers,
--Orion
http://www.terracotta.org/


Haroldo Nascimento wrote:
> 
> Hi,
> 
>   I have a very great volume of data (6.000.000 of documents) and I
> need to have a very fast search. I am thinking about using Terracotta
> (with Lucene) for clustering the solution.
> 
>   One of the advantages of the Terracotta is that part of the index is
> stored in memory and part is persisted em disk. If not to find in
> memory the application searchs in disk. This is transparent for the
> user. The problem would be in the process of updat of index.
> 
>   Another solution would be to persist the index using only Lucene,
> but I believe that the reply time very using disk either bigger that
> the solution in memory.
> 
>   You know some document comparative about the solution em memory
> (RAMDirectory) and solution em disk using Lucene?
> 
>   Tip: In another application I am using the solution index in memory
> (RAMDirectory), to initiate the process of load of the indice I
> serialized the RAMDirectory object. For it I need  insert "implements
> Serializable" in some classrooms of Lucene.
> 
> 
> 
> On Nov 25, 2007 5:55 PM, Erick Erickson <[EMAIL PROTECTED]> wrote:
>> As I understand, Lucene does a fair amount of caching of terms in
>> memory without you having to specify anything.
>>
>> But it's hard to see how your question relates. Remember that Lucene is
>> finding *all* matching docs. So searching in a RAMdirectory and then
>> searching in the file doesn't really seem possible since Lucene has to
>> search
>> the entire index every time to score the docs. It doesn't stop after
>> the first hit, since the next hit may score higher.
>>
>> But I'm sure Lucene *does* cache portions of the index in RAM when
>> possible, but I've never had occasion to dig into the details.
>>
>> Which leads me to ask "Why do you care?". Is there a specific
>> situation you're trying to get better performance from or is this more
>> of a background question? If you have a specific situation, please
>> describe
>> it in some detail so better minds than mine can give you a better
>> response <G>....
>>
>> Best
>> Erick
>>
>> On Nov 24, 2007 10:26 AM, Haroldo Nascimento <[EMAIL PROTECTED]>
>> wrote:
>>
>>
>> > Hi,
>> >
>> >  I have a question ?
>> >
>> >  Lucene offers a mixing structure of storage of index, that is, first
>> > do search in memoria (ARMDirectory) and in case of not found do search
>> > in index file automatically ? For example: Load part of index in
>> > memory for do the search fastest.
>> >
>> >  Thnaks
>> >
>> > ---------------------------------------------------------------------
>> > To unsubscribe, e-mail: [EMAIL PROTECTED]
>> > For additional commands, e-mail: [EMAIL PROTECTED]
>> >
>> >
>>
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Index%3A-mixing-the-structure-of-persistence-tf4866100.html#a13955202
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Index: mixing the structure of persistence

Reply via email to