Yes we also found the RAMDirectory to have a big load penalty and not much performance gain.
Glyn, we have found with our high-traffic sites that it is beneficial to not overwrite the latest version of the index in the directory that is currently being read from. Instead we write to a new time-stamped directory and update an XML file with the latest directory location of the index. The website then monitors that xml file and when it changes, uses the latest index location to read. A windows service takes care of generating update indexes and distributing them to web nodes. Pete -----Original Message----- From: Glyn Darkin [mailto:g...@darkinsystems.com] Sent: 26 February 2009 18:24 To: lucene-net-user@incubator.apache.org Subject: Re: Lucene Replication Support Great stuff, thankyou for the thoughts. Glyn 2009/2/26 Jokin Cuadrado <joki...@gmail.com>: > we use the fsdirectory, ramdirectory could be better on some edge cases, > especially when you have a small index or do a lot of inserts and updates to > the index. If you have enough memory on the machine, Lucene does a good job > caching the termvectors and some relevant info, after some time the disk > usage should be very low. The ramdirectory has a big penalty on load time > has it have to load all the info in the memory, even that info that will > never be used, that is a big cost with such a big index, also it doesn't > ensure that it wouldn't be swapped to disk if the OS needs memory. > > On Thu, Feb 26, 2009 at 6:07 PM, Glyn Darkin <g...@darkinsystems.com> wrote: > >> Thanks for the quick response Jokin, >> >> We are currently building a search solution that has a 3.6 g index >> Max concurrent reads at the moment is 54, but we are hoping that this >> will increase significantly as the website traffic increases >> This will be a read only index, with a deployment of a new index nightly. >> >> On another note, >> We are considering running searches against the index using a >> FSDirectory. Is this how you have your search or do you load the index >> into a RAMDirectory? >> >> Cheers >> >> Glyn >> >> >> >> >> >> >> 2009/2/26 Jokin Cuadrado <joki...@gmail.com>: >> > i have a 3 Gb index and another one of 800 Mb, the update rate is small >> and >> > the number or concurrent search right now it's also small, but i made the >> > synchronization during a stress test without any problem, you just have >> to >> > copy the cfs file before the segments one, and as the most copy software >> > list the directories in alphabetically order, it always happen. (cfs >> files >> > start with an "_" character, so they are always the first to be copied). >> > in my case this works well because i don't have big restrictions, for >> > example the result of the search could be diferent between 2 machines >> during >> > an small amount of time, and the incremental updates are done every >> couple >> > hours. >> > >> > If you tell us your constraints, I could suggest you more complex >> approaches >> > . >> > >> > >> > >> > >> > On Thu, Feb 26, 2009 at 4:44 PM, Glyn Darkin <g...@darkinsystems.com> >> wrote: >> > >> >> Jokin, >> >> >> >> What sort of index size are you dealing with and how many concurrent >> >> searches would be running against your index when you Robocopy? >> >> >> >> Cheers >> >> >> >> Glyn >> >> >> >> >> >> 2009/2/26 Jokin Cuadrado <joki...@gmail.com>: >> >> > Robocopy /Mir works smoothly, just make sure that you copy first the >> >> index >> >> > files (.cfs) and after the segments.* ones. >> >> > >> >> > On Thu, Feb 26, 2009 at 5:35 AM, Nitin Shiralkar < >> nit...@coreobjects.com >> >> >wrote: >> >> > >> >> >> Hi All, >> >> >> >> >> >> Do we have any in-built replication support? Today, we are building >> the >> >> >> index and generating a periodic backup through the same builder >> service. >> >> >> This is working fine for couple of years. But I would like to know if >> >> there >> >> >> are any better options within Lucene library. Though we are using >> >> >> Lucene.NET, but would also like to know if there is any support on >> Java >> >> side >> >> >> if not on .NET. >> >> >> >> >> >> >> >> >> Thanks & regards, >> >> >> >> >> >> Nitin Shiralkar >> >> >> > >> > -- >> > Jokin >> > >> >> >> >> -- >> Glyn Darkin >> >> Darkin Systems Ltd >> Mob: 07961815649 >> Fax: 08717145065 >> Web: www.darkinsystems.com >> >> Company No: 6173001 >> VAT No: 906350835 >> > > > > -- > Jokin > -- Glyn Darkin Darkin Systems Ltd Mob: 07961815649 Fax: 08717145065 Web: www.darkinsystems.com Company No: 6173001 VAT No: 906350835 ** Please consider the environment before printing this e-mail ** The information contained in this e-mail is of a confidential nature and is intended only for the addressee. If you are not the intended addressee, any disclosure, copying or distribution by you is prohibited and may be unlawful. Disclosure to any party other than the addressee, whether inadvertent or otherwise, is not intended to waive privilege or confidentiality. Internet communications are not secure and therefore Conde Nast does not accept legal responsibility for the contents of this message. Any views or opinions expressed are those of the author. Company Registration details: The Conde Nast Publications Ltd Vogue House Hanover Square London W1S 1JU Registered in London No. 226900