Re: ThreadLocal causing memory leak with J2EE applications

robert engels Wed, 10 Sep 2008 13:04:22 -0700

Always your prerogative.

On Sep 10, 2008, at 1:15 PM, Chris Lu wrote:

Actually I am done with it by simply downgrading and not to user659602 and later.The old version is more clean and consistent with the API and close() does mean close, not something complicated and unknown to mostusers, which almost feels like a trap. And later on, if no changeshappened for this file, I will have to upgrade Lucene and manuallyremove the patch Lucene-1195.
--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymous perrequest) got 2.6 Million Euro funding!
On Wed, Sep 10, 2008 at 10:56 AM, robert engels<[EMAIL PROTECTED]> wrote:
Why not just use reopen() and be done with it???

On Sep 10, 2008, at 12:48 PM, Chris Lu wrote:
Yeah, the timing is different. But it's an unknown, undetermined,and uncontrollable time...
We can not ask the user,

while(memory is low){
  sleep(1000);
}
do_the_real_thing_an_hour_later


--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymous perrequest) got 2.6 Million Euro funding!
On Wed, Sep 10, 2008 at 10:39 AM, robert engels<[EMAIL PROTECTED]> wrote:Close() does work - it is just that the memory may not be freeduntil much later...
When working with VERY LARGE objects, this can be a problem.

On Sep 10, 2008, at 12:36 PM, Chris Lu wrote:
Thanks for the analysis, really appreciate it, and I agree withit. But...
This is really a normal J2EE use case. The threads seldom die.
Doesn't that mean closing the RAMDirectory doesn't work for J2EEapplications?
And only reopen() works?
And close() doesn't release the resources? duh...

I can only say this is a problem to be cleaned up.

--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymous perrequest) got 2.6 Million Euro funding!
On Wed, Sep 10, 2008 at 9:10 AM, robert engels<[EMAIL PROTECTED]> wrote:You do not need to create a new RAMDirectory - just write to theexisting one, and then reopen() the IndexReader using it.
This will prevent lots of big objects being created. This may bethe source of your problem.
Even if the Segment is closed, the ThreadLocal will no longer bereferenced, but there will still be a reference to theSegmentTermEnum (which will be cleared when the thread dies, or"most likely" when new thread locals on that thread a created, sohere is a potential problem.
Thread 1 does a search, creates a thread local that referencesthe RAMDir (A).Thread 2 does a search, creates a thread local that referencesthe RAMDir (A).
All readers, are closed on RAMDir (A).

A new RAMDir (B) is opened.
There may still be references in the thread local maps to RAMDirA (since no new thread local have been created yet).
So you may get OOM depending on the size of the RAMDir (since youwould need room for more than 1). If you extend this out withlots of threads that don't run very often, you can see how youcould easily run out of memory. "I think" that ThreadLocalshould use a ReferenceQueue so stale object slots can bereclaimed as soon as the key is dereferenced - but that is anissue for SUN.
This is why you don't want to create new RAMDirs.
A good rule of thumb - don't keep references to large objects inThreadLocal (especially indirectly). If needed, use a "key", andthen read the cache using a the "key".
This would be something for the Lucene folks to change.

On Sep 10, 2008, at 10:44 AM, Chris Lu wrote:
I am really want to find out where I am doing wrong, if that'sthe case.
Yes. I have made certain that I closed all Readers/Searchers,and verified that through memory profiler.
Yes. I am creating new RAMDirectory. But that's the problem. Ineed to update the content. Sure, if no content update andeverything the same, of course no OOM.
Yes. No guarantee of the thread schedule. But that's theproblem. If Lucene is using ThreadLocal to cache lots of thingsby the Thread as the key, and no idea when it'll be released. Ofcourse ThreadLocal is not Lucene's problem...
Chris
On Wed, Sep 10, 2008 at 8:34 AM, robert engels<[EMAIL PROTECTED]> wrote:It is basic Java. Threads are not guaranteed to run on any sortof schedule. If you create lots of large objects in one thread,releasing them in another, there is a good chance you will getan OOM (since the releasing thread may not run before the OOMoccurs)... This is not Lucene specific by any means.
It is a misunderstanding on your part about how GC works.
I assume you must at some point be creating new RAMDirectories -otherwise the memory would never really increase, since theIndexReader/enums/etc are not very large...
When you create a new RAMDirectories, you need to BE CERTAIN !!!that the other IndexReaders/Searchers using the old RAMDirectoryare ALL CLOSED, otherwise their memory will still be in use,which leads to your OOM...
On Sep 10, 2008, at 10:16 AM, Chris Lu wrote:
I do not believe I am making any mistake. Actually I just gotan email from another user, complaining about the same thing.And I am having the same usage pattern.
After the reader is opened, the RAMDirectory is shared byseveral objects.There is one instance of RAMDirectory in the memory, and it isholding lots of memory, which is expected.
If I close the reader in the same thread that has opened it,the RAMDirectory is gone from the memory.If I close the reader in other threads, the RAMDirectory isleft in the memory, referenced along the tree I draw in thefirst email.
I do not think the usage is wrong. Period.

-------------------------------------
Hi,
i found a forum post from you here [1] where you mentionthat youhave a memory leak using the lucene ram directory. I'd like toask youif you already have resolved the problem and how you did it ormaybe
you know where i can read about the solution. We are using
RAMDirectory too and figured out, that over time the memory
consumption raises and raises until the system breaks down butonlywhen we performing much index updates. if we only create theindex and
don't do nothing except searching it, it work fine.

maybe you can give me a hint or a link,
greetz,
-------------------------------------

--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymous perrequest) got 2.6 Million Euro funding!
On Wed, Sep 10, 2008 at 7:12 AM, robert engels<[EMAIL PROTECTED]> wrote:
Sorry, but I am fairly certain you are mistaken.
If you only have a single IndexReader, the RAMDirectory will beshared in all cases.
The only memory growth is any buffer space allocated by anIndexInput (used in many places and cached).
Normally the IndexInput created by a RAMDirectory do not haveany buffer allocated, since the underlying store is already inmemory.
You have some other problem in your code...

On Sep 10, 2008, at 1:10 AM, Chris Lu wrote:
Actually, even I only use one IndexReader, some resources arecached via the ThreadLocal cache, and can not be releasedunless all threads do the close action.
SegmentTermEnum itself is small, but it holds RAMDirectoryalong the path, which is big.
--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymous perrequest) got 2.6 Million Euro funding!
On Tue, Sep 9, 2008 at 10:43 PM, robert engels<[EMAIL PROTECTED]> wrote:
You do not need a pool of IndexReaders...
It does not matter what class it is, what matters is the classthat ultimately holds the reference.
If the IndexReader is never closed, the SegmentReader(s) isnever closed, so the thread local in TermInfosReader is notcleared (because the thread never dies). So you will get oneSegmentTermEnum, per thread * per segment.
The SegmentTermEnum is not a large object, so even if you had100 threads, and 100 segments, for 10k instances, seems hardto believe that is the source of your memory issue.
The SegmentTermEnum is cached by thread since it needs toenumerate the terms, not having a per thread cache, would leadto lots of random access when multiple threads read the index- very slow.
You need to keep in mind, what if every thread was executing asearch simultaneously - you would still have 100x100SegmentTermEnum instances anyway ! The only way to preventthat would be to create and destroy the SegmentTermEnum oneach call (opening and seeking to the proper spot) - whichwould be SLOW SLOW SLOW.
On Sep 10, 2008, at 12:19 AM, Chris Lu wrote:
I have tried to create an IndexReader pool and dynamicallycreate searcher. But the memory leak is the same. It's notrelated to the Searcher class specifically, but theSegmentTermEnum in TermInfosReader.
--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymous perrequest) got 2.6 Million Euro funding!
On Tue, Sep 9, 2008 at 10:14 PM, robert engels<[EMAIL PROTECTED]> wrote:A searcher uses an IndexReader - the IndexReader is slow toopen, not a Searcher. And searchers can share an IndexReader.
You want to create a single shared (across all threads/users)IndexReader (usually), and create an Searcher as needed anddispose. It is VERY CHEAP to create the Searcher.
I am fairly certain the javadoc on Searcher is incorrect.The warning "For performance reasons it is recommended toopen only one IndexSearcher and use it for all of yoursearches" is not true in the case where an IndexReader ispassed to the ctor.
Any caching should USUALLY be performed at the IndexReaderlevel.
You are most likely using the "path" ctor, and that is thesource of your problems, as multiple IndexReader instancesare being created, and thus the memory use.
On Sep 9, 2008, at 11:44 PM, Chris Lu wrote:
On J2EE environment, usually there is a searcher pool withseveral searchers open.The speed to opening a large index for every user is notacceptable.
--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymous perrequest) got 2.6 Million Euro funding!
On Tue, Sep 9, 2008 at 9:03 PM, robert engels<[EMAIL PROTECTED]> wrote:You need to close the searcher within the thread that isusing it, in order to have it cleaned up quickly... usuallyright after you display the page of results.
If you are keeping multiple searcher refs across multiplethreads for paging/whatever, you have not coded it correctly.
Imagine 10,000 users - storing a searcher for each one isnot going to work...
On Sep 9, 2008, at 10:21 PM, Chris Lu wrote:
Right, in a sense I can not release it from another thread.But that's the problem.
It's a J2EE environment, all threads are kind of equal.It's simply not possible to iterate through all threads toclose the searcher, thus releasing the ThreadLocal cache.Unless Lucene is not recommended for J2EE environment, thishas to be fixed.
--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymousper request) got 2.6 Million Euro funding!
On Tue, Sep 9, 2008 at 8:14 PM, robert engels<[EMAIL PROTECTED]> wrote:Your code is not correct. You cannot release it on anotherthread - the first thread may creating hundreds/thousandsof instances before the other thread ever runs...
On Sep 9, 2008, at 10:10 PM, Chris Lu wrote:
If I release it on the thread that's creating thesearcher, by setting searcher=null, everything is fine,the memory is released very cleanly.My load test was to repeatedly create a searcher on aRAMDirectory and release it on another thread. The testwill quickly go to OOM after several runs. I set the heapsize to be 1024M, and the RAMDirectory is of size 250M.Using some profiling tool, the used size simply stepped uppretty obviously by 250M.
I think we should not rely on something that's a "maybe"behavior, especially for a general purpose library.
Since it's a multi-threaded env, the thread that'screating the entries in the LRU cache may not go awayquickly(actually most, if not all, application serverswill try to reuse threads), so the LRU cache, which usesthread as the key, can not be released, so theSegmentTermEnum which is in the same class can not bereleased.
And yes, I close the RAMDirectory, and the fileMap isreleased. I verified that through the profiler by directlychecking the values in the snapshot.
Pretty sure the reference tree wasn't like this using codebefore this commit, because after close the searcher inanother thread, the RAMDirectory totally disappeared fromthe memory snapshot.
--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymousper request) got 2.6 Million Euro funding!
On Tue, Sep 9, 2008 at 5:03 PM, Michael McCandless<[EMAIL PROTECTED]> wrote:
Chris Lu wrote:
The problem should be similar to what's talked about onthis discussion.http://lucene.markmail.org/message/keosgz2c2yjc7qre?q=ThreadLocal
The "rough" conclusion of that thread is that,technically, this isn't a memory leak but rather a"delayed freeing" problem. Ie, it may take longer,possibly much longer, than you want for the memory to befreed.
There is a memory leak for Lucene search from Lucene-1195.(svn r659602, May23,2008)
This patch brings in a ThreadLocal cache to TermInfosReader.
One thing that confuses me: TermInfosReader was alreadyusing a ThreadLocal to cache the SegmentTermEnuminstance. What was added in this commit (for LUCENE-1195)was an LRU cache storing Term -> TermInfo instances. Butit seems like it's the SegmentTermEnum instance thatyou're tracing below.
It's usually recommended to keep the reader open, andreuse it whenpossible. In a common J2EE application, the http requestsare usuallyhandled by different threads. But since the cache isThreadLocal, the cacheare not really usable by other threads. What's worse, thecache can not be
cleared by another thread!
This leak is not so obvious usually. But my case is usingRAMDirectory,having several hundred megabytes. So one un-releasedresource is obvious to
me.

Here is the reference tree:
org.apache.lucene.store.RAMDirectory
 |- directory of org.apache.lucene.store.RAMFile
    |- file of org.apache.lucene.store.RAMInputStream
|- base oforg.apache.lucene.index.CompoundFileReader$CSIndexInput|- input oforg.apache.lucene.index.SegmentTermEnum|- value of java.lang.ThreadLocal$ThreadLocalMap$Entry
So you have a RAMDir that has several hundred MB stored init, that you're done with yet through this path Lucene iskeeping it alive?
Did you close the RAMDir? (which will null its fileMapand should also free your memory).
Also, that reference tree doesn't show the ThreadResourcesclass that was added in that commit -- are you sure thisreference tree wasn't before the commit?
Mike
---------------------------------------------------------------------To unsubscribe, e-mail: java-dev-[EMAIL PROTECTED]For additional commands, e-mail: java-dev-[EMAIL PROTECTED]
--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymousper request) got 2.6 Million Euro funding!
--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutesDBSight customer, a shopping comparison site, (anonymous perrequest) got 2.6 Million Euro funding!

Re: ThreadLocal causing memory leak with J2EE applications

Reply via email to