Re: [pylucene-dev] pylucene memory usage

Rune Hansen Sun, 07 Aug 2005 01:15:38 -0700

On 7. aug. 2005, at 02.29, [EMAIL PROTECTED] wrote:

Hi all,
I just coded a Twisted + PyLucene server. After having live query feeds to the daemon, I notice the Python server's memory just goes up and up.
After it reaches around 1.3GB of memory, I start to get

"GC Warning: Repeated allocation of very large block"
I have 4GB of memory on RHEL 4 with zero swap in use, so memory is not the problem. After a minute or two over the 2GB memory mark, the server dies with
"Too many heap sections: Increase MAXHINCR or MAX_HEAP_SECTS"
I'm new to Python. I tried adding gc.collect() at the end of the searches, and adding "del myvars" before the gc.collect() as well. Still, memory goes up and up with no end in sight. The code base is very small, so I'm not sure where the memory leaks are coming from.
Thanks for any help.

Xing Li


Hi Xing Li,

I'd like to say "What'd you expect? It's Java!", but that would just be me venting my Java frustrations on an unsuspecting bystander.

There are tips and tricks to make Java use more memory, compiler options to gcj, and so on. I've never been successful with those; maybe someone else on the list has. But that's not the point. My experience is that (Py)Lucene will continue to use memory until kingdom come. 8GB will just make it crash a little later.

I have an Indexer/Searcher coded in pure Java running in Tomcat. It is quite a bit faster than my PyLucene implementation, but it crashes with "java.lang.OutOfMemoryError: Java heap space" at least once every three days. Running under Jetty I can get it to behave for a couple of days more, but it still crashes. I suspect the single-threadedness of my application is what's keeping it alive for such a long time anyway(!).

I also have the same implementation coded in Apache/mod_python. Apache is running in pre-fork mode and I reap Apache's children after 125 requests. At that point the memory usage of each child is ~100MB and it's time to die. This never, ever crashes (knock on wood), but it's not nearly as fast as the Lucene/Tomcat implementation.
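
For anyone who wants the same recycling idea without Apache, here is a rough sketch in plain Python. It is not my mod_python setup: it assumes a Python with the multiprocessing module and a Unix host, and handle_query, make_pool and serve are made-up names with a placeholder where the real PyLucene search would go. The only point is that each worker exits after a fixed number of requests and is replaced by a fresh process, so whatever memory it accumulated is returned to the OS.

```python
import multiprocessing
import os

REQUESTS_PER_CHILD = 125  # the recycling threshold mentioned above


def handle_query(query):
    """Hypothetical stand-in for a real PyLucene search."""
    # A real handler would open a searcher and run the query here.
    return os.getpid(), "results for %s" % query


def make_pool(workers=4, requests_per_child=REQUESTS_PER_CHILD):
    """Create a pool whose workers exit after a fixed number of tasks."""
    # "fork" mirrors Apache's pre-fork model; it is only available on Unix.
    ctx = multiprocessing.get_context("fork")
    return ctx.Pool(processes=workers, maxtasksperchild=requests_per_child)


def serve(queries, workers=4, requests_per_child=REQUESTS_PER_CHILD):
    """Run each query in a worker and return (pid, result) pairs."""
    with make_pool(workers, requests_per_child) as pool:
        # chunksize=1 makes every query count as one task toward the limit.
        return pool.map(handle_query, queries, chunksize=1)
```

Calling serve(["a", "b", "c"], workers=1, requests_per_child=2) should show more than one worker PID in the returned pairs, since the first worker is retired after two requests.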

Unfortunately I don't know Twisted very well. Twisted is async, not necessarily forked, but never threaded? Anyway, if you can make Twisted fork off a request and kill it dead afterwards you'd be home free, but I suspect your performance may suffer badly. I would be very interested in taking a look at your application if you manage to "pull it off".
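
To make the "fork off a request and kill it dead" idea concrete, here is a minimal Unix-only sketch. It is untested against Twisted, and run_in_child is a made-up helper name. The child does the work, sends the pickled result back over a pipe, and exits, so any memory it leaked goes away with it. Note that the parent blocks while it waits, so a real Twisted server would need to wait for the child without blocking the reactor.

```python
import os
import pickle


def run_in_child(func, *args):
    """Run func(*args) in a forked child and return its result."""
    read_fd, write_fd = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Child: do the work, send the pickled result, exit immediately.
        status = 1
        try:
            os.close(read_fd)
            with os.fdopen(write_fd, "wb") as out:
                pickle.dump(func(*args), out)
            status = 0
        finally:
            os._exit(status)  # skip normal interpreter cleanup in the child
    # Parent: read the result until EOF, then reap the child.
    os.close(write_fd)
    with os.fdopen(read_fd, "rb") as src:
        payload = src.read()
    _, wait_status = os.waitpid(pid, 0)
    if wait_status != 0 or not payload:
        raise RuntimeError("child process failed")
    return pickle.loads(payload)
```

The cost is one fork per request plus pickling the result, which is where I would expect the performance to suffer.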


best regards
/rune

Happy those, who can remain at Highbury!
Jane Austen (1775-1817)


_______________________________________________
pylucene-dev mailing list
[email protected]
http://lists.osafoundation.org/mailman/listinfo/pylucene-dev
