In his example though, he states there are only two threads, and that even with two being created, he should have plenty of memory to spare (if I am reading it correctly).

On Feb 18, 2008, at 9:23 AM, Paul Elschot wrote:

I think this code has been discussed before, but I can't find
the occasion on which that happened.

Actually, this code may do some things like creating two different
caches, and as you said calling filter.bits(reader) in parallel.

Creating different caches does not hurt because in the end
one only one will survive the assignment,  but it's not nice as
different threads might synchronize on different caches
for the first cache.get() call, and I don't know whether
java guarantees that the cache.get() call will synchronize
on the same cache as the cache in the cache.get()
call in this case.

Doing the filter.bits(reader) call without synchronization
is probably on purpose: it's better not to hold any visible
lock while calling outside an object.

As it stands, I think the only way to make the current version
work correctly is to make whole method synchronized on this
object, i.e. declare it as public synchronized, and then
the synchronized(cache) occurrences can be removed.

It might be better to initialize the cache in the constructor,
and then synchronize on the cache while even while
calling filter.bits(reader). This is safe when the cache is private.

Regards,
Paul Elschot



Op Monday 18 February 2008 13:50:16 schreef mark harwood:
I'm chasing down a bug in my application where multiple threads were readingand caching the same filter (same very common term, big index) and causedan Out of Memory exception when I would expect there to be plenty ofmemory to spare. There's a number of layers to this app to investigate (I was using theXMLQueryParser and the CachedFilter tag too) but CachingWrapperFilterunderpins all this stuff and I was led to this code in it...

  public BitSet bits(IndexReader reader) throws IOException {
    if (cache == null) {
      cache = new WeakHashMap();
    }

    synchronized (cache) {  // check cache
      BitSet cached = (BitSet) cache.get(reader);
      if (cached != null) {
        return cached;
      }
    }

    final BitSet bits = filter.bits(reader);

    synchronized (cache) {  // update cache
      cache.put(reader, bits);
    }

    return bits;
  }


The first observation is - why the use of"final" for the variable "bits" ? Would there be anyside-effects to this?

Perhaps more worryingly I can see that multiple threads asking for the same bitset simultaneously arelikely to unnecessarily read the same data from the same reader (butultimately only one bitset should end up cached). My app only had 2 simultaneous threads on the same reader so I don't see how that accounts for the large memory bloat I saw. In a high traffic environment though, I can see multiple requests for a popular term getting bottle-necked here creating the same bitset and causing an OOM error. It looks like this multiple-load scenario could/should be avoided with some careful synchronisation.

Unfortunately I've been unable to reproduce my OOM problem outside of the live environment so can't fully pinpoint my particular issue or the solution just yet.

Thoughts?
Mark




      __________________________________________________________
Sent from Yahoo! Mail - a smarter inbox http://uk.mail.yahoo.com


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]








---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to