Re: Suggested fix for JDK-4724038 (Add unmap method to MappedByteBuffer)

Peter Levart Wed, 09 Sep 2015 07:46:37 -0700


On 09/09/2015 04:21 PM, Peter Levart wrote:

Hi Uwe,
As I thought, the problem for some seems to be non-prompt unmapping ofmapped address space held by otherwise unreachable mapped bytebuffers. The mapped address space doesn't live in the Java heap anddoesn't represent a heap memory pressure, so GC doesn't kick-inautomatically when one would like. One could help by manuallytriggering GC with System.gc() in such situations. The problem is howto detect such situations. Direct byte buffers(ByteBuffer.allocateDirect) maintain a count of bytes currentlyallocated and don't allow allocation of native memory beyond certainconfigured limit (-XX:MaxDirectMemorySize=<size>). Before throwingOutOfMemoryError, the ByteBuffer.allocateDirect() request tries it'sbest to free direct memory allocated by otherwise unreachable directByteBuffers (using System.gc() to trigger GC and helping processreferences).
Would similar approach - configured limit for FileChannel.map()pedaddress space be of any help to Lucene applications? Is it possible toestimate the max. amount of address space a particular Luceneapplication may need at any one time so that mapping over such limitcould be considered an application error?

Perhaps the number of bytes mapped is not always a correct quantity totrack. Maybe Lucene needs tracking the number of mapped regions orsomething else? I think it would be best to leave to the application todecide and implement the tracking and also triggering GC at times whenit approaches the limit. All that is missing currently fromMappedByteBuffer API for that purpose is a notification to theapplication after it has been unmapped.


Regards, Peter

Regards, Peter

On 09/09/2015 12:51 PM, Uwe Schindler wrote:
Hi,
Dawid Weiss and I are both involved in the Apache Lucene project andwe know the problems with MappedByteBuffer and unmapping. Dawidalready responded with a source code link to our impl (which needs touse the hacky cleaner() approach; also look at the heavydocumentation in this class):https://github.com/apache/lucene-solr/blob/trunk/lucene/core/src/java/org/apache/lucene/store/MMapDirectory.java
So we would be very happy to get this issue resolved! The cleaner()hack is enabled by default in Lucene if the JVM supports it (so wewon't break if JIGSAW prevents this, but our *large* users wouldheavily complain).
This is fundamentally about *integrity* of the runtime. It followsthereare security implications, but it’s still fundamentally anintegrity issue
and guarding an unsafe operation with a Security Manager is
unfortunately an insufficient solution.
Right, and just to add that there has been many attempts over the years
to find solutions to this issue. I think the closest was atomimcally
remapping but that wasn't feasible on all platforms and also didn'tfree
up the address space in a timely manner.
So we should really find a solution here. I was talking with severalpeople on various conferences (Rory O'Donnel or Mark Reinhold) and wehad some ideas how to solve this. My idea how to solve this isexplained below (I am not a JVM internals or Hotspot guy, so excusesome obviously "wrong" assumptions):
Actually there are 2 issues, not only one. The first issue is, asmentioned before: you cannot unmap via API. This is needed for manyapps, including Apache Lucene, for a reason which comes more from"another" bug, and this is my issue #2 (see below).
First, unmapping for Lucene is very important at the moment, becausewe operate on the Lucene indexes purely using mmap (see [1]), whichmay be several hundreds of Gigabytes easily. On highly dynamicsystems, Lucene often maps new files (also very largeones ) andrelies on the fact, that older, deleted files are unmapped in time(this does not need to be ASAP, just "in time"). So we have those 2"bugs", which force us to unmap:
(1) disk space issues / delete after last close (POSIX) vs. No deleteat all (Windows)
- disk space: we have seen customers running out of disk space onLucene, because unmapping wasn’t done in time and therefore POSIXwith delete on last close cannot free the disk space, although thefile was already deleted. The problem you are seeing on Windows thatyou cannot delete, is therefore worse on Linux, because it is hiddento the user - you cannot free the disk space of the deleted file!Lucene creates and deletes files all the time while indexing realtimedata (e.g. think of Github's very dynamic code search index, which isbacked by Lucene/Elasticsearch).- virtual memory: If you map huge files (several hundreds ofGigabytes) and they are not unmapped in time, you may run out ofvirtual address space. This especially affects Windows, because itdoes not use the full 46 bits (or like that) of addresses. Soeffectively you can only map like 4 Terabytes on Windows. If you havefragmentation of address space this gets worse (In Lucene, we map inchunks of 1 GiB because of the signed 32 bit integer limit ofByteBuffer, so fragmentation is not our biggest issue).
(2) It takes veeeeeeeeeeeeeeeery long time until the unmappingactually occurs!
This is the real bug! If the garbage collector would clean up thebuffers asap, we would not need to unmap from user code. In Lucene wejust delay the file delete on Windows, so we are not really affectedby the file deletion inability (but that would be nice if it could befixed).
If you look at the usage pattern of those huge, mapped files, youwill see why they are in most cases *never ever* unmappedautomatically: Lucene maps very large files and uses them for longertime. So the MappedByteBuffer object gets migrated to oldergenerations on the heap. Garbage collection there happens, of course,very delayed. That would not be the most problematic part, but thereis a second issue: The MappedByteBuffer object is just a very smallobject (in heap size measurement: just an object header and a fewpointers), so the garbage collector does not see it as heavy! It'sjust a very small like 30 bytes object instance. Why should theGarbage collector clean it up? And in fact it will almost never dothis! The garbage collector cannot see that our 30 bytes objectinstance "sits" on something like 300 Gigabytes of virtual memory anddisk space!
One proposal to fix this would be to add something like an internalOpenJDK Java Annotation or similar where you can "mark" heavyobjects, so Garbage collector could free them by preference (similarto sun.misc.Contended).
For the Apache Lucene team,
Uwe
[1]http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html
-----
Uwe Schindler
uschind...@apache.org
ASF Member, Apache Lucene PMC / Committer
Bremen, Germany
http://lucene.apache.org/



---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Suggested fix for JDK-4724038 (Add unmap method to MappedByteBuffer)

Reply via email to