Re: unsafe memory access

Andy Seaborne Sat, 22 Jul 2017 12:52:46 -0700


On 22/07/17 18:03, Andy Seaborne wrote:

On 22/07/17 12:10, Dave Reynolds wrote:
# Summary
We've started seeing an unusual JVM error message in some fusekideployments:
java.lang.InternalError: a fault occurred in a recent unsafe memoryaccess operation in compiled Java code
I don't think Jena itself does any Java unsafe operations.

"The web" seems to think that mmap files, and running out of disk spacecan cause this. Or running out of shared mapped space.


Also:
http://users.jena.apache.narkive.com/n6Tddf3t/jena-fuseki-0-2-5-java-lang-internalerror-during-sparql-update

Jetty may well do, especially old architecture Jetty 8.
I don't think this is a Jena issue so not raising a Jira, but ifanyone has seen this and has any workarounds or correlations thatmight help track it down I'd be grateful for any hints.
# Details

This is a largish service [1] with around 400MT, running Fuseki/TDB.
   fuseki1 1.3.0
Fuseki1 uses Jetty 8 which is an old architecture.

Fuseki2 uses Jetty 9.
   java openjdk 1.8.0_131
   ubuntu 16.04
It has been running stably for over a year on AWS EC2 servers. Severalweeks ago we shifted to a newer EC2 instance type (i3.large) which hasfaster (nvme) disks. It has been solid [2] for the last few weeksanswering a standard set of large queries daily. Currently running ontwo load balanced instances.
Then suddenly both servers have started failing with the above errormessage when attempting the same large queries that have been workingup till now. The query is a relatively straightforward select with asort but returns several GB of (streaming) results taking 3-5 minutes- so definitely memory and disk intensive.
This occurred on both instances at that the same time. The onlycorrelation was that we did a system update on those servers the daybefore and this would have been the first time the big queries ransince that. So it seems quite likely that something in the systemupdate has prompted this. However, I can't see any likely culprits -the update included libexpoxy, lxcfs, libc-bin, systemd, ureadahead,man-db, update-initramfs but no java update and we've not changed thefuseki version. The systems both rebooted cleanly after the update.
Searching for mentions of that error generally suggest JVM bugs,running out of memory or disk. The systems have plenty of spare diskand memory. In particular they are 16GB and typically run with 5GBused and nearly all the rest in buff/cache (leaving about 100Mbactually free, which is typical).
A reboot has cleared the errors on both servers and I have no way ofreliably recreating the problem. Might not even come back :) However,worrying enough I thought I'd post it in case anyone else hasseen/sees anything similar.
Dave

[1] http://environment.data.gov.uk/water-quality/view/landing
[2] There is an issue with nvme disks on Unbutu which leads to diskread errors but there's a known work around we've applied and we runquite a few of this server class at the moment without problems. Inany case, none of the errors associated with the disk driver issuesappear in the syslogs for these instances.

Re: unsafe memory access

Reply via email to