So the issue is that memory goes up, that is the heap expands to the
maximum Xmx size set? The JVM does not return any heap back to the OS
(as far as I know), so if all the applications grow their heaps, real
RAM is needed to match, or swapping may result.
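For reference, the committed heap versus the -Xmx ceiling can be watched from inside the JVM. A minimal sketch (the class name is mine): totalMemory() is what the JVM has currently reserved from the OS, maxMemory() is the -Xmx limit it will grow toward.

```java
public class HeapInfo {
    // Heap currently committed from the OS, in MB; grows toward maxMB()
    static long committedMB() { return Runtime.getRuntime().totalMemory() >> 20; }
    // The -Xmx ceiling, in MB; once the heap has grown, it is rarely given back
    static long maxMB()       { return Runtime.getRuntime().maxMemory() >> 20; }

    public static void main(String[] args) {
        System.out.printf("committed=%dMB max=%dMB%n", committedMB(), maxMB());
    }
}
```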
Hi Andy,
thanks for taking the time to help.
The problem is that the NON-HEAP memory usage skyrockets.
I "allocate" memory for the heap. The GC logs suggested that I was
never exceeding 6GB of heap in use, even when things went to hell. So I
set the heap to 10GB.
Now that I know we're using NIO, I "allocate" memory for NIO to hold the
entire index in RAM. The DB is 2.4GB on disk. I don't know NIO well,
but this seems plausible.
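For my own understanding, here is a toy sketch of why NIO-mapped data never shows up in the GC logs (I'm assuming TDB memory-maps its index files; the file path and class name are just illustrative). A mapped region counts against the process's virtual memory, not the Java heap, so it is invisible to -Xmx.

```java
import java.io.File;
import java.io.RandomAccessFile;
import java.nio.MappedByteBuffer;
import java.nio.channels.FileChannel;

public class MappedIndexDemo {
    // Map a 1MB file region into memory and round-trip a byte through it.
    // The mapping lives outside the Java heap, so neither GC logs nor
    // -Xmx accounting will ever reflect it.
    static byte writeAndRead() throws Exception {
        File f = File.createTempFile("demo", ".dat");
        f.deleteOnExit();
        try (RandomAccessFile raf = new RandomAccessFile(f, "rw");
             FileChannel ch = raf.getChannel()) {
            MappedByteBuffer buf = ch.map(FileChannel.MapMode.READ_WRITE, 0, 1 << 20);
            buf.put(0, (byte) 42);
            return buf.get(0);
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(writeAndRead());
    }
}
```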
Let's throw another gig at Java for its own internal use.
That would add up to 10 + 2.4 + 1 = 13.4GB of memory I might expect Java
to use. There's nothing else on the server except Apache, Linux, and a
few system daemons (Postfix, etc.).
I upgraded to 3.7 and put Fuseki on its own AWS instance last night. RAM
was 16GB and swap 10GB.
Once today it filled RAM & swap such that Linux whacked the JVM
process. Two other times today it was swapping heavily (5GB of swap
used), and we restarted Fuseki before the system ran out of swap.
For some reason, the JVM running fuseki+jetty is going nuts with its
memory usage. It *is* using more heap than usual when this happens, but
it's not using more than the 10GB I allocated. At least, not according
to the garbage collection logs.
We have had this problem a few times in the past - memory usage would
spike drastically. We'd always attributed it to a slow memory leak, and
decided we should restart Fuseki regularly. But in the last couple of
weeks it's happened probably a dozen times.
After the third time today, I put it on a 32GB instance. Of course, the
problem hasn't happened since.
A couple of possibilities:
1/ A query does an ORDER BY that involves a large set of results to
sort. This then drives up the heap requirement; the JVM grows the heap,
and now the process is larger. There may well be a CPU spike at this
time.
2/ Updates are building up. The journal isn't flushed to the main
database until there is a quiet moment and with the high query rate
you may get bursts of time when it is not quiet. The updates are safe
in the journal (the commit happened) but also in-memory as an overlay
on the database. The overlays are collapsed when there are no readers
or writers.
What might be happening is that there isn't a quiet moment.
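The overlay idea can be pictured with a toy sketch (this is not TDB's actual code; all names here are illustrative): committed writes accumulate in an in-memory delta over the on-disk base, and only a quiet moment with no active readers or writers lets the delta collapse.

```java
import java.util.HashMap;
import java.util.Map;

public class OverlayStore {
    private final Map<String, String> base = new HashMap<>();    // stand-in for the on-disk DB
    private final Map<String, String> overlay = new HashMap<>(); // committed but uncollapsed updates
    private int activeReaders = 0;

    // A commit is durable (journal) but stays in the in-memory overlay.
    public void put(String k, String v) { overlay.put(k, v); }

    // Reads see the overlay first, then fall through to the base.
    public String get(String k) {
        return overlay.containsKey(k) ? overlay.get(k) : base.get(k);
    }

    public void beginRead() { activeReaders++; }
    public void endRead()   { activeReaders--; maybeCollapse(); }

    // Collapse only at a quiet moment; under constant traffic the overlay
    // (and hence memory use) just keeps growing.
    private void maybeCollapse() {
        if (activeReaders == 0) { base.putAll(overlay); overlay.clear(); }
    }

    public int overlaySize() { return overlay.size(); }

    public static void main(String[] args) {
        OverlayStore s = new OverlayStore();
        s.beginRead();                 // a long-running reader is active
        s.put("city", "Copenhagen");   // commit lands in the overlay
        System.out.println(s.overlaySize());
        s.endRead();                   // quiet moment: overlay collapses into the base
        System.out.println(s.overlaySize());
    }
}
```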
The traffic is certainly steady - it was about 1500 hits/minute today
when we first crashed.
A big, sudden jump would imply a big update as well.
Set the log to INFO (and, yes, under load it does get big).
What you are looking for is overlapping queries/updates, so that the log
shows truly concurrent execution (i.e. [1] starts, [2] starts, [1]
finishes logged after [2] starts) around the time the size grows
quickly; also check the size of the updates.
I will look for this. I am dubious, though. We don't make many writes,
and those we do are not very big. Our dataset is metadata about our
archive. The archive is 50 years old, and grows steadily but slowly.
We had disabled the Fuseki log but left httpd logging enabled, because
each was huge. Unfortunately the updates were all in POSTs, which I
hadn't noticed until I went looking just now. So I will have to wait
until next time.
thanks
danno