I'm having an issue with Graylog continuously falling behind with log 
processing, and the MasterCache filling up til the 10G of Heap Space maxes 
out and crashes. The really weird thing is that a week ago, everything was 
processing fine and I was taking between 1500-2000 msg/s. Now I barely get 
over 500-750 msg/s. I don't think ElasticSearch is the issue because none 
of the OutputCache or Buffer is increasing.

I'm wondering if it has something to do with this: Number of indices (80) 
higher than limit (20). Running retention for 60 indices. It doesn't look 
like Graylog is properly rotating indexes and running this retention 
instead. 

After restarting graylog2 and emptying cache...
[util][caches][2014-05-06T08:46:04.850-07:00] InputCache size: 5758
[util][caches][2014-05-06T08:46:04.850-07:00] OutputCache size: 0
[util][buffers][2014-05-06T08:46:04.850-07:00] OutputBuffer is at 0.0%. 
[0/2048]
[util][buffers][2014-05-06T08:46:04.850-07:00] ProcessBuffer is at 
33.251953%. [681/2048]
[util][heap][2014-05-06T08:46:04.850-07:00] Used memory (MB): 1465
[util][heap][2014-05-06T08:46:04.850-07:00] Free memory (MB): 8330
[util][heap][2014-05-06T08:46:04.850-07:00] Total memory (MB): 9814
[util][heap][2014-05-06T08:46:04.850-07:00] Max memory (MB): 9814
[util][written][2014-05-06T08:46:04.850-07:00] Messages written to all 
outputs: 1561


After MasterCache fills up a bit
[util][caches][2014-05-06T08:42:18.109-07:00] InputCache size: 2487587
[util][caches][2014-05-06T08:42:18.109-07:00] OutputCache size: 0
[util][buffers][2014-05-06T08:42:18.109-07:00] OutputBuffer is at 0.0%. 
[0/2048]
[util][buffers][2014-05-06T08:42:18.109-07:00] ProcessBuffer is at 
40.429688%. [828/2048]
[util][heap][2014-05-06T08:42:18.109-07:00] Used memory (MB): 6392
[util][heap][2014-05-06T08:42:18.109-07:00] Free memory (MB): 3736
[util][heap][2014-05-06T08:42:18.109-07:00] Total memory (MB): 10129
[util][heap][2014-05-06T08:42:18.109-07:00] Max memory (MB): 10129
[util][written][2014-05-06T08:42:18.109-07:00] Messages written to all 
outputs: 3100


ES Node config: (GLNode0 is the Graylog server). I know mlockall is false, 
and is configured to be true, but these are virtualized servers and there 
are some issues there.

{
  "ok" : true,
  "cluster_name" : "Graylog2",
  "nodes" : {
    "X.X.X.X" : {
      "name" : "GLNode1",
      "transport_address" : "inet[/X.X.X.X:9300]",
      "hostname" : "X.X.X.X",
      "version" : "0.90.10",
      "http_address" : "inet[/X.X.X.X:9200]",
      "attributes" : {
        "master" : "true"
      },
      "process" : {
        "refresh_interval" : 1000,
        "id" : 1611,
        "max_file_descriptors" : 32000,
        "mlockall" : false
      }
    },
    "X.X.X.X" : {
      "name" : "GLNode0",
      "transport_address" : "inet[/X.X.X.X:9350]",
      "hostname" : "X.X.X.X",
      "version" : "0.90.10",
      "attributes" : {
        "client" : "true",
        "data" : "false",
        "master" : "false"
      },
      "process" : {
        "refresh_interval" : 1000,
        "id" : 28382,
        "max_file_descriptors" : 4096,
        "mlockall" : false
      }
    },
    "X.X.X.X" : {
      "name" : "GLNode2",
      "transport_address" : "inet[/X.X.X.X:9300]",
      "hostname" : "X.X.X.X",
      "version" : "0.90.10",
      "http_address" : "inet[/X.X.X.X:9200]",
      "attributes" : {
        "master" : "false"
      },
      "process" : {
        "refresh_interval" : 1000,
        "id" : 4508,
        "max_file_descriptors" : 32000,
        "mlockall" : false
      }
    }
  }
}

-- 
You received this message because you are subscribed to the Google Groups 
"graylog2" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to