Hello, We are using graylog-collector v0.4.2 on AIX 7.1 and we can start the logger and it works for awhile (couple of hours) and then it suddenly stops sending logs with no error messages in the logs..
I also changed the COLLECTOR_JAVA_DEFAULTS to -Xms512m -Xmx768m but no luck. On the graylog-server the collector is visible while it is working, when it stops sending logs the collector is no longer visible on the server side but the graylog-collector process is still running in AIX with no errors in logs. I have to kill the process (kill -9 since kill -15 does not stop it) and then restart the process. Any idea how to troubleshoot the cause of this? *The collector.conf consists of this..* server-url = "http://172.16.x.x:12900" enable-registration = true collector-id = "file:config/collector-id" inputs { 1 { type = "file" path = "/usr1/dbs/1.lg" outputs = "gelf-tcp" } 2 { type = "file" path = "/usr1/dbs/2.lg" outputs = "gelf-tcp" } 3 { type = "file" path = "/usr1/dbs/3.lg" outputs = "gelf-tcp" } 4 { type = "file" path = "/usr1/dbs/4.lg" outputs = "gelf-tcp" } 5 { type = "file" path = "/usr1/dbs/5.lg" outputs = "gelf-tcp" } 6 { type = "file" path = "/usr1/dbs/6.lg" outputs = "gelf-tcp" } 7 { type = "file" path = "/usr1/dbs/7.lg" outputs = "gelf-tcp" } 8 { type = "file" path = "/usr1/dbs/8.lg" outputs = "gelf-tcp" } } outputs { gelf-tcp { type = "gelf" host = "172.16.x.x" port = 12201 client-tls = false client-queue-size = 5120 client-connect-timeout = 10000 client-reconnect-delay = 2000 client-tcp-no-delay = true client-send-buffer-size = 32768 } } *In the log all we see is...* [main] cli.commands.Run - Starting Collector v0.4.2 (commit 2609a38) [main] cli.commands.Run - Running on AIX AIX 7.1 (ppc64) [main] collector.utils.CollectorId - Collector ID: 19c8fecb-0f04-4503-9943-15cfbac98458 [main] outputs.gelf.GelfOutput - Starting GELF transport: org.graylog2.gelfclient.GelfConfiguration@3c8510bd [main] cli.commands.Run - Service RUNNING: BufferProcessor [RUNNING] [main] cli.commands.Run - Service RUNNING: FileObserver [RUNNING] [main] cli.commands.Run - Service RUNNING: MetricService [RUNNING] [main] cli.commands.Run - Service RUNNING: MemoryReporterService [RUNNING] [main] cli.commands.Run - Service RUNNING: HeartbeatService [RUNNING] [main] cli.commands.Run - Service RUNNING: FileInput{id='1', path-set='SinglePathSet{path=/usr1/dbs/1.lg}', reader-buffer-size='102400', message-fields='MessageFields{}', charset='UTF-8', outputs='gelf-tcp', content-splitter='NEWLINE', reader-interval='100'} [main] cli.commands.Run - Service RUNNING: FileInput{id='2', path-set='SinglePathSet{path=/usr1/dbs/2.lg}', reader-buffer-size='102400', message-fields='MessageFields{}', charset='UTF-8', outputs='gelf-tcp', content-splitter='NEWLINE', reader-interval='100'} [main] cli.commands.Run - Service RUNNING: FileInput{id='3', path-set='SinglePathSet{path=/usr1/dbs/3.lg}', reader-buffer-size='102400', message-fields='MessageFields{}', charset='UTF-8', outputs='gelf-tcp', content-splitter='NEWLINE', reader-interval='100'} [main] cli.commands.Run - Service RUNNING: FileInput{id='4', path-set='SinglePathSet{path=/usr1/dbs/4.lg}', reader-buffer-size='102400', message-fields='MessageFields{}', charset='UTF-8', outputs='gelf-tcp', content-splitter='NEWLINE', reader-interval='100'} [main] cli.commands.Run - Service RUNNING: FileInput{id='5', path-set='SinglePathSet{path=/usr1/dbs/5.lg}', reader-buffer-size='102400', message-fields='MessageFields{}', charset='UTF-8', outputs='gelf-tcp', content-splitter='NEWLINE', reader-interval='100'} [main] cli.commands.Run - Service RUNNING: FileInput{id='6', path-set='SinglePathSet{path=/usr1/dbs/6.lg}', reader-buffer-size='102400', message-fields='MessageFields{}', charset='UTF-8', outputs='gelf-tcp', content-splitter='NEWLINE', reader-interval='100'} [main] cli.commands.Run - Service RUNNING: GelfOutput{port='12201', id='gelf-tcp', client-send-buffer-size='32768', host='172.16.x.x', inputs='', client-reconnect-delay='2000', client-connect-timeout='10000', client-tcp-no-delay='true', client-queue-size='5120'} [main] cli.commands.Run - Service RUNNING: FileInput{id='7', path-set='SinglePathSet{path=/usr1/dbs/7.lg}', reader-buffer-size='102400', message-fields='MessageFields{}', charset='UTF-8', outputs='gelf-tcp', content-splitter='NEWLINE', reader-interval='100'} [main] cli.commands.Run - Service RUNNING: FileInput{id='8', path-set='SinglePathSet{path=/usr1/dbs/8.lg}', reader-buffer-size='102400', message-fields='MessageFields{}', charset='UTF-8', outputs='gelf-tcp', content-splitter='NEWLINE', reader-interval='100'}', reader-interval='100'} -- You received this message because you are subscribed to the Google Groups "Graylog Users" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/graylog2/3806ada3-9eb1-4126-a415-d7c0bf997ae1%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
