Our new Graylog instance has been running for awhile without so much as a
hiccup. Recently, I added new log sources from a Security Onion sensor
containing BRO and Suricata logs. It doesn't appear these new inputs have
caused any noticeable load on the system, at least not until I run them
through a pipeline processor. I am using grok and standard regex functions
in the pipeline rules to parse out bro_conn, bro_dns, etc.
Today, I noticed the output stopped during what I would consider peak load
around 11:30a<ish> CST and to the best of my recollection no changes had
been made directly proceeding the stoppage.
As I have been adding pipeline rules to parse out messages, it seems
something happens where logs stop writing to the elasticsearch nodes. I
don't see anything in the server.log that looks like a smoking gun If I
restart the graylog-server.service the logs will not begin to clear from
the output buffer. However, If I stop the graylog-server.service and then
start it, logs begin to flow again. I do not have to restart any other
service after the manual stop/start of graylog.
The only log I see that seems like it would be related is below. However, I
am not sure if it is relevant.
2016-12-07T12:27:53.601-06:00 WARN [DeadEventLoggingListener] Received
unhandled event of type <org.graylog2.plugin.lifecycles.Lifecycle> from
event bus <AsyncEventBus{graylog-eventbus}>
System info:
top - 15:07:03 up 18 days, 22:46, 1 user, load average: 0.68, 1.06, 1.12
Tasks: 420 total, 1 running, 419 sleeping, 0 stopped, 0 zombie
%Cpu(s): 5.3 us, 1.6 sy, 0.0 ni, 93.0 id, 0.0 wa, 0.0 hi, 0.0 si,
0.0 st
KiB Mem : 49282804 total, 34329916 free, 11754760 used, 3198128 buff/cache
KiB Swap: 1048572 total, 1048572 free, 0 used. 37276428 avail Mem
Graylog 2.1.1+01d50e5 starting up
JRE: 1.8.0_111 on Linux 3.10.0-327.36.3.el7.x86_64
OS: CentOS Linux 7 (Core) amd64
JVM arguments: -Xms8g -Xmx8g -XX:NewRatio=1 -XX:+ResizeTLAB
-XX:+UseConcMarkSweepGC -XX:+CMSConc currentMTEnabled
-XX:+CMSClassUnloadingEnabled -XX:+UseParNewGC
-XX:-OmitStackTraceInFastThrow
-Dlog4j.configurationFile=file:///etc/graylog/server/log4j2.xml
-Djava.library.path=/opt/graylog-server/lib/sigar
-Dgraylog2.installation_source=unknown
transparent_hugepage=false
I noticed when checking messages against the pipeline simulator, I see
inconsistent results in the execution times. I see this from the same
messages and I also see it with different messages at different times so it
isn't something I can reproduce at will.
Sample message:
<13>1 2016-12-08T02:36:57+00:00 BROsensor bro_notice - - -
1481164616.440593|CYxMwXBbL0TvQQKm9|10.10.10.10|51841|208.10.10.10|443|-|-|-|tcp|SSL::Invalid_Server_Cert|SSL
certificate validation failed with (self signed certificate in certificate
chain)|[email protected],CN=ADN,OU=ADN,O=External,L=Toledo,ST=Ohio,C=US|10.10.10.10|208.10.10.10|443|-|BROsensor-eth1-1|Notice::ACTION_LOG|3600.000000|F|-|-|-|-|-
Sample rule using grok:
rule "Extract bro_notice log fields"
when
has_field("message") AND
contains(value: to_string($message.application_name), search: "bro_notice"
, ignore_case: true)
then
let m = grok(
"^(?<ts>%{NUMBER}|-).?(?<uid>%{WORD}|-).?(?<id_orig_h>%{IP}|-).?(?<id_orig_p>%{INT}|-).?(?<id_resp_h>%{IP}|-).?(?<id_resp_p>%{INT}|-).?(?<fuid>%{WORD}|-).?(?<file_mime_type>%{WORD}/%{WORD}|-).?(?<file_desc>%{WORD}|-).?(?<proto>%{WORD}|-).?(?<note>%{WORD}::%{WORD}|-).?(?<msg>%{BRO_URL}).?(?<sub>%{BRO_URL}).?(?<src>%{IP}|-).?(?<dst>%{IP}|-).?(?<p>%{INT}|-).?(?<n>%{INT}|-).?(?<peer_desc>%{HOSTNAME}|-).?(?<actions>%{WORD}::%{WORD}|-).?(?<suppress_for>%{INT}\\.%{INT}|-).?(?<dropped>[TF]).?(?<remote_location_country_code>%{WORD}|-).?(?<remote_location_region>%{WORD}|-).?(?<remote_location_city>%{WORD}|-).?(?<remote_location_latitude>%{INT}\\.%{INT}|-).?(?<remote_location_longitude>%{INT}\\.%{INT}|-).?$"
, to_string($message.message), true);
set_fields(m);
end
Simulation results test 1:
These are the results of processing the loaded message. Processing took 54,
366 µs.
1 μs
Starting message processing
51 μs
Message c93a5361-bcee-11e6-9154-78e7d17bef2e running [Pipeline 'Bro IDS
Logs' (58464b367d5d445614cc4f27)] for streams [584195567d5d44076c4160f7]
84 μs
Enter Stage 0
90 μs
Evaluate Rule 'Extract bro_conn log fields' (584190b57d5d44076c415c14)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
105 μs
Evaluation not satisfied Rule 'Extract bro_conn log fields' (
584190b57d5d44076c415c14) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
107 μs
Evaluate Rule 'Extract bro_dns log fields' (5843002a7d5d44076c42de64) in
Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
116 μs
Evaluation not satisfied Rule 'Extract bro_dns log fields' (
5843002a7d5d44076c42de64) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
119 μs
Evaluate Rule 'Extract bro_http log fields' (584372dc7d5d443ea323761e)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
126 μs
Evaluation not satisfied Rule 'Extract bro_http log fields' (
584372dc7d5d443ea323761e) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
129 μs
Evaluate Rule 'Extract bro_tunnels log fields' (58476fb77d5d445614cd828a
) in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
136 μs
Evaluation not satisfied Rule 'Extract bro_tunnels log fields' (
58476fb77d5d445614cd828a) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
138 μs
Evaluate Rule 'Extract bro_notice log fields' (584795807d5d445614cdaa4e)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
147 μs
Evaluation satisfied Rule 'Extract bro_notice log fields' (
584795807d5d445614cdaa4e) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
150 μs
Evaluate Rule 'Extract bro_weird log fields' (5847a07f7d5d445614cdb5e2)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
157 μs
Evaluation not satisfied Rule 'Extract bro_weird log fields' (
5847a07f7d5d445614cdb5e2) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
161 μs
Evaluate Rule 'Extract bro_software log fields' (
5847b6017d5d445614cdcc85) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
168 μs
Evaluation not satisfied Rule 'Extract bro_software log fields' (
5847b6017d5d445614cdcc85) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
179 μs
Execute Rule 'Extract bro_notice log fields' (584795807d5d445614cdaa4e)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
54,335 μs
Completed Stage 0 for Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27
), continuing to next stage
54,345 μs
Exit Stage 0
54,366 μs
Finished message processing
Simulation results test 2:
These are the results of processing the loaded message. Processing took 381
µs.
3 μs
Starting message processing
36 μs
Message ead67e30-bcef-11e6-9154-78e7d17bef2e running [Pipeline 'Bro IDS
Logs' (58464b367d5d445614cc4f27)] for streams [584195567d5d44076c4160f7]
62 μs
Enter Stage 0
67 μs
Evaluate Rule 'Extract bro_conn log fields' (584190b57d5d44076c415c14)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
80 μs
Evaluation not satisfied Rule 'Extract bro_conn log fields' (
584190b57d5d44076c415c14) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
82 μs
Evaluate Rule 'Extract bro_dns log fields' (5843002a7d5d44076c42de64) in
Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
88 μs
Evaluation not satisfied Rule 'Extract bro_dns log fields' (
5843002a7d5d44076c42de64) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
91 μs
Evaluate Rule 'Extract bro_http log fields' (584372dc7d5d443ea323761e)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
97 μs
Evaluation not satisfied Rule 'Extract bro_http log fields' (
584372dc7d5d443ea323761e) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
100 μs
Evaluate Rule 'Extract bro_tunnels log fields' (58476fb77d5d445614cd828a
) in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
106 μs
Evaluation not satisfied Rule 'Extract bro_tunnels log fields' (
58476fb77d5d445614cd828a) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
109 μs
Evaluate Rule 'Extract bro_notice log fields' (584795807d5d445614cdaa4e)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
116 μs
Evaluation satisfied Rule 'Extract bro_notice log fields' (
584795807d5d445614cdaa4e) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
120 μs
Evaluate Rule 'Extract bro_weird log fields' (5847a07f7d5d445614cdb5e2)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
134 μs
Evaluation not satisfied Rule 'Extract bro_weird log fields' (
5847a07f7d5d445614cdb5e2) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
136 μs
Evaluate Rule 'Extract bro_software log fields' (
5847b6017d5d445614cdcc85) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
142 μs
Evaluation not satisfied Rule 'Extract bro_software log fields' (
5847b6017d5d445614cdcc85) in Pipeline 'Bro IDS Logs' (
58464b367d5d445614cc4f27)
148 μs
Execute Rule 'Extract bro_notice log fields' (584795807d5d445614cdaa4e)
in Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27)
369 μs
Completed Stage 0 for Pipeline 'Bro IDS Logs' (58464b367d5d445614cc4f27
), continuing to next stage
372 μs
Exit Stage 0
381 μs
Finished message processing
I am open to suggestions for where to look next.
Regards,
Brandon
--
You received this message because you are subscribed to the Google Groups
"Graylog Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/graylog2/7ebd7e00-8152-4efe-9dd8-54554091b083%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.