Edward Zhang created EAGLE-2:
--------------------------------

             Summary: watch message process backlog in Eagle UI
                 Key: EAGLE-2
                 URL: https://issues.apache.org/jira/browse/EAGLE-2
             Project: Eagle
          Issue Type: Improvement
         Environment: production
            Reporter: Edward Zhang


Message latency is a key factor for Eagle to enable realtime security 
monitoring. For hdfs audit log monitoring, kafka is used as datasource. So 
there is always some gap between current max offset in kafka and processed 
offset in eagle. The gap is the backlog which eagle should consume quickly as 
much as quickly. If the gap can be sampled for every minute or 20 seconds, then 
we understand if eagle is catching up or is lagging behind more.

The command to get current max offset in kafka is 
bin/kafka-run-class.sh kafka.tools.GetOffsetShell --broker-list xxxx --topic 
hdfs_audit_log --time -1

and Storm-kafka spout would store processed offset in zookeeper, in the 
following znode:
/consumers/hdfs_audit_log/eagle.hdfsaudit.consumer/partition_0 

So technically we can get the gap and write that to eagle service then in UI we 
can watch the backlog



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to