Re: Architecture improvement discussion

Sidharth Kumar Thu, 25 Oct 2018 03:24:02 -0700

Hi Edward,


Can we not have enable/disable features to decide the users whether they
want them or not.

Warm Regards

Sidharth Kumar | Mob: +91 8197 555 599| Skype: itzsidharth


On Thu 25 Oct, 2018, 3:30 PM Zhao Qingwen, <[email protected]> wrote:

> Hi Edward,
>
> In the new architecture, is the storage(hbase) taken off?
> How do the adaptors store the data? for example, hadoop namenode metrics.
>
> Best Regards,
> Qingwen Zhao | 赵晴雯
>
>
>
>
>
> Edward Zhang <[email protected]> 于2018年10月12日周五 上午10:49写道：
>
> > Hi Eaglers,
> >
> > I would like to start some discussion about architecture improvement for
> > Apache Eagle based on community experience and feedback. The improvement
> is
> > targeted for simplifying installation and development of Apache Eagle.
> >
> > Eagle's main responsibility is to report abnormality instantly by
> applying
> > policies on streaming data. Eagle consists of two major components,
> Policy
> > Engine and Adaptors. Policy Engine is a standalone application which
> > provides REST API to manage policy lifecycle for different data sources
> and
> > provides runtime to evaluate policy on streaming data.  Adaptors are
> those
> > applications which fetch/process log/metrics from outside and send data
> to
> > policy engine for alerting purpose.
> >
> > But right now in Eagle code base, it is not clearly focusing on the two
> > components. For example the current source code includes map/reduce
> > job/task log retrieval/cleanup/analysis which is very useful but probably
> > Eagle only needs the portion of data retrieval/cleanup part and so data
> can
> > be streamed into policy engine for alerting purpose. For job/task
> analysis
> > part, it can be maintained in other project.
> >
> > First let me list the main modules Eagle source code consists of.
> > - eagle core
> >     - policy engine (coordinator, runtime, and web)
> >     - monitor application management
> >     - eagle query framework - for querying time series data from hbase
> > - eagle adaptors
> >      - gc log fetch/processing and alerting
> >      - metric fetch/processing and alerting, including name node, data
> > node, hbase etc.
> >      - jpm: job performance management.
> >             - haoop yarn queue statistics fetch/processing
> >             - hadoop mapreduce history job log processing
> >             - hadoop mapreduce running job processing
> >             - spark history job log processing
> >             - spark running job processing
> >             - jpm web application
> >             - hadoop job analyzer
> >        - security monitoring
> >              - hdfs audit log fetch/processing
> >              - hdfs auth log fetch/processing
> >              - hbase audit log fetch/processing
> >              - hive log fetch/processing
> >              - maprfs audit log fetch/processing
> >              - oozie audit log fetch/processing
> >         - hadoop topology stats fetch/processing
> > - eagle server
> >
> > It is very obvious that it is not scale for Eagle community to maintain
> so
> > large amount of monitoring adaptors especially when Hadoop/Spark versions
> > are evolving pretty fast.
> >
> > My suggestion is Eagle ONLY focus on policy engine and some default
> > important adaptors but remove/separate some unrelated functionalities.
> For
> > policy engine, it would be nice if it can run on popular streaming engine
> > besides Apache Storm so that it can be easily deployed for community
> users.
> > For default important adaptors, I may suggest Eagle have ONLY HDFS audit
> > log, Hadoop running job, Spark running job, HDFS namenode metrics etc.
> For
> > unrelated functionalities, we can either remove them from Eagle code base
> > or separate them into standalone executable if that is still really
> needed
> > under Apache Eagle monitoring umbrella by community.
> >
> > So the proposed Eagle code base would be like:
> > - policy engine
> >      - coordinator
> >      - runtime
> >      - web
> > - adaptors
> >     - hdfs audit log
> >     - Hadoop running job
> >     - Spark running job
> >     - HDFS namenode metrics
> >     - Hadoop yarn queue metrics
> > - extensions (some non default adaptors contributed by community)
> > - executables (standalone executables which are legacy)
> >
> > It would be great if you can provide more feedback on this discussion.
> >
> > (By the way, I also had a lot of discussion with Hao, Chen, Eagle PMC
> > member and core developer about this topic based on his experience of
> > engaging Eagle users.)
> >
> > Thanks
> > Edward
> >
>

Re: Architecture improvement discussion

Reply via email to