Re: Directly create chukwa records?

James Seigel Sat, 27 Feb 2010 13:29:17 -0800

Awesome!

Sent from my mobile. Please excuse the typos.


On 2010-02-27, at 1:52 PM, "Eric Yang" <ey...@yahoo-inc.com> wrote:

There will be a converter from sequence file to other file format.If a new
file format has been decided to replace sequence file.

Regards,
Eric


On 2/26/10 8:04 PM, "James Seigel" <ja...@tynt.com> wrote:
It sounds like there is some exciting work being done on the demuxprocess.I was just wondering if you are planning to be backwards compatiblewith 0.3
format for /repos as you move forward .

Cheers
james


On 2010-02-26, at 10:38 AM, Eric Yang wrote:
On 2/26/10 4:43 AM, "Guillermo Pérez" <bi...@tuenti.com> wrote:
One related thing is that I want to modify the "cluster" where weputthe files, because we will receive syslog data with several typesofevents that we want to store in different clusters to analyze,backup,
archive separately. I have seen that you can modify the
Record.tagsField and that we use a regexp for extracting the
destination cluster. This is a bit akward, isn't? I don't want tokeepa tagsField just for that. I'm using a field "event_type" and Ihave
modified the extraction/engine/RecordUtil.java, so if that field
exists, "event_" + <event_type> will be used as cluster. This isthe
proper way to go, or there is a better solution for this?.
I don't think you need to modify RecordUtil.java for thispurpose. Thebackfill java program is taking first parameter as cluster.Hence, youcould easily change event_type as the first parameter before youbackfill.
Another question is where I could start looking on how to build
reports and aggregated results of the custom ChukwaRecords I'm
inserting.
There is currently no formal solution to generate report fromChukwaRecords.There is org.apache.hadoop.chukwa.dataloader.MetricDataLoaderwhich loadsChukwaRecords into mysql database base on mdl.xml file. Afterdata isloaded, you could use hicc.sh to start the webserver, andvisualize the datain Chukwa SQL Client widget. However, I must warn you thatMetricDataLoaderis deprecated, and the future plan to generate report fromChukwaRecords is
as follow:
Having a post demux data loader which wait to receive newChukwaRecordsfiles, and merge with the existing ChukwaRecords files through asecond MRjob. The second MR job also produces low resolution of the datafor report.
/chukwa/repos/TYPE/DATE <-- Original data goes here.
/chukwa/report/TYPE/[yearly,monthly,weekly,daily] <-- SummarizedJSON data
goes here.
The report JSON will be fixed to 300 data points per series,optimized for
graphing.  I am taking it slow on the actual implementation because
ChukwaRecords should be move to a faster seralization format.It's another
area that needs to be improved for the future plan to work.

Regards,
Eric
James Seigel
ja...@tynt.com
http://www.tynt.com
Captain Hammer

Re: Directly create chukwa records?

Reply via email to