Hi, Sorry for replying late I was trying with what you have suggested. Yes it worked for me. Rotation factor increased my file size but now have other issue :)
@Issue : When chukwa demuxer gets the log for the processing , it is getting distributed in 2 directories : 1) After correct processing , it generates .evt files. 2) Chuwa parser does not parse the data properly and end up giving ..InError directory. Rotation Time : 5 min to 1 Hour 1. SYSTEM LOGS Log File used : message1 Datatype used : SysLog Error : java.text.ParseException: Unparseable date: "y 4 06:12:38 p" 2. Hadoop Logs Log File Used : Hadoop datanode logs , Hadoop TaskTracker logs Datatype Used : HadoopLog Error : java.text.ParseException: Unparseable date: "0 for block blk_1617125" 3. Chuwa Agent Logs Log File Used : Chuwa Agent logs Datatype Used : chuwaAgent Error : org.json.JSONException: A JSONObject text must begin with '{' at character 1 of post thread ChukwaHttpSender - collected 1 chunks I am wondering why data is getting into these INError directory. Is there any way we can get correct evt files after demuxing rather than these INError.evt files. Thanks Stuti From: Jerome Boulon [mailto:jbou...@netflix.com] Sent: Thursday, May 27, 2010 1:01 AM To: chukwa-user@hadoop.apache.org Subject: Re: Problem in chukwa output Hi, The demux is grouping you data per date/hour/TimeWindow so yes, 1 .done file could be split into multiple .evt file depending on the content/timestamp of your data. Generally, if you have a SysLogInError directory, it's because the parser throws an exception and you should have some files in there. You may want to take a look at this wiki page to get an idea of Demux data flow. http://wiki.apache.org/hadoop/Chukwa_Processes_and_Data_Flow Regards, /Jerome. On 5/26/10 10:55 AM, "Stuti Awasthi" <stuti_awas...@persistent.co.in> wrote: Hi all, I am facing some problems in chukwa output. The following are the process flow in Collector : I worked with single .done file of 16MB in size for the analysis 1) Logs were collected in /logs directory. 2) After demux processing the output was stored in /repos directory. Following is the structure inside repos: /repos /SysLog Total Size : 1MB /20100503/ *.evt /20100504/*.evt /SysLogInError Total Size : 15MB /..../*.evt I have 2 doubts : I noticed that my single log file was spilt into multiple .evt file. My output file contained 2 folders inside / SysLog .Is this the correct behaviour that a single .done file is split into n number of .evt files and in different directory structure? There was a directory of SysLogInError generated but there was no ERROR in the log file. I was not sure when this directory gets created? Any pointers will be helpful. Thanks Stuti DISCLAIMER ========== This e-mail may contain privileged and confidential information which is the property of Persistent Systems Ltd. It is intended only for the use of the individual or entity to which it is addressed. If you are not the intended recipient, you are not authorized to read, retain, copy, print, distribute or use this message. If you have received this communication in error, please notify the sender and delete all copies of this message. Persistent Systems Ltd. does not accept any liability for virus infected mails. DISCLAIMER ========== This e-mail may contain privileged and confidential information which is the property of Persistent Systems Ltd. It is intended only for the use of the individual or entity to which it is addressed. If you are not the intended recipient, you are not authorized to read, retain, copy, print, distribute or use this message. If you have received this communication in error, please notify the sender and delete all copies of this message. Persistent Systems Ltd. does not accept any liability for virus infected mails.