Thanks. I will try that out. Any idea on the second one - What will be the pig syntax for using this function "getDuration();
Thanks, Sumit. ________________________________ From: Dmitriy Ryaboy <[email protected]> To: [email protected] Sent: Fri, 22 April, 2011 11:22:54 PM Subject: Re: Getting the total Mapred time I may be misunderstanding what you are asking. The tricky part is measuring MR time *without* wait time, which one cannot control (it depends mostly on the size and utilization level of your cluster). This tricky bit is what PigStats helps you with. If you just want to measure the full time, including wait time, you can just time the process like you would time anything else. Use the "time" unix command, or if you are in Java, use System.currentTimeMillis() before you start executing and again when the procedure returns. On Thu, Apr 21, 2011 at 10:18 PM, sumit ghosh <[email protected]> wrote: > Hi, > > How to get the actual time spent in doing all the map-reduce operations > while > executing a pig script. It should exclude the time wait for the scheduler - > and > any other waiting time. Please help. > > Another question is - how to use the function getDuration() in: > >http://pig.apache.org/docs/r0.8.0/api/org/apache/pig/tools/pigstats/PigStats.html >l > > Can you please help me with the Pig Statements. > > Thanks, > Sumit >
