Benjamin Reed
Fri, 04 Apr 2008 11:30:23 -0700
The statistics are not updated in a thread safe way. They are global statistics, so they will be across jobs, and since they aren't thread safe they may be wrong. Other than the numbers I think that the rest should be thread safe assuming that the underlying Hadoop code is thread safe, which it looks to be. I would think for your application the stats should really be attached to an object that represents the store or dump method object right? (Or at least accessible through that object.) ben Michael Harris wrote: > Hello, > > > > I have written a pig application that does a fixed set of queries > on-demand through a web interface. I am trying to get the progress of > the queries from the PigServer, but I have noticed that the source of > the progress data is all static fields in the MapReduceLauncher. Clearly > my webapp must be able to handle multiple concurrent pig queries (and be > thread-safe) and I would like to report the progress of each individual > query (job set) to the end user. Do these static fields indicate that I > would get the progress of multiple concurrent queries initiated by > different PigServer instances? or would I get the overall progress of > the MapReduceLauncher for all queries currently being executed? > > > > Thanks, > Michael > > >