Rohini: We are still looking into that. The file I named ‘output7’ in this thread is used as input to the next DAG. We’re still analyzing how it may be different in the two environments, if at all. In that DAG, although number of input records is the same, output records diverges. We’re looking into why that is.
Thanks, Kurt From: Rohini Palaniswamy <[email protected]<mailto:[email protected]>> Reply-To: "[email protected]<mailto:[email protected]>" <[email protected]<mailto:[email protected]>> Date: Thursday, May 5, 2016 at 11:46 AM To: "[email protected]<mailto:[email protected]>" <[email protected]<mailto:[email protected]>> Subject: Re: data discrepancies related to parallelism Haven't seen this before. The Pig's stats output counters seem to be exactly same for records. Which output do you the see the data being incorrect? On Thu, May 5, 2016 at 11:23 AM, Hitesh Shah <[email protected]<mailto:[email protected]>> wrote: Thanks for the info, Kurt. You may wish to post this question to the Pig lists too to see if anyone has seen this. — Hitesh > On May 5, 2016, at 11:05 AM, Kurt Muehlner > <[email protected]<mailto:[email protected]>> wrote: > > Hi Hitesh, > > We are using Pig 0.15.0 and Tez 0.8.2. > > Thanks, > Kurt > > > > On 5/5/16, 11:00 AM, "Hitesh Shah" > <[email protected]<mailto:[email protected]>> wrote: > >> What version are you running with? >> >> thanks >> — Hitesh
