Could you please forward the script and the job counters? The cluster size (number of map and reduce slots) would be helpful too.
Thanks,
Prashant

On Mon, Apr 2, 2012 at 5:27 PM, sonia gehlot <[email protected]> wrote:
> Hi,
>
> I have a really large data set of about 10 to 15 billion rows. I wanted to
> do some aggregates like sum, count distinct, max etc but this is taking
> forever to run the script.
>
> What hints or properties should I set to improve performance.
>
> Please let me know.
>
> Thanks,
> Sonia
