Can you please forward the script and the job counters? The cluster size
(number of map and reduce slots) would be good too.

Thanks,
Prashant

On Mon, Apr 2, 2012 at 5:27 PM, sonia gehlot <[email protected]> wrote:

> Hi,
>
> I have a really large data set of about 10 to 15 billion rows. I wanted to
> do some aggregates like sum, count distinct, max, etc., but the script is
> taking forever to run.
>
> What hints or properties should I set to improve performance?
>
> Please let me know.
>
> Thanks,
> Sonia
>
