Hi Sandy,

Crunch uses something similar to Hadoop's MultipleOutputFormat so that a
single job can write multiple outputs in multiple formats. As a result,
output records are tracked in different counters from the usual ones, since
a single job can have multiple outputs (and therefore multiple counters).

The main implementation class for this is o.a.c.io.CrunchOutputs. The
counters that hold the actual output record counts are published in a
counter group named after that class, with counter names of the form out<d>,
where <d> is the index of the output within the job (starting from 0).
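
To make the naming concrete, here is a rough sketch in plain Java (no Hadoop
dependency). It assumes that "o.a.c" expands to "org.apache.crunch"; the
class name CrunchOutputCounters, its methods, and the sample values are
hypothetical and only there for illustration, they are not part of Crunch.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Crunch: builds the counter group/name
// described above and sums the per-output counters.
class CrunchOutputCounters {
    // Assumption: "o.a.c.io.CrunchOutputs" abbreviates this full class name.
    static final String GROUP = "org.apache.crunch.io.CrunchOutputs";

    // Counter name for the output with the given index (indices start at 0).
    static String counterName(int outputIndex) {
        if (outputIndex < 0) {
            throw new IllegalArgumentException("output index must be >= 0");
        }
        return "out" + outputIndex;
    }

    // Sums every counter named out<d> in a name -> value map taken from the
    // group above; counters with any other name are ignored.
    static long totalOutputRecords(Map<String, Long> groupCounters) {
        long total = 0;
        for (Map.Entry<String, Long> e : groupCounters.entrySet()) {
            if (e.getKey().matches("out\\d+")) {
                total += e.getValue();
            }
        }
        return total;
    }

    public static void main(String[] args) {
        // Made-up values standing in for counters read from a finished job.
        Map<String, Long> sample = new LinkedHashMap<>();
        sample.put(counterName(0), 120L);
        sample.put(counterName(1), 30L);
        System.out.println(GROUP + " / " + counterName(0));
        System.out.println("total output records = " + totalOutputRecords(sample));
    }
}
```

With a real job you would fill the map from the job's counters instead of
made-up values, for example by looking up each counter by group and name
through Hadoop's Counters.findCounter(group, name) and calling getValue().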

- Gabriel



On Fri, Jun 7, 2013 at 10:54 AM, Sandy Ryza <[email protected]> wrote:

> Hey All,
>
> Does Crunch not use the normal MR channels for outputting stuff?  I'm
> noticing that when I look at a job's Counters, the output records are
> always 0, even when I know data has been written.
>
> thanks
> -Sandy
>
