Are you storing the data in sequence files? -Joey
On Fri, May 20, 2011 at 10:33 AM, W.P. McNeill <[email protected]> wrote: > The keys are Text and the values are large custom data structures serialized > with Avro. > > I also have counters for the job that generates these files that gives me > this information but sometimes...Well, it's a long story. Suffice to say > that it's nice to have a post-hoc method too. :-) > > The identity mapper sounds like the way to go. > -- Joseph Echeverria Cloudera, Inc. 443.305.9434
