Binning operation for the generation of Hive partitioned data

Elliot West Tue, 22 Apr 2014 04:12:27 -0700

Hello,

I'm evaluating Apache Crunch as a possible replacement for some of our data
processing frameworks that run on Hadoop. I can find Crunch constructs that
map to most of the operations that our processes perform. However, we
frequently bin data by a date field in order to generate partitioned Hive
tables - a fairly common operation, I believe. I can't find a similar
binning operation in the Crunch user manual and was wondering if/how this
could be achieved with Apache Crunch?
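
To make the question concrete, here is a minimal sketch in plain Java (not
Crunch) of the kind of binning meant here: each record is assigned to a
Hive-style partition directory derived from its date field. The record
layout ("yyyy-MM-dd,payload"), the `dt=` partition key, and the class and
method names are all assumptions for illustration only.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class DateBinning {

    // Assumed record layout: "yyyy-MM-dd,payload". The date is the first field.
    static String partitionFor(String record) {
        String date = record.substring(0, record.indexOf(','));
        // Hive-style partition directory name; "dt" is a hypothetical key.
        return "dt=" + date;
    }

    // Groups records by the partition directory derived from their date.
    static Map<String, List<String>> bin(List<String> records) {
        Map<String, List<String>> bins = new TreeMap<>();
        for (String record : records) {
            bins.computeIfAbsent(partitionFor(record), k -> new ArrayList<>())
                .add(record);
        }
        return bins;
    }

    public static void main(String[] args) {
        List<String> records =
            Arrays.asList("2014-04-21,a", "2014-04-22,b", "2014-04-21,c");
        // Print each partition directory with the records that fall into it.
        bin(records).forEach((dir, rows) -> System.out.println(dir + " -> " + rows));
    }
}
```

In a real job each bin would be written to its own directory under the
table location rather than held in memory; the sketch only shows the
grouping step.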


Cheers - Elliot.
