PigStorage and ORC inputFormat

Abbass MAROUNI Wed, 21 May 2014 07:27:12 -0700

Hi all,

PigStoarge parsing a csv file, did I get it right :


HDFS_Block -> TextInputFormat -> (Key:offset, Value:line) -> PigStorage ->
Tuple -> Mapper ?

If so, what are the input/output (key, value) pairs of the mapper ?

How does formats like RC/ORC (that promise to read less input) work ?

HDFS_Block -> ORCInputFormat (concerned columns only) -> (Key, Value) ->
ORCParser ? -> Tuple -> Mapper ?

Best regards,

PigStorage and ORC inputFormat

Reply via email to