wolfgang hoschek created PIG-3588:
-------------------------------------
Summary: Integrate Pig with Morphlines.
Key: PIG-3588
URL: https://issues.apache.org/jira/browse/PIG-3588
Project: Pig
Issue Type: New Feature
Components: internal-udfs
Affects Versions: 0.12.0
Reporter: wolfgang hoschek
Integrate Pig with Morphlines.
Specifically, add support for calling a morphline from Pig as a UDTF, so that
Pig scripts can leverage existing morphline ETL functionality. Input data often
needs some flexible massaging to get it into the desired shape, and morphline
logic helps do this in a user-friendly, pluggable, efficient, pipelined manner.
The work boils down to three steps: transforming an input tuple into a morphline
record, feeding the record into the morphline processing API, and finally
converting zero or more morphline output records into corresponding Pig tuples.
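
To make the three steps concrete, here is a minimal plain-Java sketch of the
data flow. It deliberately does not use the real Pig or Morphlines APIs:
MorphlineBridgeSketch, toRecord, toTuple, process and splitLine are all
hypothetical names, a "tuple" is modelled as a List<Object>, a "record" as a
multi-valued Map<String, List<Object>>, and the "morphline" as a plain
java.util.function.Function. An actual integration would presumably replace
these stand-ins with a Pig UDF and the morphline processing API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical stand-ins only: none of these types are real Pig or Morphlines classes.
class MorphlineBridgeSketch {

    // Step 1: turn a positional tuple into a record (field name -> list of values).
    static Map<String, List<Object>> toRecord(List<String> fieldNames, List<Object> tuple) {
        if (fieldNames.size() != tuple.size()) {
            throw new IllegalArgumentException("schema and tuple differ in length");
        }
        Map<String, List<Object>> record = new LinkedHashMap<>();
        for (int i = 0; i < fieldNames.size(); i++) {
            record.computeIfAbsent(fieldNames.get(i), k -> new ArrayList<Object>())
                  .add(tuple.get(i));
        }
        return record;
    }

    // Step 3: turn one output record back into a tuple, taking the first value
    // of each requested field (null when the field is absent or empty).
    static List<Object> toTuple(List<String> outputFields, Map<String, List<Object>> record) {
        List<Object> tuple = new ArrayList<>();
        for (String name : outputFields) {
            List<Object> values = record.get(name);
            tuple.add(values == null || values.isEmpty() ? null : values.get(0));
        }
        return tuple;
    }

    // Steps 1-3 combined: one input tuple in, zero or more output tuples out.
    static List<List<Object>> process(
            List<String> inputFields,
            List<Object> inputTuple,
            Function<Map<String, List<Object>>, List<Map<String, List<Object>>>> morphline,
            List<String> outputFields) {
        Map<String, List<Object>> record = toRecord(inputFields, inputTuple);
        List<List<Object>> out = new ArrayList<>();
        for (Map<String, List<Object>> result : morphline.apply(record)) { // step 2
            out.add(toTuple(outputFields, result));
        }
        return out;
    }

    // Toy "morphline": splits the "line" field on commas and emits one record
    // per non-empty token, carrying the "id" field along.
    static List<Map<String, List<Object>>> splitLine(Map<String, List<Object>> record) {
        List<Map<String, List<Object>>> out = new ArrayList<>();
        for (Object line : record.getOrDefault("line", new ArrayList<Object>())) {
            for (String token : String.valueOf(line).split(",")) {
                String trimmed = token.trim();
                if (trimmed.isEmpty()) {
                    continue;
                }
                Map<String, List<Object>> result = new LinkedHashMap<>();
                result.put("id", new ArrayList<Object>(
                        record.getOrDefault("id", new ArrayList<Object>())));
                List<Object> word = new ArrayList<>();
                word.add(trimmed);
                result.put("word", word);
                out.add(result);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<List<Object>> out = process(
                Arrays.asList("id", "line"),
                Arrays.<Object>asList(7, "a, b,,c"),
                MorphlineBridgeSketch::splitLine,
                Arrays.asList("id", "word"));
        System.out.println(out); // prints [[7, a], [7, b], [7, c]]
    }
}
```

The one-to-many shape is the point of the sketch: a single input tuple may
yield several output tuples, or none at all (for example, an empty "line"
value produces an empty result list).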
Some background is here:
http://cloudera.github.io/cdk/docs/current/cdk-morphlines/morphlinesReferenceGuide.html
http://cloudera.github.io/cdk/docs/current/cdk-morphlines/index.html
--
This message was sent by Atlassian JIRA
(v6.1#6144)