Tongjie Chen created PIG-3986:
---------------------------------
Summary: PigSplit to support multiple split class
Key: PIG-3986
URL: https://issues.apache.org/jira/browse/PIG-3986
Project: Pig
Issue Type: Improvement
Reporter: Tongjie Chen
Currently one PigSplit wraps one to many input split and pig assign one
PigSplit to one mapper; however when it serializes the split class name, it
expects all input split to be of same class, hence it serializes class name
only once --- the first split (see code snippet at the end).
To support PigSplit wrap multi split class, we can serialize each split along
with its own class name. This would allow each split to be
deserialized/restored correctly. Of course, LoadFunc would need to dispatch
input split to appropriate record reader.
--
This message was sent by Atlassian JIRA
(v6.2#6252)