Tongjie Chen created PIG-3986:
---------------------------------

             Summary: PigSplit to support multiple split class
                 Key: PIG-3986
                 URL: https://issues.apache.org/jira/browse/PIG-3986
             Project: Pig
          Issue Type: Improvement
            Reporter: Tongjie Chen


Currently one PigSplit wraps one or more input splits, and Pig assigns one 
PigSplit to one mapper. However, when it serializes the split class name, it 
expects all wrapped input splits to be of the same class, so it serializes the 
class name only once, taking it from the first split (see code snippet at the end).

To let a PigSplit wrap splits of multiple classes, we can serialize each split 
along with its own class name. This would allow each split to be 
deserialized/restored correctly. Of course, the LoadFunc would need to dispatch 
each input split to the appropriate record reader.
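
A rough sketch of the per-split serialization idea follows. It is not Pig's 
actual code: SketchSplit, FileSketchSplit, TableSketchSplit and 
MultiClassSplitSketch are hypothetical stand-ins (SketchSplit mimics the 
write/readFields shape of a Hadoop Writable), and it uses only java.io so it 
can run on its own.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;

// Hypothetical stand-in for a writable input split; not a Pig or Hadoop type.
interface SketchSplit {
    void write(DataOutput out) throws IOException;
    void readFields(DataInput in) throws IOException;
}

// Hypothetical split kind #1: identified by a file path.
class FileSketchSplit implements SketchSplit {
    String path = "";
    public FileSketchSplit() {}
    public FileSketchSplit(String path) { this.path = path; }
    public void write(DataOutput out) throws IOException { out.writeUTF(path); }
    public void readFields(DataInput in) throws IOException { path = in.readUTF(); }
}

// Hypothetical split kind #2: identified by a numeric region id.
class TableSketchSplit implements SketchSplit {
    int region;
    public TableSketchSplit() {}
    public TableSketchSplit(int region) { this.region = region; }
    public void write(DataOutput out) throws IOException { out.writeInt(region); }
    public void readFields(DataInput in) throws IOException { region = in.readInt(); }
}

class MultiClassSplitSketch {
    // Write the split count, then for EACH split its class name and its payload.
    static byte[] writeSplits(SketchSplit[] splits) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(bytes);
        out.writeInt(splits.length);
        for (SketchSplit split : splits) {
            out.writeUTF(split.getClass().getName());
            split.write(out);
        }
        out.flush();
        return bytes.toByteArray();
    }

    // Read the count, then instantiate each split from its own recorded class name.
    static SketchSplit[] readSplits(byte[] data)
            throws IOException, ReflectiveOperationException {
        DataInputStream in = new DataInputStream(new ByteArrayInputStream(data));
        SketchSplit[] splits = new SketchSplit[in.readInt()];
        for (int i = 0; i < splits.length; i++) {
            String className = in.readUTF();
            SketchSplit split = (SketchSplit) Class.forName(className)
                    .getDeclaredConstructor().newInstance();
            split.readFields(in);
            splits[i] = split;
        }
        return splits;
    }
}
```

With this layout a mixed array such as { new FileSketchSplit("/data/part-0"), 
new TableSketchSplit(7) } round-trips through writeSplits/readSplits with each 
element restored as its own class. The cost is one class name per split instead 
of one per PigSplit; a loader could then check the restored type of each split 
to pick the matching record reader.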




--
This message was sent by Atlassian JIRA
(v6.2#6252)
