Mohit Sabharwal created PIG-4549:
------------------------------------

             Summary: Set CROSS operation parallelism for Spark engine
                 Key: PIG-4549
                 URL: https://issues.apache.org/jira/browse/PIG-4549
             Project: Pig
          Issue Type: Sub-task
          Components: spark
    Affects Versions: spark-branch
            Reporter: Mohit Sabharwal
            Assignee: Mohit Sabharwal
             Fix For: spark-branch


Spark engine should set parallelism to be used for CROSS operation by GFCross 
UDF.

If not set, GFCross throws an exception:
{code}
                String s = cfg.get(PigImplConstants.PIG_CROSS_PARALLELISM + "." 
+ crossKey);
                if (s == null) {
                    throw new IOException("Unable to get parallelism hint from 
job conf");
                }
{code}

Estimating parallelism for Spark engine is a TBD item. Until that is done, for 
CROSS to work, we should use the default parallelism value in GFCross.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to