[ https://issues.apache.org/jira/browse/PIG-4059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16031937#comment-16031937 ]
Rohini Palaniswamy commented on PIG-4059:
-----------------------------------------

[~szita],
With PIG-4941, support for SplitLocationInfo was added, which is available only from Hadoop 2.5 onward (MAPREDUCE-5896). So Pig 0.17 will only work with Hadoop 2.5 and above. Please document in the release notes that the minimum supported version for 0.17 is Hadoop 2.5.

> Pig on Spark
> ------------
>
>                 Key: PIG-4059
>                 URL: https://issues.apache.org/jira/browse/PIG-4059
>             Project: Pig
>          Issue Type: New Feature
>          Components: spark
>            Reporter: Rohini Palaniswamy
>            Assignee: Praveen Rachabattuni
>              Labels: spork
>             Fix For: spark-branch, 0.17.0
>
>         Attachments: Pig-on-Spark-Design-Doc.pdf, Pig-on-Spark-Scope.pdf
>
>
> Setting up your development environment:
> 0. Download a Spark release package (currently Pig on Spark only supports Spark 1.6).
> 1. Check out the Pig Spark branch.
> 2. Build Pig by running "ant jar" and "ant -Dhadoopversion=23 jar" for hadoop-2.x versions.
> 3. Configure these environment variables:
>     export HADOOP_USER_CLASSPATH_FIRST="true"
>    "local" and "yarn-client" modes are now supported; export the environment variable "SPARK_MASTER" accordingly:
>     export SPARK_MASTER=local or export SPARK_MASTER="yarn-client"
> 4. In local mode:
>     ./pig -x spark_local xxx.pig
>    In yarn-client mode:
>     export SPARK_HOME=xx;
>     export SPARK_JAR=hdfs://example.com:8020/xxxx (the HDFS location where you uploaded the spark-assembly*.jar)
>     ./pig -x spark xxx.pig

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
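
For reference, steps 3 and 4 of the quoted setup instructions can be collected into a small shell sketch. This is only an illustration of the variables and commands named in the issue description: the helper name pig_command is hypothetical, the script name xxx.pig is the placeholder used in the description, and the helper only prints the command line rather than running Pig. In yarn-client mode, SPARK_HOME and SPARK_JAR must also be exported; their values are elided in the description and are not filled in here.

```shell
#!/bin/sh
# Sketch of the Pig-on-Spark environment from the issue description.
# Assumes a Pig build from the Spark branch in the current directory.

# Step 3: variable named in the description.
export HADOOP_USER_CLASSPATH_FIRST="true"

# Step 3: the description lists two modes, "local" and "yarn-client".
# Defaulting to "local" is an assumption made for this sketch.
export SPARK_MASTER="${SPARK_MASTER:-local}"

# Hypothetical helper: print (not run) the pig command for a given mode.
#   $1 = mode ("local" or "yarn-client"), $2 = Pig script
pig_command() {
    case "$1" in
        local)       echo "./pig -x spark_local $2" ;;
        yarn-client) echo "./pig -x spark $2" ;;
        *)           echo "unsupported mode: $1" >&2; return 1 ;;
    esac
}

# Show the command that step 4 would run for the selected mode.
pig_command "$SPARK_MASTER" xxx.pig
```

Printing the command instead of executing it keeps the sketch safe to run on a machine without Pig or Spark installed; replace the echo with the command itself once the environment is in place.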