[ 
https://issues.apache.org/jira/browse/PIG-4616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120965#comment-15120965
 ] 

liyunzhang_intel commented on PIG-4616:
---------------------------------------

Summarize some reasons for the unit test failures:
1. The job number is different between mr and spark mode. For multiple stores 
case like TestPigRunner#testDisablePigCounters2, there will be two spark jobs 
rather than one.
{code}
A = load '" + INPUT_FILE + "' as (a0:int, a1:int, a2:int);
B = filter A by a0 > 3;
store A into 'output';
store B into 'tmp/output';
{code}

The sparkplan has 1 sparkOperator after multiquery optimization but it has two 
POStore. In spark mode,
1 POStore generates 1 spark job(see 
org.apache.pig.tools.pigstats.spark.SparkPigStats#addJobStats). Two POStore
generates 2 spark jobs.


2.TestPigRunner#testGetHadoopCounters fails due to we did not implement 
SparkJobStats#getHadoopCounters.In PIG-4616_1.patch, we only collect 
hdfs_bytes_read, hdfs_bytes_written  these two metrics data to HDFS_BYTES_READ 
and HDFS_BYTES_WRITTEN counter.
  

> Fix UT errors of TestPigRunner in Spark mode
> --------------------------------------------
>
>                 Key: PIG-4616
>                 URL: https://issues.apache.org/jira/browse/PIG-4616
>             Project: Pig
>          Issue Type: Sub-task
>          Components: spark
>            Reporter: Xianda Ke
>            Assignee: Xianda Ke
>             Fix For: spark-branch
>
>         Attachments: PIG-4616.patch, PIG-4616_1.patch
>
>
> Following unit test failed:
> org.apache.pig.test.TestPigRunner.returnCodeTest
> org.apache.pig.test.TestPigRunner.testEmptyFileCounter
> org.apache.pig.test.TestPigRunner.testDisablePigCounters2
> org.apache.pig.test.TestPigRunner.simpleTest
> org.apache.pig.test.TestPigRunner.simpleTest2
> org.apache.pig.test.TestPigRunner.MQDepJobFailedTest
> org.apache.pig.test.TestPigRunner.scriptsInDfsTest
> org.apache.pig.test.TestPigRunner.testGetHadoopCounters
> org.apache.pig.test.TestPigRunner.simpleMultiQueryTest
> org.apache.pig.test.TestPigRunner.testDuplicateCounterName
> org.apache.pig.test.TestPigRunner.testRegisterExternalJar
> org.apache.pig.test.TestPigRunner.simpleMultiQueryTest2
> org.apache.pig.test.TestPigRunner.testDuplicateCounterName2
> org.apache.pig.test.TestPigRunner.returnCodeTest2
> org.apache.pig.test.TestPigRunner.orderByTest
> org.apache.pig.test.TestPigRunner.testDisablePigCounters
> org.apache.pig.test.TestPigRunner.testLongCounterName
> org.apache.pig.test.TestPigRunner.testEmptyFileCounter2



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to