[ 
https://issues.apache.org/jira/browse/PIG-3780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Cheolsoo Park updated PIG-3780:
-------------------------------

    Attachment: PIG-3780-1.patch

In the attached patch, I am doing three things-
# Set "tez.session.reuse" to false in properties object and reuse its reference 
to start PigServer everywhere. I noticed that cluster.getProperties() returns a 
copy of object rather than a reference to it, so setting any properties without 
keeping the reference never takes effect.
# TestCustomerPartitioner was failing because "part-r-00000" was hardcoded in 
test cases. But the Tez output filename doesn't follow this convention. I 
changed the test to use FileStatus instead.
# TestTezCompiler was broken due to mismatch in gold files. This is probably 
because we haven't run unit tests for a while. I regenerated them to reflect 
recent changes.

I have confirmed all the unit tests in test-tez pass.

> Tez mini cluster tests run for a very long time with TezSession reuse on
> ------------------------------------------------------------------------
>
>                 Key: PIG-3780
>                 URL: https://issues.apache.org/jira/browse/PIG-3780
>             Project: Pig
>          Issue Type: Sub-task
>          Components: tez
>    Affects Versions: tez-branch
>            Reporter: Cheolsoo Park
>            Assignee: Cheolsoo Park
>             Fix For: tez-branch
>
>         Attachments: PIG-3780-1.patch
>
>
> In the current tez branch, mini cluster unit tests are very slow. The reason 
> is as follows:
> * TezSession reuse is by default on.
> * Each test case runs, and it waits for Tez AM to terminate.
> *  After Tez AM times out (usually after several minutes), another test case 
> runs.
> Two questions that I have are:
> # Why doesn't TezSession reuse work in mini cluster?
> # Why is TezSession reuse not disabled in some tests (e.g. TestAccumulator) 
> where we explicitly set "tez.session.reuse" to false?
> As for #2, I realized that "tez.session.reuse" was never set in the 
> properties object that is passed to PigServer. I am going to upload a patch 
> that fixes this problem in this jira.
> As for #1, I don't have an answer yet. But I think we can fix this in a 
> separate jira once we get Tez unit tests working again.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to