[jira] [Commented] (PIG-5441) Pig skew join tez grace reducer fails to find shuffle data

2023-05-24 Thread Yi Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/PIG-5441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17726009#comment-17726009
 ] 

Yi Zhang commented on PIG-5441:
---

[~knoguchi] can you add unit test as separate jira? I am not actively working 
on Pig itself and don't have bandwidth right now. Thank you! 

> Pig skew join tez grace reducer fails to find shuffle data
> --
>
> Key: PIG-5441
> URL: https://issues.apache.org/jira/browse/PIG-5441
> Project: Pig
>  Issue Type: Bug
>  Components: tez
>Affects Versions: 0.17.0
>Reporter: Yi Zhang
>Assignee: Yi Zhang
>Priority: Major
> Fix For: 0.18.0
>
>
> User pig tez skew join encountered issue of not finding shuffle data from the 
> sampler aggregate vertex. The right side join has >1 reducers.
> For workaround adjust tez.runtime.transfer.data-via-events.max-size to avoid 
> spill to disk for the sampler aggregation vertex. 
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (PIG-5441) Pig skew join tez grace reducer fails to find shuffle data

2023-05-24 Thread Koji Noguchi (Jira)


[ 
https://issues.apache.org/jira/browse/PIG-5441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17725786#comment-17725786
 ] 

Koji Noguchi commented on PIG-5441:
---

It would be nice if you can add a unit test.  
(However, if you don't have bandwidth I understand.  I can try to add the test 
later as a separate jira.)

> Pig skew join tez grace reducer fails to find shuffle data
> --
>
> Key: PIG-5441
> URL: https://issues.apache.org/jira/browse/PIG-5441
> Project: Pig
>  Issue Type: Bug
>  Components: tez
>Affects Versions: 0.17.0
>Reporter: Yi Zhang
>Assignee: Yi Zhang
>Priority: Major
> Fix For: 0.18.0
>
>
> User pig tez skew join encountered issue of not finding shuffle data from the 
> sampler aggregate vertex. The right side join has >1 reducers.
> For workaround adjust tez.runtime.transfer.data-via-events.max-size to avoid 
> spill to disk for the sampler aggregation vertex. 
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (PIG-5441) Pig skew join tez grace reducer fails to find shuffle data

2023-05-24 Thread Rohini Palaniswamy (Jira)


[ 
https://issues.apache.org/jira/browse/PIG-5441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17725781#comment-17725781
 ] 

Rohini Palaniswamy commented on PIG-5441:
-

+1. Can you just attach the patch to jira ?

> Pig skew join tez grace reducer fails to find shuffle data
> --
>
> Key: PIG-5441
> URL: https://issues.apache.org/jira/browse/PIG-5441
> Project: Pig
>  Issue Type: Bug
>  Components: tez
>Affects Versions: 0.17.0
>Reporter: Yi Zhang
>Assignee: Yi Zhang
>Priority: Major
> Fix For: 0.18.0
>
>
> User pig tez skew join encountered issue of not finding shuffle data from the 
> sampler aggregate vertex. The right side join has >1 reducers.
> For workaround adjust tez.runtime.transfer.data-via-events.max-size to avoid 
> spill to disk for the sampler aggregation vertex. 
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (PIG-5441) Pig skew join tez grace reducer fails to find shuffle data

2023-05-24 Thread Rohini Palaniswamy (Jira)


 [ 
https://issues.apache.org/jira/browse/PIG-5441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rohini Palaniswamy updated PIG-5441:

Fix Version/s: 0.18.0
 Assignee: Yi Zhang
   Status: Patch Available  (was: Open)

> Pig skew join tez grace reducer fails to find shuffle data
> --
>
> Key: PIG-5441
> URL: https://issues.apache.org/jira/browse/PIG-5441
> Project: Pig
>  Issue Type: Bug
>  Components: tez
>Affects Versions: 0.17.0
>Reporter: Yi Zhang
>Assignee: Yi Zhang
>Priority: Major
> Fix For: 0.18.0
>
>
> User pig tez skew join encountered issue of not finding shuffle data from the 
> sampler aggregate vertex. The right side join has >1 reducers.
> For workaround adjust tez.runtime.transfer.data-via-events.max-size to avoid 
> spill to disk for the sampler aggregation vertex. 
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (PIG-5440) Extra jars needed for hive3

2023-05-24 Thread Rohini Palaniswamy (Jira)


[ 
https://issues.apache.org/jira/browse/PIG-5440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17725780#comment-17725780
 ] 

Rohini Palaniswamy commented on PIG-5440:
-

+1

> Extra jars needed for hive3
> ---
>
> Key: PIG-5440
> URL: https://issues.apache.org/jira/browse/PIG-5440
> Project: Pig
>  Issue Type: Improvement
>Reporter: Koji Noguchi
>Assignee: Koji Noguchi
>Priority: Minor
> Attachments: pig-5440-v01.patch, pig-5440-v02.patch
>
>
> When testing Hive3,  e2e tests were failing with 
> {{Caused by: java.lang.NoClassDefFoundError: 
> org/apache/hadoop/hive/llap/security/LlapSigner$Signable}}  etc. 
> Updating dependent classes. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)