[
https://issues.apache.org/jira/browse/PIG-4483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rohini Palaniswamy updated PIG-4483:
------------------------------------
Fix Version/s: 0.15.0
Labels: (was: newbie)
Status: Patch Available (was: Open)
> Pig on Tez output statistics shows storing to same directory twice for union
> ----------------------------------------------------------------------------
>
> Key: PIG-4483
> URL: https://issues.apache.org/jira/browse/PIG-4483
> Project: Pig
> Issue Type: Improvement
> Affects Versions: 0.14.0
> Reporter: Rohini Palaniswamy
> Assignee: Rohini Palaniswamy
> Fix For: 0.15.0
>
> Attachments: PIG-4483-1.patch
>
>
> For the below script
> A = LOAD 'data1';
> B = LOAD 'data2';
> C = UNION A, B;
> STORE C into 'data3';
> Output message is shown as below due to vertex group and storing from
> separate vertices.
> Successfully stored 10 records (xxx bytes) in: "data3"
> Successfully stored 20 records (yyy bytes) in: "data3"
> Even though it is correct it can be confusing for users and they have to sum
> it up before comparing to Pig on MR output message. OutputStats with same
> filename should be combined and shown as
> Successfully stored 30 records (xxx bytes) in: "data3"
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)