[ https://issues.apache.org/jira/browse/PIG-5371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Laszlo Bodor updated PIG-5371: ------------------------------ Description: Attached [^simpleTest.out]. It seems like HDFS counter 'HDFS_BYTES_WRITTEN' returns the byte count not only for the result of pig store operator, but it includes the jar files as well. {code} 2018-11-23 10:14:52,661 [IPC Server handler 5 on 54929] INFO org.apache.hadoop.hdfs.StateChange - BLOCK* allocate blk_1073741827_1003, replicas=127.0.0.1:54934, 127.0.0.1:54930, 127.0.0.1:54943 for /tmp/temp-157262781/tmp-1057655772/automaton-1.11-8.jar ... 2018-11-23 10:14:52,735 [PacketResponder: BP-26001448-10.200.50.195-1542964474138:blk_1073741827_1003, type=HAS_DOWNSTREAM_IN_PIPELINE, downstreams=2:[127.0.0.1:54930, 127.0.0.1:54943]] INFO org.apache.hadoop.hdfs.server.datanode.DataNode.clienttrace - src: /127.0.0.1:54978, dest: /127.0.0.1:54934, bytes: 176285, op: HDFS_WRITE, cliID: DFSClient_NONMAPREDUCE_-1959727442_1, offset: 0, srvID: 108c4000-1ae0-402e-82cf-bf403629c0f7, blockid: BP-26001448-10.200.50.195-1542964474138:blk_1073741827_1003, duration(ns): 57162859 {code} was: Attached [^simpleTest.out]. It seems like HDFS counter 'HDFS_BYTES_WRITTEN' returns the byte count not only for the result of pig store operator, but it includes the jar files as well. {code} {code} > Hdfs bytes written assertions fail in TestPigRunner > --------------------------------------------------- > > Key: PIG-5371 > URL: https://issues.apache.org/jira/browse/PIG-5371 > Project: Pig > Issue Type: Bug > Reporter: Laszlo Bodor > Assignee: Laszlo Bodor > Priority: Major > Attachments: PIG-5371.01.patch, simpleTest.out > > > Attached [^simpleTest.out]. It seems like HDFS counter 'HDFS_BYTES_WRITTEN' > returns the byte count not only for the result of pig store operator, but it > includes the jar files as well. > {code} > 2018-11-23 10:14:52,661 [IPC Server handler 5 on 54929] INFO > org.apache.hadoop.hdfs.StateChange - BLOCK* allocate blk_1073741827_1003, > replicas=127.0.0.1:54934, 127.0.0.1:54930, 127.0.0.1:54943 for > /tmp/temp-157262781/tmp-1057655772/automaton-1.11-8.jar > ... > 2018-11-23 10:14:52,735 [PacketResponder: > BP-26001448-10.200.50.195-1542964474138:blk_1073741827_1003, > type=HAS_DOWNSTREAM_IN_PIPELINE, downstreams=2:[127.0.0.1:54930, > 127.0.0.1:54943]] INFO > org.apache.hadoop.hdfs.server.datanode.DataNode.clienttrace - src: > /127.0.0.1:54978, dest: /127.0.0.1:54934, bytes: 176285, op: HDFS_WRITE, > cliID: DFSClient_NONMAPREDUCE_-1959727442_1, offset: 0, srvID: > 108c4000-1ae0-402e-82cf-bf403629c0f7, blockid: > BP-26001448-10.200.50.195-1542964474138:blk_1073741827_1003, duration(ns): > 57162859 > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)