[
https://issues.apache.org/jira/browse/PIG-4788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15171385#comment-15171385
]
liyunzhang_intel commented on PIG-4788:
---------------------------------------
[~pallavi.rao]: thanks for your review. After investigating the code and trying
to implement your option(use PigSplitSpark in place of PigSplit), i find it is
difficult to do that because we need copy a lot of code than just copying
PigSplit to PigSplitSpark. So agree with [~xuefuz]. we can leave this open
until we merge the branch to trunk.
> the value BytesRead metric info always returns 0 even the length of input
> file is not 0 in spark engine
> -------------------------------------------------------------------------------------------------------
>
> Key: PIG-4788
> URL: https://issues.apache.org/jira/browse/PIG-4788
> Project: Pig
> Issue Type: Sub-task
> Components: spark
> Reporter: liyunzhang_intel
> Assignee: liyunzhang_intel
> Fix For: spark-branch
>
> Attachments: PIG-4788.patch
>
>
> In
> [JobMetricsLinstener#onTaskEnd|https://github.com/apache/pig/blob/spark/src/org/apache/pig/tools/pigstats/spark/SparkJobStats.java#L140],
> taskMetrics.inputMetrics().get().bytesRead() always returns 0 even the
> length of input file is not zero.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)