[
https://issues.apache.org/jira/browse/SPARK-17038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15419571#comment-15419571
]
Shixiong Zhu commented on SPARK-17038:
--------------------------------------
Good catch. Could you submit a PR to fix it, please?
> StreamingSource reports metrics for lastCompletedBatch instead of
> lastReceivedBatch
> -----------------------------------------------------------------------------------
>
> Key: SPARK-17038
> URL: https://issues.apache.org/jira/browse/SPARK-17038
> Project: Spark
> Issue Type: Bug
> Components: Streaming
> Affects Versions: 1.6.2, 2.0.0
> Reporter: Oz Ben-Ami
> Priority: Minor
> Labels: metrics
>
> StreamingSource's lastReceivedBatch_submissionTime,
> lastReceivedBatch_processingTimeStart, and
> lastReceivedBatch_processingTimeEnd all use data from lastCompletedBatch
> instead of lastReceivedBatch. In particular, this makes it impossible to
> match lastReceivedBatch_records with a batchID/submission time.
> This is apparent when looking at StreamingSource.scala, lines 89-94.
> I would guess that just replacing Completed->Received in those lines would
> fix the issue, but I haven't tested it.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]