guoxiaolongzte created SPARK-23270:
--------------------------------------

             Summary: FileInputDStream Streaming UI's records should not be set to the default value of 0, it should be the total number of rows of new files.
                 Key: SPARK-23270
                 URL: https://issues.apache.org/jira/browse/SPARK-23270
             Project: Spark
          Issue Type: Bug
          Components: DStreams
    Affects Versions: 2.4.0
            Reporter: guoxiaolongzte
         Attachments: 1.png

In the Streaming UI, the records count for a FileInputDStream should not be left at the default value of 0. It should be the total number of rows in the new files picked up for the batch.

------------------- in FileInputDStream.scala start -------------------
val inputInfo = StreamInputInfo(id, 0, metadata) // set to the default value of 0
ssc.scheduler.inputInfoTracker.reportInfo(validTime, inputInfo)

case class StreamInputInfo(
    inputStreamId: Int, numRecords: Long, metadata: Map[String, Any] = Map.empty)
------------------- in FileInputDStream.scala end -------------------

For comparison, DirectKafkaInputDStream reports the RDD count as numRecords:

------------------- in DirectKafkaInputDStream.scala start -------------------
val inputInfo = StreamInputInfo(id, rdd.count, metadata) // set to rdd count as numRecords
ssc.scheduler.inputInfoTracker.reportInfo(validTime, inputInfo)

case class StreamInputInfo(
    inputStreamId: Int, numRecords: Long, metadata: Map[String, Any] = Map.empty)
------------------- in DirectKafkaInputDStream.scala end -------------------

test method:
./bin/spark-submit --class org.apache.spark.examples.streaming.HdfsWordCount examples/jars/spark-examples_2.11-2.4.0-SNAPSHOT.jar /spark/tmp/
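
The requested behaviour can be pictured with a small, Spark-free sketch. It is not the actual patch: the case class here is a local stand-in mirroring the StreamInputInfo quoted in this issue, and `FileBatchRecordCount`, `inputInfoFor`, `newFiles` and `countRows` are hypothetical names introduced only for illustration. The idea is that numRecords becomes the sum of the row counts of the batch's new files rather than a hard-coded 0.

```scala
// Local stand-in that mirrors the StreamInputInfo case class quoted in this issue.
// It is NOT Spark's own class; it only keeps the sketch self-contained.
final case class StreamInputInfo(
    inputStreamId: Int,
    numRecords: Long,
    metadata: Map[String, Any] = Map.empty)

object FileBatchRecordCount {
  // Hypothetical helper: builds the input info for one batch.
  // `newFiles` are the files picked up for the batch, and `countRows` stands in
  // for whatever counts the rows of one file (in Spark, e.g. a count on its RDD).
  def inputInfoFor(
      id: Int,
      newFiles: Seq[String],
      countRows: String => Long,
      metadata: Map[String, Any] = Map.empty): StreamInputInfo = {
    // Sum the per-file row counts instead of reporting the default 0.
    val numRecords = newFiles.map(countRows).sum
    StreamInputInfo(id, numRecords, metadata)
  }
}
```

With two new files holding 3 and 5 rows, `inputInfoFor` would report 8 records, and a batch with no new files would still report 0. In a real stream, counting rows means an extra pass over the new files, so the cost of that count would need to be weighed against the more accurate UI figure.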