[ 
https://issues.apache.org/jira/browse/SPARK-13650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15179347#comment-15179347
 ] 

Mario Briggs commented on SPARK-13650:
--------------------------------------

Running locally on my MAC, here is the output

{quote}
[Stage 0:>                                                          (0 + 0) / 
2]16/03/04 10:38:18 INFO VerifiableProperties: Verifying properties
16/03/04 10:38:18 INFO VerifiableProperties: Property auto.offset.reset is 
overridden to smallest
16/03/04 10:38:18 INFO VerifiableProperties: Property group.id is overridden to 
16/03/04 10:38:18 INFO VerifiableProperties: Property zookeeper.connect is 
overridden to 

-------------------------------------------
Time: 1457068098000 ms
-------------------------------------------
10


[Stage 6:=======================================>                   (2 + 0) / 3]

[Stage 6:=======================================>                   (2 + 0) / 3]

{quote}

The '10' is the output from the first batch interval at time '1457068098000 ms' 
. Thereafter, the only output for the next ~2 minutes is the 'stage 6'. A ^C 
fails to stop the app and need to do a 'kill -9 pid'

> Usage of the window() function on DStream
> -----------------------------------------
>
>                 Key: SPARK-13650
>                 URL: https://issues.apache.org/jira/browse/SPARK-13650
>             Project: Spark
>          Issue Type: Bug
>          Components: Streaming
>    Affects Versions: 1.5.2, 1.6.0, 2.0.0
>            Reporter: Mario Briggs
>            Priority: Minor
>
> Is there some guidance of the usage of the Window() function on DStream. Here 
> is my academic use-case for which it fails.
> Standard word count
>  val ssc = new StreamingContext(sparkConf, Seconds(6))
>  val messages = KafkaUtils.createDirectStream(...)
>  val words = messages.map(_._2).flatMap(_.split(" "))
>  val window = words.window(Seconds(12), Seconds(6)) 
>  window.count().print()
> For the first batch interval it gives the count and then it hangs (inside the 
> unionRDD)
> I say the above use-case is academic since one can achieve similar 
> fuctionality by using instead the more compact API
>        words.countByWindow(Seconds(12), Seconds(6))
> which works fine. 
> Is the first approach above not the intended way of using the .window() API



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to