[ 
https://issues.apache.org/jira/browse/SPARK-5252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen updated SPARK-5252:
-----------------------------
    Component/s: PySpark
                 Examples

Looks like you have an environment problem:

{code}
java.io.IOException: HADOOP_HOME or hadoop.home.dir are not set.
{code}

Can you resolve this and then see if you have this problem?

> Streaming StatefulNetworkWordCount example hangs
> ------------------------------------------------
>
>                 Key: SPARK-5252
>                 URL: https://issues.apache.org/jira/browse/SPARK-5252
>             Project: Spark
>          Issue Type: Bug
>          Components: Examples, PySpark, Streaming
>    Affects Versions: 1.2.0
>         Environment: Ubuntu Linux
>            Reporter: Lutz Buech
>         Attachments: debug.txt
>
>
> Running the stateful network word count example in Python (on one local node):
> https://github.com/apache/spark/blob/master/examples/src/main/python/streaming/stateful_network_wordcount.py
> At the beginning, when no data is streamed, empty status outputs are 
> generated, only decorated by the current Time, e.g.:
> -------------------------------------------
> Time: 2015-01-14 17:58:20
> -------------------------------------------
> -------------------------------------------
> Time: 2015-01-14 17:58:21
> -------------------------------------------
> As soon as I stream some data via netcat, no new status updates will show. 
> Instead, one line saying
> [Stage <number>:====================================================>         
>                  (2 + 0) / 3]
> where <number> is some integer number, e.g. 132. There is no further output 
> on stdout.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to