Luke Cwik created BEAM-2265:
-------------------------------

             Summary: Python word count gets stuck during application termination on Windows
                 Key: BEAM-2265
                 URL: https://issues.apache.org/jira/browse/BEAM-2265
             Project: Beam
          Issue Type: Bug
          Components: sdk-py
            Reporter: Luke Cwik
            Assignee: Ahmet Altay


Using virtualenv 15 + python 2.7.13 + pip 9.0.1 on Windows 2016

Example logs from DirectRunner:
{code}
(beamRC2)PS C:\Users\lcwik\.virtualenvs\beamRC2> python -m apache_beam.examples.wordcount --input ".\input\*" --output local_counts
No handlers could be found for logger "oauth2client.contrib.multistore_file"
INFO:root:Missing pipeline option (runner). Executing pipeline using the default runner: DirectRunner.
INFO:root:Running pipeline with DirectRunner.
{code}

The application gets stuck here. Pressing Ctrl-Z gets it unstuck, and the
remainder below is logged:
{code}
INFO:root:Starting finalize_write threads with num_shards: 1, batches: 1, num_threads: 1
INFO:root:Renamed 1 shards in 0.14 seconds.
INFO:root:number of empty lines: 47851
INFO:root:average word length: 4
{code}

The output is correct, so the bug appears to be somewhere in shutdown.
It happens with a local or gs path under the DirectRunner, and also under the
DataflowRunner. Enabling DEBUG logging did not add any details.
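
Since DEBUG logging shows nothing, one way to narrow down a hang in shutdown is to dump the stack of every thread that is still alive. The following is a minimal sketch and was not run as part of this report. The helper name dump_live_threads is hypothetical and is not part of apache_beam. It relies only on the standard library (threading.enumerate, sys._current_frames, traceback.format_stack).

```python
import sys
import threading
import traceback


def dump_live_threads(out=sys.stderr):
    """Write the name, daemon flag, and current stack of every live thread.

    Hypothetical diagnostic helper, not part of apache_beam.
    Returns the list of thread names that were dumped.
    """
    frames = sys._current_frames()  # maps thread id -> current stack frame
    names = []
    for thread in threading.enumerate():
        names.append(thread.name)
        out.write("Thread %s (daemon=%s)\n" % (thread.name, thread.daemon))
        frame = frames.get(thread.ident)
        if frame is not None:
            out.write("".join(traceback.format_stack(frame)))
    return names
```

Calling this from a daemon timer thread some time after the pipeline starts might show which threads are still blocked while the process appears stuck.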



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)