Luke Cwik created BEAM-2265:
-------------------------------
Summary: Python word count gets stuck during application
termination on Windows
Key: BEAM-2265
URL: https://issues.apache.org/jira/browse/BEAM-2265
Project: Beam
Issue Type: Bug
Components: sdk-py
Reporter: Luke Cwik
Assignee: Ahmet Altay
Using virtualenv 15 + python 2.7.13 + pip 9.0.1 on Windows 2016
Example logs from DirectRunner:
{code}
(beamRC2)PS C:\Users\lcwik\.virtualenvs\beamRC2> python -m
apache_beam.examples.wordcount --input ".\input\*" --output l
ocal_counts
No handlers could be found for logger "oauth2client.contrib.multistore_file"
INFO:root:Missing pipeline option (runner). Executing pipeline using the
default runner: DirectRunner.
INFO:root:Running pipeline with DirectRunner.
{code}
Application gets stuck here, pressing ctrl-z gets it unstuck and the remainder
below is logged
{code}
INFO:root:Starting finalize_write threads with num_shards: 1, batches: 1,
num_threads: 1
INFO:root:Renamed 1 shards in 0.14 seconds.
INFO:root:number of empty lines: 47851
INFO:root:average word length: 4
{code}
Output is correct, so it seems as though the bug is somewhere in shutdown.
Happens when using a local or gs path with the DirectRunner or using
DataflowRunner. Enabling DEBUG logging did not add any additional details.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)