[ 
https://issues.apache.org/jira/browse/BEAM-2266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16008184#comment-16008184
 ] 

Thomas Weise commented on BEAM-2266:
------------------------------------

We need to add support for larger bundles; I think that is the only way to get 
better default behavior, unless we want to modify the example to let the user 
influence the number of shards, as was done here:

https://github.com/tweise/apex-samples/blob/master/beam-apex-wordcount/src/main/java/com/example/myapexapp/Application.java#L186
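
To illustrate the two options, here is a rough sketch in plain Java. It is a toy model only, not Beam or Apex code: it assumes one output file is written per bundle, and the class and method names (ShardSketch, filesPerBundle, filesWithFixedShards) are made up for this illustration. In the Beam Java SDK, the knob for a fixed shard count is TextIO.write().withNumShards(n); I am not claiming that is exactly what the linked example does.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ShardSketch {

    // Toy model (assumption): the sink writes one file per bundle.
    // With bundleSize == 1, every "word: <count>" record lands in its own file.
    static List<List<String>> filesPerBundle(List<String> records, int bundleSize) {
        List<List<String>> files = new ArrayList<>();
        for (int i = 0; i < records.size(); i += bundleSize) {
            int end = Math.min(i + bundleSize, records.size());
            files.add(new ArrayList<>(records.subList(i, end)));
        }
        return files;
    }

    // Alternative: the user fixes the number of shards up front, and each
    // record is routed to a shard by hash, independent of bundle size.
    static List<List<String>> filesWithFixedShards(List<String> records, int numShards) {
        List<List<String>> files = new ArrayList<>();
        for (int s = 0; s < numShards; s++) {
            files.add(new ArrayList<>());
        }
        for (String record : records) {
            files.get(Math.floorMod(record.hashCode(), numShards)).add(record);
        }
        return files;
    }

    public static void main(String[] args) {
        List<String> counts = Arrays.asList("a: 1", "b: 2", "c: 3", "d: 4", "e: 5");
        // Single-element bundles: one file per word count.
        System.out.println(filesPerBundle(counts, 1).size());
        // Larger bundles: fewer files with no change to the example.
        System.out.println(filesPerBundle(counts, 3).size());
        // Fixed shard count: the file count is whatever the user asked for.
        System.out.println(filesWithFixedShards(counts, 2).size());
    }
}
```

In this model, larger bundles improve the default without touching the example, while a fixed shard count gives the user direct control but has to be set in the pipeline itself.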


> WordCount on ApexRunner writes one word count per file
> ------------------------------------------------------
>
>                 Key: BEAM-2266
>                 URL: https://issues.apache.org/jira/browse/BEAM-2266
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-apex
>            Reporter: Kenneth Knowles
>            Assignee: Thomas Weise
>
> This has been a common issue with most streaming runners. When running on 
> Apex, you get one "word: <count>" entry per file. Perhaps something smarter 
> could be done as a default in such cases. This will also occur with 
> single-element bundles; in this case there is a single element per key, so 
> single-key bundles result in single-element bundles.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
