Yes, I just tested it: it happens only with the Flink runner.

I agree we should create a Jira issue.

Regards
JB

On 06/01/2016 03:41 AM, Davor Bonaci wrote:
This will be a runner-specific issue. It would be best to file a
JIRA issue for this.

On Tue, May 31, 2016 at 9:46 AM, Jean-Baptiste Onofré <[email protected]> wrote:

    Hi Pawel,

    Does it happen only with the Flink runner? I bet it happens with
    any runner.

    Let me take a look.

    Regards
    JB

    On 05/30/2016 01:38 AM, Pawel Szczur wrote:

        Hi,

        I'm running a pipeline with the Flink backend, Beam bleeding edge,
        Oracle Java 1.8, Maven 3.3.3, linux64.

        The pipeline is run with --parallelism=6.

        Adding .withoutSharding() causes a TextIO sink to write only one
        of the shards.

        Example use:
        data.apply(TextIO.Write.named("write-debug-csv").to("/tmp/some-stats"));
        vs.
        data.apply(TextIO.Write.named("write-debug-csv").to("/tmp/some-stats").withoutSharding());

        Result:
        Only part of the data is written to the file. Compared with the
        sharded output, it appears to be just one of the shard files.

        Cheers,
        Pawel


    --
    Jean-Baptiste Onofré
    [email protected]
    http://blog.nanthrax.net
    Talend - http://www.talend.com



