Luke Cwik created BEAM-5488:
-------------------------------
Summary: DirectRunner not producing output on TextIO
withWindowedWrites() and withNumShards(1)
Key: BEAM-5488
URL: https://issues.apache.org/jira/browse/BEAM-5488
Project: Beam
Issue Type: Bug
Components: sdk-java-core
Reporter: Luke Cwik
Source of bug (Slack userĀ [https://the-asf.slack.com/team/UCVN8DK7V]) andĀ
[https://stackoverflow.com/questions/52445414/apache-beam-not-saving-unbounded-data-to-text-file].
Example provided:
{code:java}
public static void main(String[] args) {
ExerciseOptions options =
PipelineOptionsFactory.fromArgs(args).withValidation().as(ExerciseOptions.class);
Pipeline pipeline = Pipeline.create(options);
pipeline
.apply("Read Messages from Pubsub",
PubsubIO
.readStrings()
.fromTopic(options.getTopicName()))
.apply("Set event timestamp", ParDo.of(new DoFn<String, String>() {
@ProcessElement
public void processElement(ProcessContext context) {
context.outputWithTimestamp(context.element(), Instant.now());
}
}))
.apply("Windowing",
Window.into(FixedWindows.of(Duration.standardMinutes(5))))
.apply("Write to File",
TextIO
.write()
.withWindowedWrites()
.withNumShards(1)
.to(options.getOutputPrefix()));
pipeline.run();
}
{code}
Produces output when executed on the DataflowRunner, does not produce output on
the DirectRunner.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)