Hi Luke,

Thanks for the explanation; that does make sense. I was just curious as to why.

Best Regards,
Pulasthi

On Wed, Oct 30, 2019 at 1:40 PM Luke Cwik <lc...@google.com> wrote:

> A lot of the logic is around handling various error scenarios.
>
> You should notice that the majority of that graph is about passing around
> metadata about which files were written and what errors there were. That
> metadata is tiny in comparison and should only be a blip when compared to
> writing the files themselves.
>
> On Sun, Oct 20, 2019 at 10:17 PM Pulasthi Supun Wickramasinghe <
> pulasthi...@gmail.com> wrote:
>
>> Hi Devs,
>>
>> I was trying to understand the transformations created for the
>> following pipeline, which looks pretty simple. But the graph created
>> seems to be pretty complex. I have attached a rough sketch of the graph
>> that I understood from debugging the code below [1]. I was a little
>> puzzled as to why so many transformations are introduced for the
>> write() operation. Is this the normal behavior for I/O operations, or
>> am I missing something? Doesn't this introduce a lot of unwanted
>> overhead to a simple operation?
>>
>>     PCollection<String> result =
>>         p.apply(GenerateSequence.from(0).to(10))
>>             .apply(
>>                 ParDo.of(
>>                     new DoFn<Long, String>() {
>>                       @ProcessElement
>>                       public void processElement(ProcessContext c) throws Exception {
>>                         c.output(c.element().toString());
>>                       }
>>                     }));
>>
>>     result.apply(TextIO.write().to(new URI(resultPath).getPath() + "/part"));
>>
>> [1] beam graph
>> <https://docs.google.com/drawings/d/1Ptk8XQiiee5vymXrUZMYNQIS8iEnexKD4ucbZ4Uk-CE/edit?usp=drive_web>
>>
>> Best Regards,
>> Pulasthi
>> --
>> Pulasthi S. Wickramasinghe
>> PhD Candidate | Research Assistant
>> School of Informatics and Computing | Digital Science Center
>> Indiana University, Bloomington
>> cell: 224-386-9035
>>
>

--
Pulasthi S. Wickramasinghe
PhD Candidate | Research Assistant
School of Informatics and Computing | Digital Science Center
Indiana University, Bloomington
cell: 224-386-9035