Hi, you are most likely missing the dependency that provides the S3 filesystem (beam-sdks-java-io-amazon-web-services).
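One quick way to test this guess is to check whether the S3 filesystem registrar class is visible to the job. The sketch below uses only the JDK. The class name `org.apache.beam.sdk.io.aws.s3.S3FileSystemRegistrar` is my assumption about what that artifact ships, and `S3FileSystemCheck` is a made-up helper name, so treat it as a rough diagnostic rather than an official Beam check.

```java
public class S3FileSystemCheck {
    // Assumption: registrar class provided by beam-sdks-java-io-amazon-web-services.
    static final String S3_REGISTRAR =
        "org.apache.beam.sdk.io.aws.s3.S3FileSystemRegistrar";

    // Returns true if the named class can be found by this class's class loader.
    // The class is looked up without being initialized.
    public static boolean isOnClasspath(String className) {
        try {
            Class.forName(className, false, S3FileSystemCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(
            "S3 filesystem registrar present: " + isOnClasspath(S3_REGISTRAR));
    }
}
```

If this prints false when run with the same jar you submit to Flink, the artifact is probably not being bundled into that jar.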
Best,
D.

> On 10 Aug 2019, at 18:27, jitendra sharma <[email protected]> wrote:
>
> Hi,
>
> I am getting below error reading files from S3. Could you please help me what
> could be the problem?
>
> Caused by: org.apache.flink.client.program.ProgramInvocationException:
> java.lang.Exception: The data preparation for task 'GroupReduce (GroupReduce
> at Checking file pattern/Via
> MatchAll/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey)' , caused an error:
> Error obtaining the sorted input: Thread 'SortMerger Reading Thread'
> terminated due to an exception: No filesystem found for scheme s3
>     at org.apache.flink.client.program.rest.RestClusterClient.submitJob(RestClusterClient.java:265)
>     at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:464)
>     at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:452)
>     at org.apache.flink.client.program.ContextEnvironment.execute(ContextEnvironment.java:62)
>     at org.apache.beam.runners.flink.FlinkPipelineExecutionEnvironment.executePipeline(FlinkPipelineExecutionEnvironment.java:122)
>     at org.apache.beam.runners.flink.FlinkRunner.run(FlinkRunner.java:110)
>     ... 25 more
> Caused by: java.lang.Exception: The data preparation for task 'GroupReduce
> (GroupReduce at Checking file pattern/Via
> MatchAll/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey)' , caused an error:
> Error obtaining the sorted input: Thread 'SortMerger Reading Thread'
> terminated due to an exception: No filesystem found for scheme s3
>     at org.apache.flink.runtime.operators.BatchTask.run(BatchTask.java:479)
>     at org.apache.flink.runtime.operators.BatchTask.invoke(BatchTask.java:368)
>     at org.apache.flink.runtime.taskmanager.Task.run(Task.java:703)
>     at java.lang.Thread.run(Thread.java:748)
> Caused by: java.lang.RuntimeException: Error obtaining the sorted input:
> Thread 'SortMerger Reading Thread' terminated due to an exception: No
> filesystem found for scheme s3
>     at org.apache.flink.runtime.operators.sort.UnilateralSortMerger.getIterator(UnilateralSortMerger.java:619)
>     at org.apache.flink.runtime.operators.BatchTask.getInput(BatchTask.java:1108)
>     at org.apache.flink.runtime.operators.GroupReduceDriver.prepare(GroupReduceDriver.java:99)
>     at org.apache.flink.runtime.operators.BatchTask.run(BatchTask.java:473)
>     ... 3 more
>
> Regards,
> Jitendra Sharma
