[ https://issues.apache.org/jira/browse/BEAM-2277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16008091#comment-16008091 ]
ASF GitHub Bot commented on BEAM-2277:
--------------------------------------
GitHub user aviemzur opened a pull request:
https://github.com/apache/beam/pull/3115
[BEAM-2277] Fix URI_SCHEME_PATTERN in FileSystems
Be sure to do all of the following to help us incorporate your contribution
quickly and easily:
- [ ] Make sure the PR title is formatted like:
`[BEAM-<Jira issue #>] Description of pull request`
- [ ] Make sure tests pass via `mvn clean verify`.
- [ ] Replace `<Jira issue #>` in the title with the actual Jira issue
number, if there is one.
- [ ] If this contribution is large, please file an Apache
[Individual Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
---
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/aviemzur/beam fix-uri-scheme-pattern-in-filesystems
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/beam/pull/3115.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #3115
----
commit ca05ed560888a2f2a86442b89e23fb7f49e1acfd
Author: Aviem Zur <[email protected]>
Date: 2017-05-12T13:02:54Z
[BEAM-2277] Fix URI_SCHEME_PATTERN in FileSystems
----
> IllegalArgumentException when using Hadoop file system for WordCount example.
> -----------------------------------------------------------------------------
>
> Key: BEAM-2277
> URL: https://issues.apache.org/jira/browse/BEAM-2277
> Project: Beam
> Issue Type: Bug
> Components: sdk-java-extensions
> Reporter: Aviem Zur
> Assignee: Aviem Zur
> Priority: Blocker
> Fix For: 2.0.0
>
>
> IllegalArgumentException when using Hadoop file system for WordCount example.
> Occurred when running WordCount example using Spark runner on a YARN cluster.
> Command-line arguments:
> {code:none}
> --runner=SparkRunner --inputFile=hdfs:///user/myuser/kinglear.txt --output=hdfs:///user/myuser/wc/wc
> {code}
> Stack trace:
> {code:none}
> java.lang.IllegalArgumentException: Expect srcResourceIds and destResourceIds have the same scheme, but received file, hdfs.
>     at org.apache.beam.sdk.repackaged.com.google.common.base.Preconditions.checkArgument(Preconditions.java:122)
>     at org.apache.beam.sdk.io.FileSystems.validateSrcDestLists(FileSystems.java:394)
>     at org.apache.beam.sdk.io.FileSystems.copy(FileSystems.java:236)
>     at org.apache.beam.sdk.io.FileBasedSink$WriteOperation.copyToOutputFiles(FileBasedSink.java:626)
>     at org.apache.beam.sdk.io.FileBasedSink$WriteOperation.finalize(FileBasedSink.java:516)
>     at org.apache.beam.sdk.io.WriteFiles$2.processElement(WriteFiles.java:592)
> {code}
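
For readers following the stack trace, here is a minimal sketch of how a scheme-detection regex could produce a "file, hdfs" mismatch. It assumes the scheme is extracted with a pattern and that unmatched specs fall back to `file`. The class name `SchemeSketch`, both patterns, and the temp path are illustrative assumptions, not the actual Beam code or the actual change in the pull request.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SchemeSketch {
  // Hypothetical strict pattern: a scheme is recognized only when followed by "://".
  static final Pattern STRICT =
      Pattern.compile("(?<scheme>[a-zA-Z][-a-zA-Z0-9+.]*)://.*");

  // Hypothetical relaxed pattern: a single slash after the colon is enough.
  static final Pattern RELAXED =
      Pattern.compile("(?<scheme>[a-zA-Z][-a-zA-Z0-9+.]*):/.*");

  // Returns the lower-cased scheme if the whole spec matches, otherwise "file".
  static String parseScheme(Pattern pattern, String spec) {
    Matcher m = pattern.matcher(spec);
    return m.matches() ? m.group("scheme").toLowerCase() : "file";
  }

  public static void main(String[] args) {
    // Destination as given on the command line in the report.
    String dest = "hdfs:///user/myuser/wc/wc";
    // Assumed source: a path whose URI was normalized to a single slash.
    String src = "hdfs:/user/myuser/wc/.temp/part-0";

    // With the strict pattern the two schemes disagree (file vs. hdfs).
    System.out.println(parseScheme(STRICT, src) + ", " + parseScheme(STRICT, dest));
    // With the relaxed pattern both resolve to hdfs.
    System.out.println(parseScheme(RELAXED, src) + ", " + parseScheme(RELAXED, dest));
  }
}
```

One trade-off of the relaxed form in this sketch: a spec such as `C:/data/file.txt` would be read as having scheme `c`, so a real implementation may need extra handling for drive letters.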
--
This message was sent by Atlassian JIRA (v6.3.15#6346)