shunping opened a new pull request, #35849: URL: https://github.com/apache/beam/pull/35849
There is a bug in `_join_url_or_filepath`: when a provider is defined with an absolute file path, the function still joins that path onto the base, so the prefix is duplicated. The following snippet reproduces the issue.

```python
from apache_beam.yaml.yaml_provider import _join_url_or_filepath

# Result: gs://my_bucket/gs://my_bucket/my_jar.jar
print(_join_url_or_filepath('gs://my_bucket', 'gs://my_bucket/my_jar.jar'))

# Result: https://storage.mtls.cloud.google.com/my_bucket/my_jar.jar
print(_join_url_or_filepath('gs://my_bucket', 'https://storage.mtls.cloud.google.com/your_bucket/my_jar.jar'))
```

internal bug: 437204659
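For illustration only, the intended behavior might look like the sketch below. It is not the code in this pull request and not Beam's implementation: `join_base_and_path` is a hypothetical standalone helper, it assumes the base names a directory-like prefix, and it relies only on the standard library. The idea is simply to return the path unchanged when it is already absolute, and to join it onto the base otherwise.

```python
import posixpath
import urllib.parse


def join_base_and_path(base: str, path: str) -> str:
    """Hypothetical helper: join `path` onto `base` unless `path` is absolute.

    Assumption: `base` is a directory-like prefix such as 'gs://my_bucket'.
    """
    if not base:
        return path
    # A path that carries its own scheme (gs://, https://, ...) or starts at
    # the filesystem root is already absolute, so the base must not be
    # prepended. Caveat: a relative name containing ':' before any '/' would
    # also be parsed as having a scheme.
    if urllib.parse.urlparse(path).scheme or posixpath.isabs(path):
        return path
    # Relative path: append it to the base with exactly one separator.
    return base.rstrip('/') + '/' + path


# Absolute path with the same prefix as the base: no duplication.
print(join_base_and_path('gs://my_bucket', 'gs://my_bucket/my_jar.jar'))
# Relative path: joined onto the base.
print(join_base_and_path('gs://my_bucket', 'my_jar.jar'))
```

Under these assumptions both calls print `gs://my_bucket/my_jar.jar`, which is the result the reproduction above expects for the absolute-path case.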