[ https://issues.apache.org/jira/browse/BEAM-8399?focusedWorklogId=383310&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-383310 ]
ASF GitHub Bot logged work on BEAM-8399:
----------------------------------------
Author: ASF GitHub Bot
Created on: 07/Feb/20 01:37
Start Date: 07/Feb/20 01:37
Worklog Time Spent: 10m
Work Description: udim commented on pull request #10223: [BEAM-8399] Add
--hdfs_full_urls option (wip)
URL: https://github.com/apache/beam/pull/10223#discussion_r376174956
##########
File path: sdks/python/apache_beam/io/hadoopfilesystem_test.py
##########
@@ -323,7 +375,7 @@ def test_create_success(self):
url = self.fs.join(self.tmpdir, 'new_file')
handle = self.fs.create(url)
self.assertIsNotNone(handle)
- url = self.fs._parse_url(url)
+ _, url = self.fs._parse_url(url)
Review comment:
There will be a separate `test_parse_url` to test these return values.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 383310)
Time Spent: 1h 40m (was: 1.5h)
> Python HDFS implementation should support filenames of the format
> "hdfs://namenodehost/parent/child"
> ----------------------------------------------------------------------------------------------------
>
> Key: BEAM-8399
> URL: https://issues.apache.org/jira/browse/BEAM-8399
> Project: Beam
> Issue Type: Improvement
> Components: sdk-py-core
> Reporter: Chamikara Madhusanka Jayalath
> Assignee: Udi Meiri
> Priority: Major
> Time Spent: 1h 40m
> Remaining Estimate: 0h
>
> "hdfs://namenodehost/parent/child" and "/parent/child" seem to be the
> correct filename formats for HDFS based on [1], but we currently support
> the format "hdfs://parent/child".
> To avoid breaking existing users, we have to either (1) somehow support both
> versions by default (based on [2], it seems HDFS does not allow colons in
> file paths, so this might be possible) or (2) make
> "hdfs://namenodehost/parent/child" optional for now and change it to the
> default after a few versions.
> We should also make sure that Beam Java and Python HDFS file-system
> implementations are consistent in this regard.
>
> [1] https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/FileSystemShell.html
> [2] https://issues.apache.org/jira/browse/HDFS-13
>
> cc: [~udim]
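
For illustration only, here is a minimal sketch of how option (2) could behave. It is not Beam's actual `_parse_url`; the helper name `parse_hdfs_url`, its `full_urls` parameter (named after the `--hdfs_full_urls` option in the PR title), and the `(server, path)` return shape (suggested by the two-value unpacking in the diff above) are all assumptions.

```python
def parse_hdfs_url(url, full_urls=False):
    """Hypothetical helper: split an HDFS URL into (server, path).

    Assumed behavior, not taken from Beam's source:
      full_urls=False  legacy form   "hdfs://parent/child"
                       -> (None, "/parent/child")
      full_urls=True   full form     "hdfs://namenodehost/parent/child"
                       -> ("namenodehost", "/parent/child")
    A bare absolute path such as "/parent/child" is accepted in both modes.
    """
    if url.startswith('/'):
        # Bare absolute path: no server component.
        return None, url

    prefix = 'hdfs://'
    if not url.startswith(prefix):
        raise ValueError('Not an HDFS URL: %r' % url)
    rest = url[len(prefix):]

    if not full_urls:
        # Legacy interpretation: everything after the scheme is the path.
        return None, '/' + rest.lstrip('/')

    # Full interpretation: the first component is the namenode
    # (optionally "host:port"), the remainder is the path.
    server, _, path = rest.partition('/')
    return (server or None), '/' + path


# Example usage under the assumptions above.
print(parse_hdfs_url('hdfs://parent/child'))
print(parse_hdfs_url('hdfs://namenodehost/parent/child', full_urls=True))
```

Keeping the legacy interpretation as the default means existing "hdfs://parent/child" users are unaffected until the flag is turned on.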
--
This message was sent by Atlassian Jira
(v8.3.4#803005)