[ https://issues.apache.org/jira/browse/ARROW-1319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111211#comment-16111211 ]
Martin Durant commented on ARROW-1319:
--------------------------------------
Methods that I don't think exist in arrow's Python HDFS bindings, plus some
that exist under different names (and may already be aliased). Not all of them
are "basic filesystem" operations.
'delegate_token'
'disconnect' (maybe not required)
'get' = download
'get_block_locations'
'getmerge' (many remote files to one local file)
'glob'
'head'
'makedirs'
'mv' = rename
'put' = upload
'read_block' (delimited read)
'renew_token'
'rm' = delete
'set_replication'
'tail'
'touch'
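
To illustrate why some of these look simple to add, here is a rough sketch of
'head', 'tail' and a delimited 'read_block' written against any seekable binary
file object. This is not the hdfs3 or arrow code: the function names, the
signatures and the helper _seek_past_delimiter are assumptions made for
illustration, and io.BytesIO stands in for an open HDFS file.

```python
import io


def head(f, size=1024):
    """Return the first `size` bytes of a seekable binary file object."""
    f.seek(0)
    return f.read(size)


def tail(f, size=1024):
    """Return the last `size` bytes of a seekable binary file object."""
    length = f.seek(0, io.SEEK_END)
    f.seek(max(length - size, 0))
    return f.read(size)


def _seek_past_delimiter(f, delimiter, chunk=4096):
    """Hypothetical helper: move f just past the next delimiter (or to EOF)."""
    start = f.tell()
    buf = b""
    while True:
        data = f.read(chunk)
        if not data:
            return f.tell()  # no delimiter found before EOF
        buf += data
        i = buf.find(delimiter)
        if i >= 0:
            pos = start + i + len(delimiter)
            f.seek(pos)
            return pos


def read_block(f, offset, length, delimiter=None):
    """Read about `length` bytes from `offset`, aligned to delimiter boundaries.

    With a delimiter, the block starts after the first delimiter found at or
    after `offset` (unless offset is 0) and ends after the first delimiter
    found at or after `offset + length`.
    """
    if delimiter:
        f.seek(offset)
        start = offset if offset == 0 else _seek_past_delimiter(f, delimiter)
        f.seek(offset + length)
        end = _seek_past_delimiter(f, delimiter)
        offset, length = start, end - start
    f.seek(offset)
    return f.read(length)


# Stand-in for a remote file: four newline-terminated records.
f = io.BytesIO(b"alpha\nbeta\ngamma\ndelta\n")
blocks = [read_block(f, off, 8, delimiter=b"\n") for off in (0, 8, 16)]
```

Reading consecutive fixed-size ranges this way yields each record exactly once,
which is the property that makes a delimited read useful for splitting a text
file across workers.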
On file objects: readlines/iteration (possibly better handled with
io.TextIOWrapper); flush?; I'm not sure whether all the standard file methods
are there (readable, read1...)
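
As a sketch of the io.TextIOWrapper idea: if the binary file object behaves
like a standard buffered stream, line iteration comes for free from the
standard library. The helper name iter_lines is hypothetical, and io.BytesIO
again stands in for an open HDFS file; whether arrow's file objects currently
expose everything TextIOWrapper expects is exactly the open question.

```python
import io


def iter_lines(binary_file, encoding="utf-8"):
    """Yield decoded text lines from a binary file-like object."""
    text = io.TextIOWrapper(binary_file, encoding=encoding)
    try:
        for line in text:
            yield line
    finally:
        # Detach so the caller's binary file is not closed with the wrapper.
        text.detach()


raw = io.BytesIO(b"caf\xc3\xa9\nbar\n")
lines = list(iter_lines(raw))
```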
Methods implemented in unreleased hdfs3:
'cancel_token'
'concat' (limited to whole blocks for hadoop 1.6)
'create_encryption_zone'
'list_encryption_zones'
> [Python] Add additional HDFS filesystem methods
> -----------------------------------------------
>
> Key: ARROW-1319
> URL: https://issues.apache.org/jira/browse/ARROW-1319
> Project: Apache Arrow
> Issue Type: Improvement
> Reporter: Martin Durant
>
> The Python library hdfs3 (http://hdfs3.readthedocs.io/en/latest/api.html)
> contains a wider set of file-system methods than Arrow's Python bindings.
> These are probably simple to implement for arrow-hdfs.