[ 
https://issues.apache.org/jira/browse/ARROW-1319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16111211#comment-16111211
 ] 

Martin Durant commented on ARROW-1319:
--------------------------------------

Methods that I don't think exist, and some that have different names (and maybe 
already aliased). Not all are "basic filesystem" operations.
 'delegate_token'
 'disconnect' (maybe not required)
 'get' = download
 'get_block_locations',
 'getmerge' (many remote files to one local file)
 'glob',
 'head',
 'makedirs',
 'mv' = rename
 'put' = upload
 'read_block' (delimited read)
 'renew_token',
 'rm' = delete
 'set_replication',
 'tail',
 'touch'

On files: readlines/iteration (maybe better with io.TextIOWrapper); flush?; not 
sure if all standard file methods are there (readable, read1...)

Methods implemented in unreleased hdfs3:
 'cancel_token',
 'concat' (limited to whole blocks for hadoop 1.6)
 'create_encryption_zone',
 'list_encryption_zones',


> [Python] Add additional HDFS filesystem methods
> -----------------------------------------------
>
>                 Key: ARROW-1319
>                 URL: https://issues.apache.org/jira/browse/ARROW-1319
>             Project: Apache Arrow
>          Issue Type: Improvement
>            Reporter: Martin Durant
>
> The python library hdfs3 http://hdfs3.readthedocs.io/en/latest/api.html 
> contains a wider set of file-system methods than arrow's python bindings. 
> These are probably simple to implement for arrow-hdfs.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to