[ https://issues.apache.org/jira/browse/ARROW-473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17026798#comment-17026798 ]
Antoine Pitrou commented on ARROW-473:
--------------------------------------
Is this still desirable? Note that we may not keep the old HDFS API exposed.
The new API uses the FileSystem abstraction.
> [C++/Python] Add public API for retrieving block locations for a particular
> HDFS file
> -------------------------------------------------------------------------------------
>
> Key: ARROW-473
> URL: https://issues.apache.org/jira/browse/ARROW-473
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++, Python
> Reporter: Wes McKinney
> Priority: Major
> Labels: filesystem, hdfs, pull-request-available
> Time Spent: 20m
> Remaining Estimate: 0h
>
> This is necessary for applications looking to schedule data-local work.
> libhdfs does not have APIs to request the block locations directly, so we
> need to see if the {{hdfsGetHosts}} function will do what we need. For
> libhdfs3 there is a public API function