[
https://issues.apache.org/jira/browse/ARROW-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rok Mihevc updated ARROW-1320:
------------------------------
External issue URL: https://github.com/apache/arrow/issues/17350
> hdfs block locations
> --------------------
>
> Key: ARROW-1320
> URL: https://issues.apache.org/jira/browse/ARROW-1320
> Project: Apache Arrow
> Issue Type: Improvement
> Reporter: Martin Durant
> Priority: Major
>
> To provide a function which can return the set of machines on which the data
> blocks of a given hdfs file are stored. This is best for scheduling systems
> (e.g., dask) which can move the computation to the machine which has the
> data, and so cut out network data traffic.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)