[ 
https://issues.apache.org/jira/browse/ARROW-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rok Mihevc updated ARROW-1320:
------------------------------
    External issue URL: https://github.com/apache/arrow/issues/17350

> hdfs block locations
> --------------------
>
>                 Key: ARROW-1320
>                 URL: https://issues.apache.org/jira/browse/ARROW-1320
>             Project: Apache Arrow
>          Issue Type: Improvement
>            Reporter: Martin Durant
>            Priority: Major
>
> Provide a function that returns the set of machines on which the data 
> blocks of a given HDFS file are stored. This is most useful for scheduling 
> systems (e.g., dask) that can move the computation to the machine holding 
> the data, and so avoid network data traffic.
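
A minimal sketch of how a scheduler might use such block locations, assuming a hypothetical per-block record of (offset, length, hosts). The record layout, host names, and function names below are illustrative assumptions, not part of Arrow or libhdfs:

```python
from collections import defaultdict

# Hypothetical block-location records: (offset, length, hosts holding a replica).
# The layout and host names are assumptions for illustration only.
blocks = [
    (0, 128, ["node1", "node2"]),
    (128, 128, ["node2", "node3"]),
    (256, 64, ["node2", "node1"]),
]


def bytes_per_host(block_locations):
    """Sum, for each host, the bytes of the file stored locally on it."""
    totals = defaultdict(int)
    for _offset, length, hosts in block_locations:
        # set() guards against a host being listed twice for one block.
        for host in set(hosts):
            totals[host] += length
    return dict(totals)


def preferred_host(block_locations):
    """Pick the host with the most local bytes (ties broken by name)."""
    totals = bytes_per_host(block_locations)
    if not totals:
        return None  # no blocks, so no locality preference
    return min(totals, key=lambda h: (-totals[h], h))
```

A scheduler could then place the task reading this file on `preferred_host(blocks)` and fall back to any worker when it returns `None`.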


