[
https://issues.apache.org/jira/browse/ARROW-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rok Mihevc updated ARROW-4820:
------------------------------
External issue URL: https://github.com/apache/arrow/issues/21337
> [Python] hadoop class path derived not correct
> ----------------------------------------------
>
> Key: ARROW-4820
> URL: https://issues.apache.org/jira/browse/ARROW-4820
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.12.0, 0.12.1
> Reporter: Tiger068
> Assignee: Tiger068
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.13.0
>
> Attachments: image-2019-03-11-16-39-25-195.png
>
> Time Spent: 50m
> Remaining Estimate: 0h
>
> in hdfs.py, the method _derive_hadoop_classpath add jar files under
> $HADOOP_HOME into hadoop classpath,but the hadoop config directory is not
> contained in classpath.
>
> when hadoop HA mode enabled,the hdfs uri like this: hdfs://ns
> when the HADOOP_CONF_DIR directory is not in the hadoop classpath,the libhdfs
> can not locate the right hdfs-site.xml, in the HA mode, hdfs service name
> was parsed as host name ,it is not correct
> !image-2019-03-11-16-39-25-195.png!
--
This message was sent by Atlassian Jira
(v8.20.10#820010)