[
https://issues.apache.org/jira/browse/ARROW-3503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368242#comment-17368242
]
Ian Cook commented on ARROW-3503:
---------------------------------
I think it is fairly uncommon for {{HADOOP_BIN}} to be in some other location.
[~wbzhao] are you interested in rebasing and updating the PR
([https://github.com/apache/arrow/pull/2748)] to resolve conflicts that have
been introduced by other commits since the PR was originally created? If not I
think we could close this as "won't fix". Thanks.
> [Python] Allow config hadoop_bin in pyarrow hdfs.py
> ----------------------------------------------------
>
> Key: ARROW-3503
> URL: https://issues.apache.org/jira/browse/ARROW-3503
> Project: Apache Arrow
> Issue Type: New Feature
> Components: Python
> Reporter: Wenbo Zhao
> Priority: Major
> Labels: filesystem, pull-request-available
> Time Spent: 1h 40m
> Remaining Estimate: 0h
>
> Currently, the hadoop_bin is either from `HADOOP_HOME` or the `hadoop`
> command.
> [https://github.com/apache/arrow/blob/master/python/pyarrow/hdfs.py#L130]
> However, in some of environment setup, hadoop_bin could be some other
> location. Can we do something like
>
> {code:java}
> if 'HADOOP_BIN' in os.environ:
> hadoop_bin = os.environ['HADOOP_BIN']
> elif 'HADOOP_HOME' in os.environ:
> hadoop_bin = '{0}/bin/hadoop'.format(os.environ['HADOOP_HOME'])
> else:
> hadoop_bin = 'hadoop'
> {code}
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)