dsevilla opened a new pull request, #47457: URL: https://github.com/apache/arrow/pull/47457
Refactor host validation to accept `hdfs://default` as valid input, so that the fix for #25324 also works when the filesystem is created via `from_uri()`.

### Rationale for this change

`fs.HadoopFileSystem()` can be created with the special host 'default', which makes the underlying hdfs library look up the correct URL in $HADOOP_CONF_DIR/core-site.xml. That fix does not work with `from_uri()`, however: `from_uri()` requires a complete URI including the scheme (hdfs://default), whereas the underlying hdfs library must receive exactly 'default', without the scheme. This PR keeps the previous fix and also handles this case.

### What changes are included in this PR?

A correct fix of #25324 for the `from_uri()` case.

### Are these changes tested?

Yes

### Are there any user-facing changes?

Yes, `from_uri()` should now work with `hdfs://default...` URIs.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
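
The mismatch described in the rationale can be illustrated with a minimal sketch. This is not the code from the PR: `host_for_hdfs_library` is a hypothetical helper written only for illustration, and the way non-default hosts are passed through (scheme plus host) is an assumption. The only behavior taken from the description above is that the special host must reach the hdfs library as exactly 'default', without the scheme.

```python
from urllib.parse import urlsplit


def host_for_hdfs_library(uri: str) -> str:
    """Hypothetical helper: map a from_uri()-style URI to the host string
    that would be handed to the underlying hdfs library."""
    parts = urlsplit(uri)
    if parts.scheme != "hdfs":
        raise ValueError(f"expected an hdfs:// URI, got {uri!r}")
    host = parts.hostname or ""
    if host == "default":
        # Special case from the PR description: the library must see
        # exactly 'default' so that it consults core-site.xml.
        return "default"
    # Assumption for illustration: other hosts keep the scheme prefix.
    return f"{parts.scheme}://{host}"


print(host_for_hdfs_library("hdfs://default"))
print(host_for_hdfs_library("hdfs://namenode:8020/data"))
```

The sketch uses only the standard library so it runs without a Hadoop installation; it shows the string handling, not an actual connection.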