dsevilla opened a new pull request, #47457:
URL: https://github.com/apache/arrow/pull/47457

   Refactor host validation to allow 'hdfs://default' as a valid input to 
correctly fix #25324 when used via `from_uri()`.
   
   ### Rationale for this change
   
   While `fs.HadoopFileSystem()` could be created with the special host 
'default' to make the underlying hdfs library to search for the correct URL in 
$HADOOP_CONF_DIR/core-site.xml, the fix does not work using `from_uri()`. This 
is because `from_uri()` has to have a complete protocol URI (hdfs://default), 
but the underlying hdfs library must have exactly 'default', without the URI 
type. The code includes the previous fix and fixes also this case.
   
   ### What changes are included in this PR?
   
   A correct fix of #25324 for the `from_uri()` case.
   
   ### Are these changes tested?
   
   Yes
   
   ### Are there any user-facing changes?
   
   Yes, now the `from_uri()` function should work with `hdfs://default...` URI.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to