Silemo opened a new issue, #450:
URL: https://github.com/apache/arrow-rs-object-store/issues/450

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   <!--
   A clear and concise description of what the problem is. Ex. I'm always 
frustrated when [...] 
   (This section helps Arrow developers understand the context and *why* for 
this feature, in addition to  the *what*)
   -->
   Young developer here. As part of a larger project I am currently working on 
I need to access hdfs using your object_store interface. I am currently looking 
to a couple of repositories that tried adding support for HDFS, either having a 
wrapper around libhdfs or native support.
   
   **Describe the solution you'd like**
   <!--
   A clear and concise description of what you want to happen.
   -->
   I would like to make these solutions general enough so that they can be 
added to this repository as part of the object_store crate.
   
   **Describe alternatives you've considered**
   <!--
   A clear and concise description of any alternative solutions or features 
you've considered.
   -->
   The choice that needs to be taken is between using 
https://github.com/datafusion-contrib/datafusion-objectstore-hdfs/tree/master  
which is a wrapper around libhdfs. Or try to generalise the solution of 
https://github.com/Kimahriman/hdfs-native . 
   
   **Additional context**
   <!--
   Add any other context or screenshots about the feature request here.
   -->
   As I am a young developer, I have a limited idea of which best practices and 
design choices are better suited for this task and I could use some help. 
Furthermore I could benefit from the feedback of someone that has contributed 
directly to the object_store crate.
   
   Here's a list of question that might help me get started, feel free to add 
any comment or recommendation:
   * When contributing to such a large change (more than 70 lines change) you 
still recommend a single PR? How should I prepare the work for the contribution?
   * HDFS, being a file system operates more similarly to your Local File 
system implementation than S3, GCP or Azure. Do you prefer then that I create a 
single hdfs.rs file as local.rs, or you still would opt for a folder?
   * Is there something in particular I should keep in mind when implementing 
for hdfs or contributing to this repo?
   
   Thanks in advance for your response
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to