tustvold opened a new issue, #4886:
URL: https://github.com/apache/arrow-rs/issues/4886

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   
   [Avro](https://avro.apache.org/docs/1.11.1/specification/) is a widely used 
binary, row-oriented data encoding. It is similar to protobuf and has seen wide 
adoption in the data ecosystem, especially for streaming workloads.
   
   **Describe the solution you'd like**
   
   A new arrow_avro crate will provide vectorised support for reading and 
writing Avro data. The APIs should be designed to work with the different 
framing mechanisms for Avro-encoded data, including 
[single object encoding](https://avro.apache.org/docs/1.11.1/specification/#single-object-encoding), 
[object container files](https://avro.apache.org/docs/1.11.1/specification/#object-container-files) 
and [message](https://avro.apache.org/docs/1.11.1/specification/#protocol-wire-format) 
framing, even if first-class support is not provided for all of these 
mechanisms.
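
   As a rough illustration of why the framing should be kept separate from the decoding, the sketch below tells two of these formats apart from their leading bytes. The magic values come from the Avro 1.11.1 specification; the `Framing` enum and `sniff_framing` function are hypothetical names invented for this example and are not part of any proposed arrow_avro API.

```rust
/// Framing mechanisms that can be recognised from a buffer's leading bytes.
/// (Hypothetical type, for illustration only.)
#[derive(Debug, PartialEq, Eq)]
pub enum Framing {
    /// Object container file: starts with the four bytes `O`, `b`, `j`, 1.
    ObjectContainerFile,
    /// Single-object encoding: the two-byte marker 0xC3 0x01, followed by the
    /// 8-byte little-endian CRC-64-AVRO fingerprint of the writer schema.
    SingleObject { fingerprint: u64 },
    /// No recognisable header, e.g. a bare Avro datum or another framing.
    Unknown,
}

const OCF_MAGIC: [u8; 4] = *b"Obj\x01";
const SINGLE_OBJECT_MARKER: [u8; 2] = [0xC3, 0x01];

/// Inspects the start of `buf` and reports which framing it appears to use.
pub fn sniff_framing(buf: &[u8]) -> Framing {
    if buf.starts_with(&OCF_MAGIC) {
        return Framing::ObjectContainerFile;
    }
    // A single-object header is 2 marker bytes plus 8 fingerprint bytes.
    if buf.len() >= 10 && buf.starts_with(&SINGLE_OBJECT_MARKER) {
        let mut fp = [0u8; 8];
        fp.copy_from_slice(&buf[2..10]);
        return Framing::SingleObject {
            fingerprint: u64::from_le_bytes(fp),
        };
    }
    Framing::Unknown
}
```

   Once the header has been stripped, the remaining bytes are ordinary Avro-encoded data in either case, which is what would allow a shared decoder to sit underneath several framing layers.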
   
   **Describe alternatives you've considered**
   
   **Additional context**
   
   DataFusion has some Avro support; however, it is based on the row-based 
apache_avro crate and is therefore likely to be far from optimal.
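
   To make the row-based versus vectorised distinction concrete, here is a minimal sketch that decodes Avro records straight into per-column buffers instead of first materialising one value per row. It assumes a hypothetical record schema with two fields, `id: long` and `score: double`, and uses plain `Vec`s as stand-ins for Arrow array builders; `read_long`, `Columns` and `decode_columns` are names made up for this example.

```rust
/// Decodes one Avro `long`, which is a zig-zag encoded variable-length integer.
/// Returns the value and the number of bytes consumed, or `None` if the
/// input is truncated or longer than 10 bytes.
fn read_long(buf: &[u8]) -> Option<(i64, usize)> {
    let mut value: u64 = 0;
    for (i, &byte) in buf.iter().enumerate().take(10) {
        // The low 7 bits of each byte carry payload, least significant first.
        value |= u64::from(byte & 0x7F) << (7 * i);
        if byte & 0x80 == 0 {
            // Undo the zig-zag mapping: 0, 1, 2, 3 -> 0, -1, 1, -2.
            let decoded = ((value >> 1) as i64) ^ -((value & 1) as i64);
            return Some((decoded, i + 1));
        }
    }
    None
}

/// Column buffers for the assumed schema `{ id: long, score: double }`.
#[derive(Debug, Default, PartialEq)]
pub struct Columns {
    pub id: Vec<i64>,
    pub score: Vec<f64>,
}

/// Decodes `rows` consecutive records from `buf`, appending each field
/// directly to its column. Returns `None` if the input is truncated.
pub fn decode_columns(mut buf: &[u8], rows: usize) -> Option<Columns> {
    let mut cols = Columns {
        id: Vec::with_capacity(rows),
        score: Vec::with_capacity(rows),
    };
    for _ in 0..rows {
        // Record fields are encoded back to back, in schema order.
        let (id, used) = read_long(buf)?;
        buf = &buf[used..];

        // An Avro `double` is 8 bytes, little-endian IEEE 754.
        let mut bytes = [0u8; 8];
        bytes.copy_from_slice(buf.get(..8)?);
        buf = &buf[8..];

        cols.id.push(id);
        cols.score.push(f64::from_le_bytes(bytes));
    }
    Some(cols)
}
```

   A real implementation would derive the per-field decoders from the Avro schema rather than hard-coding them, but the shape is the same: no intermediate per-row value is allocated.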
   
   FYI @Samrose-Ahmed @sarutak @devinjdangelo I intend to work on this, but any 
help with reviews and testing would be most welcome.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
