judahrand opened a new issue, #4847:
URL: https://github.com/apache/arrow-rs/issues/4847

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   
   I would like to be able to access the data for an encoded column from each 
row.
   
   I've been looking at the `arrow-row` format for use in a machine learning 
'Feature Store' application. I'm trying to find the most efficient way to move 
between Arrow format and a key-value store which stores values on a 'cell' 
basis. I am aware that the `arrow-row` format is not designed for this sort of 
use-case and is in fact not ideal since it fiddles with some of the data to 
make sure it lexicographically sorts nicely and is also not guaranteed to have 
stable encodings. Having said all of this it struck me as the fastest way to 
begin to investigate this idea.
   
   Having said that I thought that I'd open an issue to see if this 
functionality might have a wider use case and if a PR to introduce it might be 
accepted?
   
   **Describe the solution you'd like**
   <!--
   A clear and concise description of what you want to happen.
   -->
   
   **Describe alternatives you've considered**
   <!--
   A clear and concise description of any alternative solutions or features 
you've considered.
   -->
   
   **Additional context**
   <!--
   Add any other context or screenshots about the feature request here.
   -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to