[PR] feat(python): Add visitor pattern + builders for column sequences [arrow-nanoarrow]

via GitHub Fri, 03 May 2024 20:15:20 -0700


paleolimbot opened a new pull request, #454:
URL: https://github.com/apache/arrow-nanoarrow/pull/454


   Assembling contiguous columns from chunked Arrow data is rather difficult 
to do, and it is a reasonable thing for somebody to want. This PR adds some 
helpers that assemble concatenated buffers that could be passed to numpy 
(or some other library).
   
   Still working on the dispatch of supported types and null handling...
   
   
   ```python
   import nanoarrow as na
   import pyarrow as pa
   import pandas as pd
   from nanoarrow import visitor
   
   url = "https://github.com/apache/arrow-experiments/raw/main/data/arrow-commits/arrow-commits.arrows"
   array = na.ArrayStream.from_url(url).read_all()
   
   {k: type(v) for k, v in visitor.to_columns(array).items()}
   #> {'commit': list, 'time': list, 'files': list, 'merge': list, 'message': list}
   ```
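
To illustrate the general idea of visiting chunks and building one concatenated buffer per column, here is a minimal pure-Python sketch. It is not the implementation in this PR: `ColumnsBuilder`, `visit_chunk`, `finish`, and `to_columns_sketch` are hypothetical names, chunks are modelled as plain dicts of lists rather than Arrow arrays, and null handling is ignored entirely.

```python
from array import array


class ColumnsBuilder:
    """Hypothetical visitor that appends each chunk to one buffer per column."""

    def __init__(self, typecodes):
        # typecodes maps column name -> array typecode (an assumption made
        # for this sketch; real code would dispatch on the Arrow schema).
        self._columns = {name: array(code) for name, code in typecodes.items()}

    def visit_chunk(self, chunk):
        # Extend each column's contiguous buffer with the chunk's values.
        for name, values in chunk.items():
            self._columns[name].extend(values)

    def finish(self):
        return self._columns


def to_columns_sketch(chunks, typecodes):
    builder = ColumnsBuilder(typecodes)
    for chunk in chunks:
        builder.visit_chunk(chunk)
    return builder.finish()


# Two "chunks" with the same columns, standing in for record batches.
chunks = [
    {"time": [1, 2], "files": [3, 1]},
    {"time": [3], "files": [7]},
]
columns = to_columns_sketch(chunks, {"time": "q", "files": "i"})
```

Each resulting `array.array` exposes the buffer protocol, so something like `numpy.frombuffer(columns["time"], dtype="int64")` should be able to wrap it without copying, assuming numpy is installed.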


