paleolimbot opened a new pull request, #454: URL: https://github.com/apache/arrow-nanoarrow/pull/454
Assembling columns from chunked things is rather difficult to do and is a valid thing that somebody might want to assemble from Arrow data. This PR adds some helpers to assemble concatenated buffers that somebody could pass to numpy (or some other library). Still working on the dispatch of supported types and null handling... ```python import nanoarrow as na import pyarrow as pa import pandas as pd from nanoarrow import visitor url = "https://github.com/apache/arrow-experiments/raw/main/data/arrow-commits/arrow-commits.arrows" array = na.ArrayStream.from_url(url).read_all() {k: type(v) for k, v, in visitor.to_columns(array).items()} #> {'commit': list, 'time': list, 'files': list, 'merge': list, 'message': list} ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
