datapythonista commented on issue #1613:
URL: https://github.com/apache/arrow-rs/issues/1613#issuecomment-1146809855
I'm planning to work on this. What I'd personally do, is to have many small
examples of increasing complexity. So, besides examples and recipes, it can be
used as a tutorial, to learn Arrow topics step by step.
If people are happy with this, I'll start working on PRs for the next:
- Creating arrays for primitive types
- With the array constructor (e.g. `Int32Array::from(vec![...])`)
- With a builder (using `append_value` and `append_null)
- With `collect()`
- Creating arrays with null values. I'm unsure about this one, if the above
are simple enough, probably we can have this in the above examples. But worth
having this here for consideration for now
- Creating arrays of more complex types (e.g. `Dictionary`, `Struct`...)
- Creating `Schema`
- Creating `RecordBatch`
- Reading from different formats
- Parquet
- CSV
- JSON
- Writing to different formats (same)
- Data manipulation and kernels. Will expand on this when the rest are
done, for now just couple of simple examples to have something.
Not sure how feasible it is, but would be amazing if we could render those
examples (which will have documentation explaining what's going on) direct to
the Arrow cookbook. I think it's a bit tricky, but doable. And I think it's
better than having to maintain two different cookbooks/examples, or just having
them in one place.
Feedback on any of this very welcome.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]