tschaub commented on issue #37807: URL: https://github.com/apache/arrow/issues/37807#issuecomment-1747611641
Looks useful, @chelseajonesr. My only real use case so far has been creating Parquet data for tests. I've written a [`test.ParquetFromJSON()` function](https://github.com/planetlabs/gpq/blob/v0.19.0/internal/test/test.go#L24-L54) for this purpose.

This may also be specific to my use case, but that function relies on incrementally building up a schema from a configurable number of input (JSON) rows. This allows for cases where `null` values may be present in early rows and the appropriate field type isn't known until more data has been read. So I have an [Arrow schema builder](https://github.com/planetlabs/gpq/blob/v0.19.0/internal/pqutil/arrow.go) for this. It doesn't yet cover all the types you might encounter with an arbitrary struct; I'm just adding support for the cases I need to handle.

So while I think it could be useful to have something in this library to generate Arrow data from a slice of structs (to complement the current `parquet.NewSchemaFromStruct()` function), I just wanted to say that I don't have an urgent need for this now. I'll close this unless someone else thinks it is a worthwhile issue to keep open.
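
For readers unfamiliar with the incremental approach described above, here is a minimal sketch of the idea using only the Go standard library. It is not the gpq schema builder and does not use the Arrow API: the `kind` type, `kindOf`, and `inferKinds` are hypothetical names invented for illustration, and only booleans, numbers, and strings are handled. A field that has only been `null` so far stays `unknown` until a later row reveals its type.

```go
package main

import (
	"encoding/json"
	"fmt"
	"sort"
	"strings"
)

// kind is a hypothetical stand-in for an Arrow data type.
type kind string

const (
	kindUnknown kind = "unknown" // only nulls seen so far
	kindBool    kind = "bool"
	kindNumber  kind = "float64"
	kindString  kind = "string"
)

// kindOf maps a decoded JSON value to a kind. Nested objects and
// arrays are deliberately left unsupported in this sketch.
func kindOf(v any) (kind, error) {
	switch v.(type) {
	case nil:
		return kindUnknown, nil
	case bool:
		return kindBool, nil
	case float64:
		return kindNumber, nil
	case string:
		return kindString, nil
	default:
		return kindUnknown, fmt.Errorf("unsupported value %v", v)
	}
}

// inferKinds reads at most maxRows JSON objects from input and
// resolves each field's kind as soon as a non-null value appears.
func inferKinds(input string, maxRows int) (map[string]kind, error) {
	fields := map[string]kind{}
	dec := json.NewDecoder(strings.NewReader(input))
	for i := 0; i < maxRows && dec.More(); i++ {
		var row map[string]any
		if err := dec.Decode(&row); err != nil {
			return nil, err
		}
		for name, value := range row {
			k, err := kindOf(value)
			if err != nil {
				return nil, fmt.Errorf("field %q: %w", name, err)
			}
			prev, seen := fields[name]
			switch {
			case !seen || prev == kindUnknown:
				fields[name] = k // first sighting, or upgrade from null
			case k != kindUnknown && k != prev:
				return nil, fmt.Errorf("field %q: mixed kinds %s and %s", name, prev, k)
			}
		}
	}
	return fields, nil
}

func main() {
	input := `{"name": null, "count": 1}
{"name": "a", "count": null}`
	fields, err := inferKinds(input, 10)
	if err != nil {
		panic(err)
	}
	names := make([]string, 0, len(fields))
	for name := range fields {
		names = append(names, name)
	}
	sort.Strings(names)
	for _, name := range names {
		fmt.Printf("%s: %s\n", name, fields[name])
	}
}
```

With `maxRows` set to 1, the `name` field above would remain `unknown`, which is why the number of rows to inspect needs to be configurable.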
