That sounds like a great suggestion to me. On Wed, Mar 5, 2025 at 12:41 PM Andrew Lamb <andrewlam...@gmail.com> wrote:
> I would like to request before the VARIANT spec changes are finalized that > we have example data in parquet-testing. > > This topic came up (well, I brought it up) on the sync call today. > > In my opinion, having example files would reduce the overhead of new > implementations dramatically. At least there should be example of > * variant columns (no shredding) > * variant columns with shredding > > Some description of what those files contained ("expected contents"). For > prior art, here is what Dewey did for the geometry type[1][2]. > > When looking for prior discussions, I found a great quote from Gang Wu[3] > on this topic: > > > I'd say that a lesson learned is that we should publish example files > for any > > new feature to the parquet-testing [1] repo for interoperability tests. > > Thank you for your consideration, > Andrew > > > > > [1] https://github.com/apache/parquet-testing/pull/70 > [2] https://github.com/geoarrow/geoarrow-data > [3]: https://lists.apache.org/thread/71d7p9lprhf514jnt5dgnw4wfmn8ykzt >