That sounds like a great suggestion to me.

On Wed, Mar 5, 2025 at 12:41 PM Andrew Lamb <andrewlam...@gmail.com> wrote:

> I would like to request before the VARIANT spec changes are finalized that
> we have example data in parquet-testing.
>
> This topic came up (well, I brought it up) on the sync call today.
>
> In my opinion, having example files would reduce the overhead of new
> implementations dramatically. At least there should be example of
> * variant columns (no shredding)
> * variant columns with shredding
>
> Some description of what those files contained ("expected contents"). For
> prior art, here is what Dewey did for the geometry type[1][2].
>
> When looking for prior discussions, I found a great quote from Gang Wu[3]
> on this topic:
>
> >  I'd say that a lesson learned is that we should publish example files
> for any
> > new feature to the parquet-testing [1] repo for interoperability tests.
>
> Thank you for your consideration,
> Andrew
>
>
>
>
> [1] https://github.com/apache/parquet-testing/pull/70
> [2] https://github.com/geoarrow/geoarrow-data
> [3]: https://lists.apache.org/thread/71d7p9lprhf514jnt5dgnw4wfmn8ykzt
>

Reply via email to