I would like to request before the VARIANT spec changes are finalized that
we have example data in parquet-testing.

This topic came up (well, I brought it up) on the sync call today.

In my opinion, having example files would reduce the overhead of new
implementations dramatically. At least there should be example of
* variant columns (no shredding)
* variant columns with shredding

Some description of what those files contained ("expected contents"). For
prior art, here is what Dewey did for the geometry type[1][2].

When looking for prior discussions, I found a great quote from Gang Wu[3]
on this topic:

>  I'd say that a lesson learned is that we should publish example files
for any
> new feature to the parquet-testing [1] repo for interoperability tests.

Thank you for your consideration,
Andrew




[1] https://github.com/apache/parquet-testing/pull/70
[2] https://github.com/geoarrow/geoarrow-data
[3]: https://lists.apache.org/thread/71d7p9lprhf514jnt5dgnw4wfmn8ykzt

Reply via email to