I would like to start a discussion to help organize and rally anyone interested in adding new encodings to Parquet.
I am pretty sure there are many people interested in adding new encodings, but there are only a few mentions on the mailing list, such as pcode [1] and FSST/ALP/FastLanes [2]. Prateek mentioned on the sync call today that he is working on evaluating some potential encodings and hopes to have some information to share soon, and Julien mentioned he had spoken to someone else who might be doing something similar. Now that Julien has defined a process to extend the spec[3] I think the steps are much clearer. So, I would like to invite anyone interested in adding new encodings to respond and let us know if you are willing to help evaluate new encodings and prototype integrations into Parquet implementations? Andrew [1]: https://lists.apache.org/thread/bdmfcj4g6y1ccd3mfgrp7d43d73s6zf6 [2]: https://lists.apache.org/thread/s3o9jk0hr942pv6ono4ymnvvj6pfdsdw [3]: https://github.com/apache/parquet-format/blob/master/proposals/README.md
