The Apache ORC community has created a separate orc-format repo [1] to hold format specs. It can help us decouple the versions of format and implementation.
IMO, it is now a good time to discuss the next step to evolve the ORC format. To give my two cents, following items are what we can do: - Follow up with the ORC Format v2 proposal [2] - Parquet feature parity [3] - Lance feature parity [4] Considering the activity in the community, I'd like to hear different opinions before taking any action. Any suggestions are welcome. [1] https://github.com/apache/orc-format [2] https://orc.apache.org/specification/ORCv2 [3] https://github.com/apache/parquet-format [4] https://lancedb.github.io/lance/format.html Best, Gang