Hi, I'm developing a Rust reader that reads Parquet files into Arrow arrays. To verify the correctness of this reader, I use the following approach:
1. Define a schema with protobuf.
2. Generate JSON data for this schema using a more mature implementation in another language (e.g. Java).
3. Generate Parquet data for the same schema, again using a more mature implementation in another language (e.g. Java).
4. Write tests that read the JSON file and the Parquet file into memory (as Arrow arrays), then compare the JSON data with the Arrow data.

I think this method can guarantee the correctness of the Arrow reader, because the JSON format is ubiquitous and its implementations are more stable. Any comments are appreciated.
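To make the comparison in step 4 concrete, here is a minimal sketch using only the standard library. The `Row` and `Columns` types, their fields, and the `compare` function are hypothetical stand-ins: a real test would decode the JSON file with a JSON parser and take the columns from the Arrow arrays produced by the reader. The point is only the shape of the check, row-oriented expected data against column-oriented actual data.

```rust
// Hypothetical stand-ins: a real test would use a JSON parser and the
// Arrow arrays returned by the Parquet reader instead of these types.

/// One record as decoded from the JSON file (row-oriented).
#[derive(Debug, Clone, PartialEq)]
pub struct Row {
    pub id: i64,
    pub name: Option<String>,
}

/// The same data as the reader would produce it (column-oriented).
/// `None` marks a null slot in a nullable column.
#[derive(Debug, Default)]
pub struct Columns {
    pub id: Vec<i64>,
    pub name: Vec<Option<String>>,
}

/// Compare expected rows against actual columns.
/// Returns the first mismatch as a readable message.
pub fn compare(expected: &[Row], actual: &Columns) -> Result<(), String> {
    // Every column must have exactly one slot per expected row.
    if actual.id.len() != expected.len() || actual.name.len() != expected.len() {
        return Err(format!(
            "row count mismatch: expected {}, got id={} name={}",
            expected.len(),
            actual.id.len(),
            actual.name.len()
        ));
    }
    // Walk the rows and check each field against its column slot.
    for (i, row) in expected.iter().enumerate() {
        if actual.id[i] != row.id {
            return Err(format!(
                "row {}: id expected {}, got {}",
                i, row.id, actual.id[i]
            ));
        }
        if actual.name[i] != row.name {
            return Err(format!(
                "row {}: name expected {:?}, got {:?}",
                i, row.name, actual.name[i]
            ));
        }
    }
    Ok(())
}
```

A test would build `expected` from the JSON file and `actual` from the reader output, then assert that `compare(&expected, &actual)` is `Ok(())`. Reporting the row index and field name in the error should make a failing case easier to trace back to the generated data.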