alkis commented on PR #2: URL: https://github.com/apache/parquet-benchmark/pull/2#issuecomment-2367232388
If we are going to accept synthetic footers, we should put them in `footer/synthetic` or some other dedicated directory. When one runs benchmarks against them, it should be clear that they are not realistic and shouldn't be "weighted" the same way as real ones.

Plus, how are these generated? Do they include Statistics and custom key-value data? From experience, it is these auxiliary structures that contribute most to footer size and thus parsing speed.

-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at: us...@infra.apache.org