alkis commented on PR #2:
URL: https://github.com/apache/parquet-benchmark/pull/2#issuecomment-2367232388

   If we are going to take synthetic footers we should put them in 
`footer/synthetic` or some other directory. When one runs benchmarks against 
them it should be clear these are not realistic and shouldn't be "weighted" the 
same way as real ones.
   
   Plus how are these generated? Do they Statistics, custom key-val data? From 
experience it is these auxiliary structures that mostly contribute to size and 
thus parsing speed.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to