rizaon commented on PR #2:
URL: https://github.com/apache/parquet-benchmark/pull/2#issuecomment-2375078220

   Apache Impala has some unique synthetic parquet files that we use for unit 
tests.
   
   Parquet files with deep nesting, used by 
[TestMaxNestingDepth](https://github.com/apache/impala/blob/da9400d63ce62e37ea531e7fe6564bf4ac2e0e45/tests/query_test/test_nested_types.py#L899)
   
https://github.com/apache/impala/blob/master/testdata/max_nesting_depth/README
   
   
[widetable_1000_cols](https://github.com/apache/impala/blob/da9400d63ce62e37ea531e7fe6564bf4ac2e0e45/testdata/datasets/functional/functional_schema_template.sql#L2451)
 table with 1000 columns parquet file.
   
   
[lineitem_sixblocks.parquet](https://github.com/apache/impala/blob/master/testdata/LineItemMultiBlock/lineitem_sixblocks.parquet)
 with 400 row groups.
   
https://github.com/apache/impala/blob/master/testdata/LineItemMultiBlock/README.dox
   
   Are they interesting to take in?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to