rizaon commented on PR #2: URL: https://github.com/apache/parquet-benchmark/pull/2#issuecomment-2375078220
Apache Impala has some unique synthetic parquet files that we use for unit tests. Parquet files with deep nesting, used by [TestMaxNestingDepth](https://github.com/apache/impala/blob/da9400d63ce62e37ea531e7fe6564bf4ac2e0e45/tests/query_test/test_nested_types.py#L899) https://github.com/apache/impala/blob/master/testdata/max_nesting_depth/README [widetable_1000_cols](https://github.com/apache/impala/blob/da9400d63ce62e37ea531e7fe6564bf4ac2e0e45/testdata/datasets/functional/functional_schema_template.sql#L2451) table with 1000 columns parquet file. [lineitem_sixblocks.parquet](https://github.com/apache/impala/blob/master/testdata/LineItemMultiBlock/lineitem_sixblocks.parquet) with 400 row groups. https://github.com/apache/impala/blob/master/testdata/LineItemMultiBlock/README.dox Are they interesting to take in? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org