[
https://issues.apache.org/jira/browse/PARQUET-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gabor Szadovszky updated PARQUET-1644:
--------------------------------------
Summary: Clean up some benchmark code and docs. (was: Benchmark module
needs some attention.)
> Clean up some benchmark code and docs.
> --------------------------------------
>
> Key: PARQUET-1644
> URL: https://issues.apache.org/jira/browse/PARQUET-1644
> Project: Parquet
> Issue Type: Bug
> Components: parquet-mr
> Reporter: Ryan Skraba
> Assignee: Ryan Skraba
> Priority: Minor
> Labels: pull-request-available
>
> Strictly following the instructions on the
> [parquet-benchmarks|https://github.com/apache/parquet-mr/tree/fcc5d1a5a669570de3daeafd3f3b7788aa618536/parquet-benchmarks]
> module doesn't give meaningful results.
> It appears some new benchmarks enter into conflict with the globs specified
> for others, not all benchmarks are run, and some iterations of write
> benchmarks aren't evalulated due to unexpected "file already exists ..."
> fail-fast returns in the data generator.
> This should be cleaned up to encourage using and implementing benchmarks on
> Parquet code.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)