[
https://issues.apache.org/jira/browse/ORC-1189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dongjoon Hyun updated ORC-1189:
-------------------------------
Description:
* Since 5/12, the NYC Taxi dataset used in the benchmarks is no longer
available as CSV files; it has been replaced with Parquet
https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page
bq. On 05/13/2022, we are making the following changes to trip record files:
All files will be stored in the Parquet format. Please see the ‘Working With
Parquet Format’ under the Data Dictionaries and MetaData section.
* Running any benchmark fails with "java.util.ServiceConfigurationError"
because one benchmark cannot be instantiated
* Some documentation could be more helpful, e.g. the generate command calls
itself "convert" in its help page
was:
* The NYC Taxi dataset used in the benchmarks is no longer available as CSV
files; it has been replaced with Parquet
* Running any benchmark fails with "java.util.ServiceConfigurationError"
because one benchmark cannot be instantiated
* Some documentation could be more helpful, e.g. the generate command calls
itself "convert" in its help page
> Benchmark Taxi Dataset, Stability, Documentation Issues
> -------------------------------------------------------
>
> Key: ORC-1189
> URL: https://issues.apache.org/jira/browse/ORC-1189
> Project: ORC
> Issue Type: Bug
> Reporter: Martin Loncaric
> Priority: Minor