zhztheplayer opened a new pull request, #10554:
URL: https://github.com/apache/incubator-gluten/pull/10554
Add a new option, `--data-source`:
```bash
--data-source=<dataSource>
Datasource used to generate data and to create
tables
Default: parquet
```
Default value is `parquet`.
The generated data is written to a folder whose name includes the data
source type:
```
root@hongze-zhang-VMware20-1:/tmp# ls /tmp/tpcds-generated-1.0-parquet-partitioned
call_center catalog_returns customer customer_demographics
household_demographics inventory promotion ship_mode store_returns
time_dim web_page web_sales
catalog_page catalog_sales customer_address date_dim
income_band item reason store store_sales
warehouse web_returns web_site
```
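The naming pattern visible in the listing above can be sketched as follows. This is only an illustration inferred from the single folder name shown (`tpcds-generated-1.0-parquet-partitioned`); the class `DataDirName`, the method `generatedDirName`, and its parameters are hypothetical and are not taken from the gluten-it code. Only the partitioned case appears in this PR description, so the sketch hard-codes that suffix.

```java
// Hypothetical helper, not the actual gluten-it implementation.
final class DataDirName {
    // Builds a folder name of the form
    // <benchmark>-generated-<scale>-<dataSource>-partitioned,
    // mirroring the one example shown in the PR description.
    static String generatedDirName(String benchmark, String scale, String dataSource) {
        return String.join("-", benchmark, "generated", scale, dataSource, "partitioned");
    }

    public static void main(String[] args) {
        // The default data source is parquet, per the option's help text.
        System.out.println(generatedDirName("tpcds", "1.0", "parquet"));
    }
}
```

Under this assumed pattern, a different data source would change only the fourth component of the folder name, which keeps data generated for different formats from overwriting each other.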
This feature prepares gluten-it for lake format support in later PRs. For
example, `--data-source=delta` should ideally be supported eventually.