zhztheplayer opened a new pull request, #10554:
URL: https://github.com/apache/incubator-gluten/pull/10554

   Add a new option `--data-source`:
   
   ```bash
         --data-source=<dataSource>
                                Datasource used to generate data and to create
                                  tables
                                  Default: parquet
   ```
   
   Default value is `parquet`.
   
   The generated data is written to a folder whose name includes the data 
source type:
   
   ```
   root@hongze-zhang-VMware20-1:/tmp# ls /tmp/tpcds-generated-1.0-parquet-partitioned
   call_center   catalog_returns  customer          customer_demographics  household_demographics  inventory  promotion  ship_mode  store_returns  time_dim   web_page     web_sales
   catalog_page  catalog_sales    customer_address  date_dim               income_band             item       reason     store      store_sales    warehouse  web_returns  web_site
   ```
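
As a rough illustration of the naming scheme, the folder name in the listing above appears to combine the benchmark suite, the scale factor, the data source, and a partitioning suffix. The sketch below is inferred only from that single example; the class name, method name, and parameters are hypothetical and are not taken from the gluten-it code, and the behavior for non-partitioned data is an assumption.

```java
// Hypothetical sketch: not the actual gluten-it implementation.
public class DataDirNames {

    // Builds a path of the form <root>/<suite>-generated-<scale>-<dataSource>[-partitioned].
    // Assumption: the "-partitioned" suffix is simply omitted for non-partitioned data.
    static String generatedDataDir(String root, String suite, double scale,
                                   String dataSource, boolean partitioned) {
        StringBuilder name = new StringBuilder();
        name.append(suite).append("-generated-").append(scale).append("-").append(dataSource);
        if (partitioned) {
            name.append("-partitioned");
        }
        return root + "/" + name;
    }

    public static void main(String[] args) {
        // Reproduces the folder name shown in the listing above.
        System.out.println(generatedDataDir("/tmp", "tpcds", 1.0, "parquet", true));
        // prints "/tmp/tpcds-generated-1.0-parquet-partitioned"
    }
}
```

Under this assumed scheme, a different data source would only change the corresponding segment of the folder name, so data generated for different sources would not collide in the same directory.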
   
   This option prepares gluten-it for lake format support in later PRs. For 
example, `--data-source=delta` should ideally be supported eventually.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

