comphead commented on PR #18985: URL: https://github.com/apache/datafusion/pull/18985#issuecomment-3607977466
> When I ran the command locally > > ```shell > ./benchmarks/bench.sh run tpcds > ``` > > I got a bunch of errors like > > > Warning registering call_center: Table file does not exist: /Users/andrewlamb/Software/datafusion/benchmarks/data/call_center.parquet > > Details > @comphead what would you think about updating the `bench.sh data tpcds` command so it automatically downloaded the using `wget`? > > For example something like > > ```shell > mkdir -p benchmarks/data/tpcds_sf1 > wget https://github.com/apache/datafusion-benchmarks/raw/refs/heads/main/tpcds/data/sf1/call_center.parquet -O benchmarks/data/tpcds_sf1/call_center.parquet > ``` > > ? I provided instructions to clone the repo. I was also confused if the data file is not found then schema cannot be inferred and query parsing failed. WDYT if I throw an error if no input data exists, pointing them to repo clone? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
