pmcgleenon commented on issue #9404: URL: https://github.com/apache/arrow-datafusion/issues/9404#issuecomment-1984569235
I have a basic question about the time being reported for the queries. If you take a look at the shell script https://github.com/ClickHouse/ClickBench/blob/main/datafusion/run.sh#L34 the variable RES holds the result, it's taking the text from the 7th column in the 2nd line ``` RES=`datafusion-cli -f $CREATE_SQL_FILE /tmp/query.sql 2>&1 | grep "Query took" | sed -n 2p | awk '{print $7}'` ``` If I run the cli there are two query times being reported (assuming one for creating the table and one for the query as per the note in the `run.sh`. ``` /arrow-datafusion/datafusion-cli/target/release/datafusion-cli -f create_partitioned.sql /tmp/query.sql 2>&1 DataFusion CLI v36.0.0 0 rows in set. Query took 0.034 seconds. +---------------------+-----------+ | m | pageviews | +---------------------+-----------+ | 2013-07-15T12:40:00 | 513 | | 2013-07-15T12:41:00 | 457 | | 2013-07-15T12:42:00 | 470 | | 2013-07-15T12:43:00 | 468 | | 2013-07-15T12:44:00 | 453 | | 2013-07-15T12:45:00 | 462 | | 2013-07-15T12:46:00 | 481 | | 2013-07-15T12:47:00 | 458 | | 2013-07-15T12:48:00 | 466 | | 2013-07-15T12:49:00 | 467 | +---------------------+-----------+ 10 rows in set. Query took 0.154 seconds. ``` I just wanted to double-check that using the time from the 2nd line of the output for the result is correct? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
