danielhumanmod commented on PR #1380:
URL:
https://github.com/apache/datafusion-ballista/pull/1380#issuecomment-3758575639
But one wired thing is the performance seems not improve based on benchmark
test result, I used the `Running the Ballista Benchmarks` setting
After
```
st --port 60000 --query 10 --path ./data --format tbl --iterations 3`
Running benchmarks with the following options: BallistaBenchmarkOpt { query:
10, debug: false, expected_results: None, iterations: 3, batch_size: 8192,
path: "./data", file_format: "tbl", partitions: 2, host: Some("localhost"),
port: Some(60000), output_path: None }
Running benchmark with queries 10:
["select\n c_custkey,\n c_name,\n sum(l_extendedprice * (1 -
l_discount)) as revenue,\n c_acctbal,\n n_name,\n c_address,\n
c_phone,\n c_comment\nfrom\n customer,\n orders,\n lineitem,\n
nation\nwhere\n c_custkey = o_custkey\n and l_orderkey = o_orderkey\n
and o_orderdate >= date '1993-10-01'\n and o_orderdate < date '1994-01-01'\n
and l_returnflag = 'R'\n and c_nationkey = n_nationkey\ngroup by\n
c_custkey,\n c_name,\n c_acctbal,\n c_phone,\n n_name,\n
c_address,\n c_comment\norder by\n revenue desc\nlimit 20"]
Query 10 iteration 0 took 2533.0 ms and returned 20 rows
Query 10 iteration 1 took 1853.6 ms and returned 20 rows
Query 10 iteration 2 took 1958.3 ms and returned 20 rows
Query 10 avg time: 2114.96 ms
```
Before
```
st --port 60000 --query 10 --path ./data --format tbl --iterations 3`
Running benchmarks with the following options: BallistaBenchmarkOpt { query:
10, debug: false, expected_results: None, iterations: 3, batch_size: 8192,
path: "./data", file_format: "tbl", partitions: 2, host: Some("localhost"),
port: Some(60000), output_path: None }
Running benchmark with queries 10:
["select\n c_custkey,\n c_name,\n sum(l_extendedprice * (1 -
l_discount)) as revenue,\n c_acctbal,\n n_name,\n c_address,\n
c_phone,\n c_comment\nfrom\n customer,\n orders,\n lineitem,\n
nation\nwhere\n c_custkey = o_custkey\n and l_orderkey = o_orderkey\n
and o_orderdate >= date '1993-10-01'\n and o_orderdate < date '1994-01-01'\n
and l_returnflag = 'R'\n and c_nationkey = n_nationkey\ngroup by\n
c_custkey,\n c_name,\n c_acctbal,\n c_phone,\n n_name,\n
c_address,\n c_comment\norder by\n revenue desc\nlimit 20"]
Query 10 iteration 0 took 2739.5 ms and returned 20 rows
Query 10 iteration 1 took 1855.9 ms and returned 20 rows
Query 10 iteration 2 took 1850.1 ms and returned 20 rows
Query 10 avg time: 2148.49 ms
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]