Omega359 opened a new issue, #21625:
URL: https://github.com/apache/datafusion/issues/21625

   ### Describe the bug
   
   On my machine
   ```
   OS: Fedora Linux 43 (KDE Plasma Desktop Edition) x86_64
   Kernel: Linux 6.19.11-200.fc43.x86_64
   CPU: AMD Ryzen AI 9 HX 370 (24) @ 5.16 GHz
   GPU: AMD Radeon 890M Graphics [Integrated]
   Memory: 11.96 GiB / 86.02 GiB (14%)
   Swap: 0 B / 8.00 GiB (0%)
   ```
   with updated rust:
   ```
   $ rustup show
   Default host: x86_64-unknown-linux-gnu
   rustup home:  /home/bruce/.rustup
   
   installed toolchains
   --------------------
   stable-x86_64-unknown-linux-gnu (default)
   nightly-x86_64-unknown-linux-gnu
   1.91.0-x86_64-unknown-linux-gnu
   1.92.0-x86_64-unknown-linux-gnu
   1.93.0-x86_64-unknown-linux-gnu
   1.94.0-x86_64-unknown-linux-gnu (active)
   
   active toolchain
   ----------------
   name: 1.94.0-x86_64-unknown-linux-gnu
   active because: overridden by 
'/home/bruce/dev/datafusion2/rust-toolchain.toml'
   installed targets:
     x86_64-unknown-linux-gnu
   ```
   
   Running against main:
   `cd benchmarks;./bench.sh data tpch;./bench.sh run tpch 18` will hang
   ```
   $ ./bench.sh run tpch 18
   ***************************
   DataFusion Benchmark Script
   COMMAND: run
   BENCHMARK: tpch
   QUERY: 18
   DATAFUSION_DIR: /home/bruce/dev/datafusion2/benchmarks/..
   BRANCH_NAME: HEAD
   DATA_DIR: /home/bruce/dev/datafusion2/benchmarks/data
   RESULTS_DIR: /home/bruce/dev/datafusion2/benchmarks/results/HEAD
   CARGO_COMMAND: cargo run --release
   PREFER_HASH_JOIN: true
   SIMULATE_LATENCY: false
   ***************************
   RESULTS_FILE: 
/home/bruce/dev/datafusion2/benchmarks/results/HEAD/tpch_sf1.json
   Running tpch benchmark...
   + cargo run --release --bin dfbench -- tpch --iterations 5 --path 
/home/bruce/dev/datafusion2/benchmarks/data/tpch_sf1 --prefer_hash_join true 
--format parquet -o 
/home/bruce/dev/datafusion2/benchmarks/results/HEAD/tpch_sf1.json --query 18
       Finished `release` profile [optimized] target(s) in 0.11s
        Running `/home/bruce/dev/datafusion2/target/release/dfbench tpch 
--iterations 5 --path /home/bruce/dev/datafusion2/benchmarks/data/tpch_sf1 
--prefer_hash_join true --format parquet -o 
/home/bruce/dev/datafusion2/benchmarks/results/HEAD/tpch_sf1.json --query 18`
   Running benchmarks with the following options: RunOpt { query: Some(18), 
common: CommonOpt { iterations: 5, partitions: None, batch_size: None, 
mem_pool_type: "fair", memory_limit: None, sort_spill_reservation_bytes: None, 
debug: false, simulate_latency: false }, path: 
"/home/bruce/dev/datafusion2/benchmarks/data/tpch_sf1", file_format: "parquet", 
mem_table: false, output_path: 
Some("/home/bruce/dev/datafusion2/benchmarks/results/HEAD/tpch_sf1.json"), 
disable_statistics: false, prefer_hash_join: true, enable_piecewise_merge_join: 
false, sorted: false, hash_join_buffering_capacity: 0 }
   ```
   
   git bisect points to [this 
commit](https://github.com/apache/datafusion/commit/6c5e241e6298e70077259b3a12840c3adab3c810)
 as the cause. Running the test at the commit just prior to that one succeeds. 
Running it at that commit fails.
   
   If prefer_hash_join is disabled the query will run as expected:
   ```
   PREFER_HASH_JOIN=false ./bench.sh run tpch 18
   ***************************
   DataFusion Benchmark Script
   COMMAND: run
   BENCHMARK: tpch
   QUERY: 18
   DATAFUSION_DIR: /home/bruce/dev/datafusion2/benchmarks/..
   BRANCH_NAME: HEAD
   DATA_DIR: /home/bruce/dev/datafusion2/benchmarks/data
   RESULTS_DIR: /home/bruce/dev/datafusion2/benchmarks/results/HEAD
   CARGO_COMMAND: cargo run --release
   PREFER_HASH_JOIN: false
   SIMULATE_LATENCY: false
   ***************************
   RESULTS_FILE: 
/home/bruce/dev/datafusion2/benchmarks/results/HEAD/tpch_sf1.json
   Running tpch benchmark...
   + cargo run --release --bin dfbench -- tpch --iterations 5 --path 
/home/bruce/dev/datafusion2/benchmarks/data/tpch_sf1 --prefer_hash_join false 
--format parquet -o 
/home/bruce/dev/datafusion2/benchmarks/results/HEAD/tpch_sf1.json --query 18
       Finished `release` profile [optimized] target(s) in 0.15s
        Running `/home/bruce/dev/datafusion2/target/release/dfbench tpch 
--iterations 5 --path /home/bruce/dev/datafusion2/benchmarks/data/tpch_sf1 
--prefer_hash_join false --format parquet -o 
/home/bruce/dev/datafusion2/benchmarks/results/HEAD/tpch_sf1.json --query 18`
   Running benchmarks with the following options: RunOpt { query: Some(18), 
common: CommonOpt { iterations: 5, partitions: None, batch_size: None, 
mem_pool_type: "fair", memory_limit: None, sort_spill_reservation_bytes: None, 
debug: false, simulate_latency: false }, path: 
"/home/bruce/dev/datafusion2/benchmarks/data/tpch_sf1", file_format: "parquet", 
mem_table: false, output_path: 
Some("/home/bruce/dev/datafusion2/benchmarks/results/HEAD/tpch_sf1.json"), 
disable_statistics: false, prefer_hash_join: false, 
enable_piecewise_merge_join: false, sorted: false, 
hash_join_buffering_capacity: 0 }
   Query 18 iteration 0 took 206.5 ms and returned 57 rows
   Query 18 iteration 1 took 190.0 ms and returned 57 rows
   Query 18 iteration 2 took 188.5 ms and returned 57 rows
   Query 18 iteration 3 took 185.0 ms and returned 57 rows
   Query 18 iteration 4 took 192.1 ms and returned 57 rows
   Query 18 avg time: 192.42 ms
   + set +x
   Done
   ```
   
   ### To Reproduce
   
   This seems to be machine/OS specific. I've been unable to reproduce on other 
machines.
   
   ### Expected behavior
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to