milenkovicm commented on issue #1681:
URL:
https://github.com/apache/datafusion-ballista/issues/1681#issuecomment-4424731139
I've managed to reproduce it with SF1
reverted `datafusion.optimizer.prefer_hash_join` back to `true`
```
.set_bool("datafusion.optimizer.prefer_hash_join", true)
```
```
cargo run --bin tpch -- benchmark ballista -p
/Users/marko/TMP/tpch_data/tpch-data-sf1/ -f parquet -i 1 --port 50050 --host
127.0.0.1 -c datafusion.execution.target_partitions=14 -q 2
```
failed
```
called `Result::unwrap()` on an `Err` value:
Plan("ArrowError(ExternalError(Execution(\"Job fCM1di0 failed: Job failed due
to stage 11 failed: Task failed due to runtime execution error:
DataFusionError(Internal(\\\"Assertion failed: self.mode !=
PartitionMode::CollectLeft || left_partitions == 1: Invalid HashJoinExec, the
output partition count of the left child must be 1 in CollectLeft mode,consider
using CoalescePartitionsExec or the EnforceDistribution rule\\\"))\\n\")),
Some(\"\"))")
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]