milenkovicm commented on issue #1681:
URL: https://github.com/apache/datafusion-ballista/issues/1681#issuecomment-4424731139

   I've managed to reproduce it with SF1 after reverting `datafusion.optimizer.prefer_hash_join` back to `true`:
   
   ```
   .set_bool("datafusion.optimizer.prefer_hash_join", true)
   ```
   
   ```
   cargo run --bin tpch -- benchmark ballista -p /Users/marko/TMP/tpch_data/tpch-data-sf1/ -f parquet -i 1 --port 50050 --host 127.0.0.1 -c datafusion.execution.target_partitions=14 -q 2
   ```
   
   The run failed with:
   ```
   called `Result::unwrap()` on an `Err` value: Plan("ArrowError(ExternalError(Execution(\"Job fCM1di0 failed: Job failed due to stage 11 failed: Task failed due to runtime execution error: DataFusionError(Internal(\\\"Assertion failed: self.mode != PartitionMode::CollectLeft || left_partitions == 1: Invalid HashJoinExec, the output partition count of the left child must be 1 in CollectLeft mode,consider using CoalescePartitionsExec or the EnforceDistribution rule\\\"))\\n\")), Some(\"\"))")
   note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
   ```
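
   For readers unfamiliar with the assertion in the error, here is a minimal sketch of the invariant it states. This is a simplified stand-in, not DataFusion's actual code: the two-variant `PartitionMode` enum and the `check_hash_join` helper are hypothetical names used only for illustration. The condition is copied from the message above.

   ```rust
   // Simplified stand-in for the partition modes named in the error message.
   // Assumption: only the two variants needed for the illustration are modelled.
   #[derive(Debug, Clone, Copy, PartialEq)]
   enum PartitionMode {
       Partitioned,
       CollectLeft,
   }

   /// Hypothetical helper mirroring the failed assertion:
   /// `self.mode != PartitionMode::CollectLeft || left_partitions == 1`.
   fn check_hash_join(mode: PartitionMode, left_partitions: usize) -> Result<(), String> {
       if mode != PartitionMode::CollectLeft || left_partitions == 1 {
           // Either the join is not in CollectLeft mode, or the left child
           // already produces a single partition.
           Ok(())
       } else {
           Err(format!(
               "CollectLeft requires exactly 1 left partition, got {left_partitions}"
           ))
       }
   }
   ```

   Under this model, `check_hash_join(PartitionMode::CollectLeft, 1)` passes, while a `CollectLeft` join whose left child reports any other partition count is rejected. The invariant places no constraint on `Partitioned` joins.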


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

