martin-g opened a new issue, #1681:
URL: https://github.com/apache/datafusion-ballista/issues/1681

   **Describe the bug**
   
   TPC-H Q2 fails on the latest `main` (May 11, 2026): stage 11 aborts with a
   `HashJoinExec` assertion about the left child's partition count in `CollectLeft` mode.
   It was working fine last week.
   
   **To Reproduce**
   
   ```
   datafusion-ballista/benchmarks on  1642-executor-metrics [$!?⇡] is 📦 v53.0.0 via 🐍 v3.12.13 (ballista) via 🦀 v1.95.0
   ❯ python3 tpch.py --query q2 --path tpch-data
   Registering table part at path tpch-data/part.parquet
   Registering table supplier at path tpch-data/supplier.parquet
   Registering table partsupp at path tpch-data/partsupp.parquet
   Registering table customer at path tpch-data/customer.parquet
   Registering table orders at path tpch-data/orders.parquet
   Registering table lineitem at path tpch-data/lineitem.parquet
   Registering table nation at path tpch-data/nation.parquet
   Registering table region at path tpch-data/region.parquet
   Job k2Jf6V6 failed: Job failed due to stage 11 failed: Task failed due to runtime execution error: DataFusionError(Internal("Assertion failed: self.mode != PartitionMode::CollectLeft || left_partitions == 1: Invalid HashJoinExec, the output partition count of the left child must be 1 in CollectLeft mode,consider using CoalescePartitionsExec or the EnforceDistribution rule"))
   
   Traceback (most recent call last):
     File "/Users/mgrigorov/git/apache/datafusion/datafusion-ballista/benchmarks/tpch.py", line 55, in <module>
       df.show()
     File "/Users/mgrigorov/git/apache/datafusion/datafusion-ballista/python/python/ballista/extension.py", line 64, in method_wrapper
       return getattr(df, func)(*argz, **kwargs)
              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
   Exception: DataFusion error: Arrow error: External error: Execution error: Job k2Jf6V6 failed: Job failed due to stage 11 failed: Task failed due to runtime execution error: DataFusionError(Internal("Assertion failed: self.mode != PartitionMode::CollectLeft || left_partitions == 1: Invalid HashJoinExec, the output partition count of the left child must be 1 in CollectLeft mode,consider using CoalescePartitionsExec or the EnforceDistribution rule"))
   ```
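
For readers unfamiliar with the assertion in the log, here is a toy Python model of the condition it quotes (`self.mode != PartitionMode::CollectLeft || left_partitions == 1`). It is only a sketch for illustration: the `PartitionMode` enum and `check_hash_join` helper are hypothetical stand-ins, not DataFusion or Ballista code, and the `PARTITIONED` member is assumed here merely as an example of a mode other than `CollectLeft`.

```python
from enum import Enum


class PartitionMode(Enum):
    # Hypothetical stand-in for the mode named in the error message.
    PARTITIONED = "Partitioned"    # assumed example of a non-CollectLeft mode
    COLLECT_LEFT = "CollectLeft"


def check_hash_join(mode: PartitionMode, left_partitions: int) -> None:
    """Raise if the condition quoted in the assertion does not hold."""
    # Mirrors: self.mode != PartitionMode::CollectLeft || left_partitions == 1
    if not (mode != PartitionMode.COLLECT_LEFT or left_partitions == 1):
        raise AssertionError(
            "the output partition count of the left child must be 1 "
            "in CollectLeft mode"
        )


# A single left partition satisfies the condition in CollectLeft mode.
check_hash_join(PartitionMode.COLLECT_LEFT, 1)

# The condition places no constraint on other modes.
check_hash_join(PartitionMode.PARTITIONED, 8)

# Several left partitions in CollectLeft mode violate it, as in the failing job.
try:
    check_hash_join(PartitionMode.COLLECT_LEFT, 8)
except AssertionError as err:
    print(f"rejected: {err}")
```

In this model, the failure in stage 11 corresponds to the last case: a `CollectLeft` join whose left input reports more than one partition.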
   
   

