korowa commented on issue #7931:
URL: 
https://github.com/apache/arrow-datafusion/issues/7931#issuecomment-1925841430

   I suppose this issue may be closed -- the reproducer based on data files 
from the description
   
   <details>
     <summary>Reproducer code</summary>
   
     ```
   use datafusion::error::Result;
   use datafusion::prelude::*;
   
   #[tokio::main]
   async fn main() -> Result<()> {
       let ctx = SessionContext::new();
   
       ctx.register_parquet(
           "sea",
           "/home/eduard/Downloads/sea",
           ParquetReadOptions::default(),
       )
       .await?;
   
       ctx.register_parquet(
           "air",
           "/home/eduard/Downloads/air",
           ParquetReadOptions::default(),
       )
       .await?;
   
       let sql = r##"
       SELECT count(1) FROM air INNER JOIN sea ON sea.station = air.station;
       "##;
   
       let df = ctx.sql(sql).await?;
       df.show().await?;
   
       Ok(())
   }
   ```
   
   </details>
   
   is able to complete with current main 
version(https://github.com/apache/arrow-datafusion/commit/840499fbd3c25bf82a6ee0641ef6888079e096da),
 while running it with 35.0.0 / 34.0.0 ends up with OOM


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to