adamreeve opened a new issue, #37453:
URL: https://github.com/apache/arrow/issues/37453

   ### Describe the bug, including details regarding any error messages, 
version, and platform.
   
   When running our benchmarks for 
[ParquetSharp](https://github.com/G-Research/ParquetSharp) against the new 
Arrow 13.0.0 release, one of the benchmarks was twice as slow compared to Arrow 
12.0.1. Digging into this further, I found that the change that caused this 
slow down was #35230, and it appears that the reason for the slowdown is this 
particular benchmark makes a lot of calls to `WriteBatch`, and there is now a 
lot of extra time spent looking up 
`properties_->page_index_enabled(descr_->path())` in 
`pages_change_on_record_boundaries()`.
   
   It looks to me like the column properties shouldn't change one a 
`TypedColumnWriter` is constructed, so it should be fine to compute this value 
once on construction to fix this performance regression.
   
   ### Component(s)
   
   Parquet


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to