Kev1n8 commented on PR #6181:
URL: https://github.com/apache/arrow-rs/pull/6181#issuecomment-2265786509

   I think there might be a bug when a `StringView` or `BinaryView` value is 
longer than 12 bytes, assuming my tests are correct. The tests I added report 
the wrong minimum and the wrong maximum, and the reported values are not in their column.
   
   ```
   ---- statistics::test_utf8_view stdout ----
   thread 'statistics::test_utf8_view' panicked at parquet/./tests/arrow_reader/statistics.rs:268:13:
   assertion `left == right` failed: utf8_view: Mismatch with expected data page minimums
     left: StringViewArray
   [
     "a",
     "a",
     "h_longerthan12",
   ]
    right: StringViewArray
   [
     "a",
     "a",
     "e_longerthan12",
   ]
   ```
   
   After bypassing the min check, the maximums are wrong too:
   ```
   assertion `left == right` failed: utf8_view: Mismatch with expected data page maximum
     left: StringViewArray
   [
     "d",
     "g_longerthan12",
     "i_longerthan12",
   ]
    right: StringViewArray
   [
     "d",
     "e_longerthan12",
     "i_longerthan12",
   ]
   ```
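
For context on why 12 looks like the boundary: in the Arrow view layout, values of at most 12 bytes are stored inline in the 16-byte view, while longer values keep only a 4-byte prefix in the view and live in a separate data buffer. The sketch below is a stdlib-only illustration of the invariant the tests check: a page's min and max must be values that actually occur in that page, whether or not they are inlined. The page contents, `MAX_INLINE_LEN`, `is_inline`, and `page_min_max` are hypothetical names and data made up for illustration. They are not the test's real data or the arrow-rs API, and the sketch does not claim to locate the bug.

```rust
/// The Arrow view layout inlines values of at most 12 bytes in the 16-byte view.
/// (Hypothetical constant name, used only in this sketch.)
const MAX_INLINE_LEN: usize = 12;

/// Hypothetical helper: would this value be stored inline in a view?
fn is_inline(value: &str) -> bool {
    value.len() <= MAX_INLINE_LEN
}

/// Hypothetical helper: bytewise min and max of one data page,
/// or `None` if the page is empty.
fn page_min_max<'a>(page: &[&'a str]) -> Option<(&'a str, &'a str)> {
    let min = page.iter().copied().min()?;
    let max = page.iter().copied().max()?;
    Some((min, max))
}

fn main() {
    // Made-up page where every value is longer than 12 bytes.
    let page = ["f_longerthan12", "e_longerthan12", "g_longerthan12"];
    assert!(page.iter().all(|v| !is_inline(v)));
    assert!(is_inline("a"));

    let (min, max) = page_min_max(&page).unwrap();
    assert_eq!(min, "e_longerthan12");
    assert_eq!(max, "g_longerthan12");

    // The invariant the failing tests point at: min and max come from the page itself.
    assert!(page.contains(&min) && page.contains(&max));
}
```

A min or max that is absent from its own page, as in the failures above, would break the last assertion regardless of how the values are stored.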
   

