ahmedabu98 commented on issue #33497:
URL: https://github.com/apache/beam/issues/33497#issuecomment-2580131125

   But while investigating, I found a bigger problem: some records contain wrong or duplicate values for the partition column. For example, the above query gives me this result:
   <img width="469" alt="image" src="https://github.com/user-attachments/assets/50ab5c0b-e50b-4456-9846-e7ab089301f5" />
   I would actually expect only one result: `{long=811, str=value_811}`
   
   **Another case:**
   I checked the warehouse, and the `str=value_811` partition has only one datafile, which contains only one record: `{long=811, str=value_811}`, as expected.
   The following query gives me no results:
   ```sql
   SELECT * FROM `apache-beam-testing.ahmedabualsaud.my_table_with_spec` WHERE str LIKE "value_812"
   ```
   But when I manually read the datafile under partition `str=value_812`, I find the correct record: `{long=812, str=value_812}`


