ankitsultana opened a new issue, #14838:
URL: https://github.com/apache/pinot/issues/14838

   We are seeing segments go into the ERROR state because of a race condition 
between the Segment Refresh message and the Offline to Online transition. This 
ultimately leads to `Inconsistent data read. Index data file xyz/columns.psf is 
possibly corrupted`.
   
   I haven't looked into this in depth yet, but I can share that the following 
events happened around the same time (in no particular order):
   
   - There was a node replacement.
   - There was a segment compaction job that processed this segment around the 
same time.
   
   ```
   {"@timestamp":"2025-01-18T06:33:47.092+00:00","message":"Creating new inverted index for segment: some_table__13__2492__20250114T2318Z, column: lorem_ipsum","logger_name":"org.apache.pinot.segment.local.segment.index.loader.invertedindex.InvertedIndexHandler","thread_name":"HelixTaskExecutor-message_handle_thread_33","level":"INFO"}
   {"@timestamp":"2025-01-18T06:33:47.098+00:00","message":"Need to create new inverted index for segment: some_table__13__2492__20250114T2318Z, column: foobar","logger_name":"org.apache.pinot.segment.local.segment.index.loader.invertedindex.InvertedIndexHandler","thread_name":"HelixTaskExecutor-message_handle_thread_43","level":"INFO"}
   ```
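
The two log lines above come from different Helix message handler threads (`thread_33` and `thread_43`) working on the same segment about 6 ms apart. One way to picture the suspected race, and what serializing the two handlers would look like, is the minimal sketch below. It is only an illustration under assumptions: `PerSegmentLockSketch`, `lockFor`, and `peakConcurrentBuilds` are hypothetical names, the sleep stands in for rewriting the segment's index files, and none of this is Pinot's actual locking code or a confirmed root cause.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.locks.Lock;
import java.util.concurrent.locks.ReentrantLock;

public class PerSegmentLockSketch {
    // Hypothetical registry with one lock per segment name (not a Pinot API).
    private static final ConcurrentHashMap<String, Lock> LOCKS = new ConcurrentHashMap<>();

    // Returns the same lock instance for every caller that passes the same segment name.
    public static Lock lockFor(String segmentName) {
        return LOCKS.computeIfAbsent(segmentName, k -> new ReentrantLock());
    }

    // Starts `handlers` threads that all "build an index" for one segment at once
    // and returns the largest number of builds that overlapped in time.
    public static int peakConcurrentBuilds(String segmentName, int handlers)
            throws InterruptedException {
        AtomicInteger active = new AtomicInteger();
        AtomicInteger peak = new AtomicInteger();
        CountDownLatch start = new CountDownLatch(1);
        ExecutorService pool = Executors.newFixedThreadPool(handlers);
        for (int i = 0; i < handlers; i++) {
            pool.submit(() -> {
                try {
                    start.await(); // release all handlers together to force contention
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    return;
                }
                Lock lock = lockFor(segmentName);
                lock.lock(); // only one handler per segment proceeds past this point
                try {
                    int now = active.incrementAndGet();
                    peak.accumulateAndGet(now, Math::max);
                    Thread.sleep(5); // stand-in for rewriting the segment's index files
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    active.decrementAndGet();
                    lock.unlock();
                }
            });
        }
        start.countDown();
        pool.shutdown();
        pool.awaitTermination(10, TimeUnit.SECONDS);
        return peak.get();
    }

    public static void main(String[] args) throws InterruptedException {
        // Two handlers, mirroring the two threads seen in the log excerpt.
        System.out.println(peakConcurrentBuilds("some_table__13__2492__20250114T2318Z", 2));
    }
}
```

With the per-segment lock in place the reported peak stays at 1, meaning the two handlers never touch the segment at the same time. Removing the `lock.lock()` / `lock.unlock()` pair lets the builds overlap, which is the kind of interleaving that could plausibly leave an index file half-written.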


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

