DanielLeens commented on issue #11008: URL: https://github.com/apache/seatunnel/issues/11008#issuecomment-4627269736
I checked the current MongoDB CDC path, and this does look like a real connector bug direction rather than just a sink-side configuration problem. The key point is that the MongoDB CDC deserialization path currently reads `operationType` directly from the change-event `Struct` without first checking whether that field is actually present. In the current code this happens in both the MongoDB CDC deserialization path and the output-buffer rewrite path. If a record shape reaches SeaTunnel without `operationType`, you get exactly the `DataException` shown in this issue. So the thread is worth keeping open as a bug. To narrow down which MongoDB record shape is missing that field, we still need three details: 1. the MongoDB server version and deployment mode (standalone / replica set / sharded) 2. the full stack trace from the first failing record, including the lines immediately before the `DataException` 3. whether this happens during the initial snapshot stage, or only after the first change-stream event arrives Also, because Apache SeaTunnel is an international community, please keep the follow-up details in English if possible so more contributors can help review the case. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
