haperski opened a new pull request, #34365: URL: https://github.com/apache/beam/pull/34365
Fix MongoDB connector to handle out-of-range dates
------------------------

### Changes Made:

- Modified `mongodbio.py` to use `CodecOptions` and `DatetimeConversion`.
- Specifically, added `CodecOptions(datetime_conversion=DatetimeConversion.DATETIME_CLAMP)` so that out-of-range dates are clamped to a representable value when reading data from MongoDB.
- Updated the `_get_head_document_id` and `read` methods to use the new codec options when connecting to the MongoDB database.

### Results After Changes

With these changes, documents containing out-of-range dates can be read and then transformed downstream. (https://pymongo.readthedocs.io/en/stable/examples/datetimes.html#handling-out-of-range-datetimes)

### Discussion

This issue was discussed on the Apache Beam mailing list, where XQ Hu recommended addressing it with a PR.

- [x] Fixes #30471
- [ ] Update `CHANGES.md` with noteworthy changes.
- [ ] If this contribution is large, please file an Apache [Individual Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).

See the [Contributor Guide](https://beam.apache.org/contribute) for more tips on [how to make the review process smoother](https://github.com/apache/beam/blob/master/CONTRIBUTING.md#make-the-reviewers-job-easier).

To check the build health, please visit [Build Status](https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md).