[
https://issues.apache.org/jira/browse/BEAM-9960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Corvin Deboeser updated BEAM-9960:
----------------------------------
Description:
When using MongoDBIO on a large collection and the source bundle size was
determined to be 1, then the response from the split vector command can be
larger than 16mb which is not supported by pymongo / MongoDB:
{{pymongo.errors.ProtocolError: Message length (33699186) is larger than server
max message size (33554432)}}
Environment: Was running this on Google Dataflow / Beam Python SDK 2.20.
was:
When using MongoDBIO on a large collection and the source bundle size was
determined to be 1, then the response from the split vector command can be
larger than 16mb which is not supported by pymongo / MongoDB:
{{pymongo.errors.ProtocolError: Message length (33699186) is larger than server
max message size (33554432)}}
> Python MongoDBIO fails when response of split vector command is larger than
> 16mb
> --------------------------------------------------------------------------------
>
> Key: BEAM-9960
> URL: https://issues.apache.org/jira/browse/BEAM-9960
> Project: Beam
> Issue Type: Bug
> Components: sdk-py-core
> Affects Versions: 2.20.0
> Reporter: Corvin Deboeser
> Priority: Major
>
> When using MongoDBIO on a large collection and the source bundle size was
> determined to be 1, then the response from the split vector command can be
> larger than 16mb which is not supported by pymongo / MongoDB:
> {{pymongo.errors.ProtocolError: Message length (33699186) is larger than
> server max message size (33554432)}}
>
> Environment: Was running this on Google Dataflow / Beam Python SDK 2.20.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)