Pranav Wadekar created FLINK-37273:
--------------------------------------
Summary: Dynamodb Streams getRecords call adaptive delay
Key: FLINK-37273
URL: https://issues.apache.org/jira/browse/FLINK-37273
Project: Flink
Issue Type: Improvement
Components: Connectors / DynamoDB
Affects Versions: aws-connector-5.0.0
Reporter: Pranav Wadekar
Fix For: 2.0.0
I am trying to read the data from DynamoDBStreams which is introduced in Flink
v18.0. It is working fine for large tables (10K RPM), and the efficiency is
also good.
However, the dynamodb stream cost for {{getRecords}} AWS requests. Although
it's not a problem for big tables, for small tables even if the records are not
committed Flink is continuously polling the records.
With the above behaviour, we are getting an unnecessary cost of 130$ per day.
Ideally, we need to scan the stream at specific intervals with some backoff
time.
I did not find anything regarding this.
Can we please implement the adaptive delay if there is any idleness in incoming
records? I can also contribute to the code.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)