[ 
https://issues.apache.org/jira/browse/BEAM-11815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexey Romanenko updated BEAM-11815:
------------------------------------
    Description: 
In processElement:

{code:java}
private static class ReadFn<T> extends DoFn<Read<T>, T> {
  @ProcessElement
  public void processElement(@Element Read<T> spec, OutputReceiver<T> out) {
    AmazonDynamoDB client = spec.getAwsClientsProvider().createDynamoDB();
    ScanRequest scanRequest = spec.getScanRequestFn().apply(null);
    scanRequest.setSegment(spec.getSegmentId());
    ScanResult scanResult = client.scan(scanRequest);
    out.output(spec.getScanResultMapperFn().apply(scanResult));
  }
}
{code}
 
 

if total requested items > 1M dynamodb will only send upto 1M of data  and 
expects client to read remaining items through LastEvaluatedKey (see 
[this|https://docs.aws.amazon.com/AWSJavaSDK/latest/javadoc/com/amazonaws/services/dynamodbv2/model/ScanRequest.html#setLimit-java.lang.Integer-])

 

  was:
In processElement:

 

 
{code:java}
private static class ReadFn<T> extends DoFn<Read<T>, T> {
 @ProcessElement
 public void processElement(@Element Read<T> spec, OutputReceiver<T> out) {
 AmazonDynamoDB client = spec.getAwsClientsProvider().createDynamoDB();
 ScanRequest scanRequest = spec.getScanRequestFn().apply(null);
 scanRequest.setSegment(spec.getSegmentId());
 ScanResult scanResult = client.scan(scanRequest);
 out.output(spec.getScanResultMapperFn().apply(scanResult));
 }
 }{code}
 

 

if total requested items > 1M dynamodb will only send upto 1M of data  and 
expects client to read remaining items through LastEvaluatedKey (see 
[this|https://docs.aws.amazon.com/AWSJavaSDK/latest/javadoc/com/amazonaws/services/dynamodbv2/model/ScanRequest.html#setLimit-java.lang.Integer-])

 


> fail to read more than 1M of items
> ----------------------------------
>
>                 Key: BEAM-11815
>                 URL: https://issues.apache.org/jira/browse/BEAM-11815
>             Project: Beam
>          Issue Type: Bug
>          Components: io-java-aws
>            Reporter: Mohammad
>            Priority: P1
>
> In processElement:
> {code:java}
> private static class ReadFn<T> extends DoFn<Read<T>, T> {
>   @ProcessElement
>   public void processElement(@Element Read<T> spec, OutputReceiver<T> out) {
>     AmazonDynamoDB client = spec.getAwsClientsProvider().createDynamoDB();
>     ScanRequest scanRequest = spec.getScanRequestFn().apply(null);
>     scanRequest.setSegment(spec.getSegmentId());
>     ScanResult scanResult = client.scan(scanRequest);
>     out.output(spec.getScanResultMapperFn().apply(scanResult));
>   }
> }
> {code}
>  
>  
> if total requested items > 1M dynamodb will only send upto 1M of data  and 
> expects client to read remaining items through LastEvaluatedKey (see 
> [this|https://docs.aws.amazon.com/AWSJavaSDK/latest/javadoc/com/amazonaws/services/dynamodbv2/model/ScanRequest.html#setLimit-java.lang.Integer-])
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to