[ 
https://issues.apache.org/jira/browse/ASTERIXDB-2945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17397170#comment-17397170
 ] 

Ingo Müller commented on ASTERIXDB-2945:
----------------------------------------

I have tried patch set 3, and it seems to solve my problem fully: on the 96-vCPU 
instance with 48 I/O devices, I see about 4800% CPU utilization during query 
execution, and the queries run only marginally longer than when run against 
local files.

> Ensure S3 connection pool is large enough
> -----------------------------------------
>
>                 Key: ASTERIXDB-2945
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-2945
>             Project: Apache AsterixDB
>          Issue Type: Bug
>          Components: EXT - External data
>    Affects Versions: 0.9.7
>            Reporter: Wail Y. Alkowaileet
>            Assignee: Wail Y. Alkowaileet
>            Priority: Major
>             Fix For: 0.9.7
>
>
> This bug was found by [~ingomueller.net]. 
> S3Client has a default connection pool size of 15 concurrent connections. 
> When the number of partitions is greater than 15 (say, 96), threads wait for 
> a certain time for an available connection. If no connection becomes 
> available in that time, a timeout exception is thrown. Even if a thread can 
> wait long enough, the default size of the connection pool can hinder query 
> performance.
> To fix this, we need to make sure that the connection pool size is at least 
> equal to the number of partitions in the cluster.
>  
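
The sizing rule in the description above can be illustrated with a small, self-contained Java sketch. It does not use the AWS SDK or any AsterixDB code: the pool is modelled with a plain `java.util.concurrent.Semaphore`, and the class name `PoolSizing`, the methods `requiredPoolSize` and `countTimeouts`, and the 50 ms wait are all hypothetical names and values chosen for illustration. The default of 15 is taken from the issue description, not verified against the SDK.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

public class PoolSizing {
    // Default pool size as stated in the issue description (an assumption here).
    public static final int DEFAULT_POOL_SIZE = 15;

    // The proposed rule: the pool must hold at least one connection per partition.
    public static int requiredPoolSize(int numPartitions) {
        return Math.max(DEFAULT_POOL_SIZE, numPartitions);
    }

    // Simulates numPartitions concurrent readers that each need one connection.
    // A reader that gets a connection keeps it until every reader has made its
    // attempt, so the result does not depend on thread scheduling.
    // Returns how many readers gave up after waiting timeoutMs.
    public static int countTimeouts(int poolSize, int numPartitions, long timeoutMs) {
        Semaphore pool = new Semaphore(poolSize);
        CountDownLatch attempted = new CountDownLatch(numPartitions);
        AtomicInteger timeouts = new AtomicInteger();
        Thread[] readers = new Thread[numPartitions];

        for (int i = 0; i < numPartitions; i++) {
            readers[i] = new Thread(() -> {
                boolean acquired = false;
                try {
                    acquired = pool.tryAcquire(timeoutMs, TimeUnit.MILLISECONDS);
                    if (!acquired) {
                        timeouts.incrementAndGet(); // stands in for the timeout exception
                    }
                    attempted.countDown();
                    if (acquired) {
                        attempted.await(); // keep the connection busy, like a long scan
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    if (acquired) {
                        pool.release();
                    }
                }
            });
            readers[i].start();
        }

        for (Thread reader : readers) {
            try {
                reader.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return timeouts.get();
    }

    public static void main(String[] args) {
        int partitions = 96;
        System.out.println(countTimeouts(DEFAULT_POOL_SIZE, partitions, 50));
        System.out.println(countTimeouts(requiredPoolSize(partitions), partitions, 50));
    }
}
```

With 96 simulated partitions, the first call reports 81 readers timing out against a pool of 15, and the second reports 0 once the pool is sized to the partition count. This only models the waiting behaviour; how the actual patch configures the S3 client is not shown in this message.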



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
