[ https://issues.apache.org/jira/browse/AMBARI-12657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Hurley updated AMBARI-12657:
-------------------------------------
    Attachment: AMBARI-12657.patch

> Cluster creates fail on larger deployments with SQL Azure DB
> ------------------------------------------------------------
>
>                 Key: AMBARI-12657
>                 URL: https://issues.apache.org/jira/browse/AMBARI-12657
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-server
>    Affects Versions: 2.0.0
>            Reporter: Jonathan Hurley
>            Assignee: Jonathan Hurley
>            Priority: Critical
>             Fix For: 2.1.1
>
>         Attachments: AMBARI-12657.patch
>
>
> We started creating larger clusters (48 worker nodes) with SQL Azure DB as 
> the Ambari DB, and we are seeing HTTP GET requests time out on the client 
> side (even after retries), resulting in cluster create failures (15%). This 
> is a tracking Jira to resolve the CRUD failures.
> In some of my experiments with 48-node clusters, DB CPU usage goes above 50%. 
> This might explain why SQL is running slowly.
> end_time                 avg_cpu_percent  avg_data_io_percent  avg_log_write_percent  avg_memory_usage_percent
> 2015-08-05 18:51:24.153            40.89                 0.00                   0.62                      0.67
> 2015-08-05 18:51:09.107            41.86                 0.00                   1.49                      0.67
> 2015-08-05 18:50:54.090            24.36                 0.00                   0.08                      0.67
> 2015-08-05 18:50:38.763            43.16                 0.00                   0.57                      0.67
> 2015-08-05 18:50:23.700            65.03                 0.00                   0.51                      0.67
> 2015-08-05 18:50:07.840            28.57                 0.00                   0.45                      0.67
> 2015-08-05 18:49:49.480            39.78                 0.00                   0.42                      0.67
> The most expensive queries in terms of CPU time are below. 
> Basically, this one query consumes most of the CPU. The query plan is 
> also attached.
> {code}
> SELECT DISTINCT t0.request_id FROM host_role_command t0 WHERE NOT EXISTS 
> (SELECT @P0 FROM host_role_command t1 WHERE (t1.status IN 
> (@P1,@P2,@P3,@P4,@P5,@P6,@P7,@P8,@P9)))  ORDER BY t0.request_id ASC
> {code}
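
For illustration only: the subquery in the statement above has no predicate tying t1 to t0, so as written NOT EXISTS evaluates to the same value for every outer row. The sketch below runs on SQLite through Python rather than on SQL Azure, uses a hypothetical three-column stand-in for host_role_command, and assumes three status values in place of the nine bound parameters. It compares the uncorrelated form with a correlated variant. Whether a correlated form is what was intended, or what the attached patch changes, is not stated in this message.

```python
import sqlite3

# Hypothetical, minimal stand-in for host_role_command: only the columns the
# query touches are modelled, and the status values are assumptions.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE host_role_command ("
    "task_id INTEGER PRIMARY KEY, request_id INTEGER, status TEXT)"
)
conn.executemany(
    "INSERT INTO host_role_command VALUES (?, ?, ?)",
    [
        (1, 1, "COMPLETED"),
        (2, 1, "COMPLETED"),
        (3, 2, "IN_PROGRESS"),
        (4, 3, "COMPLETED"),
    ],
)

# Assumed stand-ins for the bound parameters @P1..@P9 in the reported query.
STATUSES = ("PENDING", "QUEUED", "IN_PROGRESS")
placeholders = ",".join("?" * len(STATUSES))

# Same shape as the reported query: the subquery never references t0.
uncorrelated = f"""
SELECT DISTINCT t0.request_id FROM host_role_command t0
WHERE NOT EXISTS (
    SELECT 1 FROM host_role_command t1
    WHERE t1.status IN ({placeholders}))
ORDER BY t0.request_id ASC
"""

# Variant with an added join predicate, so each request is checked on its own.
correlated = f"""
SELECT DISTINCT t0.request_id FROM host_role_command t0
WHERE NOT EXISTS (
    SELECT 1 FROM host_role_command t1
    WHERE t1.request_id = t0.request_id
      AND t1.status IN ({placeholders}))
ORDER BY t0.request_id ASC
"""

def request_ids(sql):
    """Run one of the queries above and return the request_id column as a list."""
    return [row[0] for row in conn.execute(sql, STATUSES)]

print("uncorrelated:", request_ids(uncorrelated))
print("correlated:  ", request_ids(correlated))
```

With this sample data the uncorrelated form returns no rows, because a single matching row anywhere in the table makes NOT EXISTS false for every request. The correlated form returns only the requests that have no command in the listed statuses.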



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
