[
https://issues.apache.org/jira/browse/AMBARI-5800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14003543#comment-14003543
]
Sudhir Prakash commented on AMBARI-5800:
----------------------------------------
I don't have the log for the 55s startup, but that was during the initial
startup immediately after installation. Here is a log where it took 29 secs:
{code}
2014-05-16 13:51:20,850 INFO [main]: server.HiveServer2
(HiveStringUtils.java:startupShutdownMessage(605)) - STARTUP_MSG:
/************************************************************
STARTUP_MSG: Starting HiveServer2
STARTUP_MSG: host = hadoopvm1-1/39.0.8.1
STARTUP_MSG: args = [-hiveconf, hive.metastore.uris=", ", -hiveconf,
hive.log.file=hiveserver2.log, -hiveconf,
hive.log.dir=/var/opt/teradata/log/hive]
STARTUP_MSG: version = 0.13.0.2.1.2.0-402
STARTUP_MSG: classpath = /etc/hadoop/conf:/usr/lib/hadoop/lib....<truncated>
STARTUP_MSG: build =
git://s111-s8/var/opt/teradata/workspace/BIGTOP-HDP_RPM_REPO-baikal-GA-suse11/bigtop/build/hive/rpm/BUILD/hive-0.13.0.2.1.2.0
-r c19175be5345f7d4bc7ca0a8f95a09a4330de2f4; compiled by 'jenkins' on Sun Apr
27 18:52:34 EDT 2014
************************************************************/
2014-05-16 13:51:48,369 INFO [main]: hive.metastore
(HiveMetaStoreClient.java:open(297)) - Trying to connect to metastore with URI
thrift://byn001-1:9083
2014-05-16 13:51:48,436 INFO [main]: hive.metastore
(HiveMetaStoreClient.java:open(385)) - Connected to metastore.
2014-05-16 13:51:49,298 INFO [main]: session.SessionState
(SessionState.java:start(361)) - No Tez session required at this point.
hive.execution.engine=mr.
2014-05-16 13:51:49,307 INFO [main]: service.CompositeService
(SessionManager.java:init(70)) - HiveServer2: Async execution thread pool size:
100
2014-05-16 13:51:49,307 INFO [main]: service.CompositeService
(SessionManager.java:init(72)) - HiveServer2: Async execution wait queue size:
100
2014-05-16 13:51:49,308 INFO [main]: service.CompositeService
(SessionManager.java:init(74)) - HiveServer2: Async execution thread keepalive
time: 10
2014-05-16 13:51:49,309 INFO [main]: service.AbstractService
(AbstractService.java:init(89)) - Service:OperationManager is inited.
2014-05-16 13:51:49,309 INFO [main]: service.AbstractService
(AbstractService.java:init(89)) - Service:SessionManager is inited.
2014-05-16 13:51:49,309 INFO [main]: service.AbstractService
(AbstractService.java:init(89)) - Service:CLIService is inited.
2014-05-16 13:51:49,309 INFO [main]: service.AbstractService
(AbstractService.java:init(89)) - Service:ThriftBinaryCLIService is inited.
2014-05-16 13:51:49,310 INFO [main]: service.AbstractService
(AbstractService.java:init(89)) - Service:HiveServer2 is inited.
2014-05-16 13:51:49,310 INFO [main]: service.AbstractService
(AbstractService.java:start(104)) - Service:OperationManager is started.
2014-05-16 13:51:49,310 INFO [main]: service.AbstractService
(AbstractService.java:start(104)) - Service:SessionManager is started.
2014-05-16 13:51:49,310 INFO [main]: service.AbstractService
(AbstractService.java:start(104)) - Service:CLIService is started.
2014-05-16 13:51:49,472 INFO [main]: hive.metastore
(HiveMetaStoreClient.java:open(297)) - Trying to connect to metastore with URI
thrift://byn001-1:9083
2014-05-16 13:51:49,476 INFO [main]: hive.metastore
(HiveMetaStoreClient.java:open(385)) - Connected to metastore.
2014-05-16 13:51:49,525 INFO [main]: service.AbstractService
(AbstractService.java:start(104)) - Service:ThriftBinaryCLIService is started.
2014-05-16 13:51:49,526 INFO [main]: service.AbstractService
(AbstractService.java:start(104)) - Service:HiveServer2 is started.
2014-05-16 13:51:49,701 INFO [Thread-6]: thrift.ThriftCLIService
(ThriftBinaryCLIService.java:run(88)) - ThriftBinaryCLIService listening on
0.0.0.0/0.0.0.0:10000
{code}
> Race condition when starting all services causing Hive service check to fail
> ----------------------------------------------------------------------------
>
> Key: AMBARI-5800
> URL: https://issues.apache.org/jira/browse/AMBARI-5800
> Project: Ambari
> Issue Type: Bug
> Affects Versions: 1.6.0
> Environment: SLES11
> ambari-server-1.6.0-39
> hive-0.13.0.2.1.2.0-402
> Reporter: Sudhir Prakash
> Priority: Critical
>
> # I performed an install on a 7 node cluster
> # During the install, I noticed that the Hive service check failed with the
> error: {{Test connectivity to hive server Connection to byn001-1 on port
> 10000 failed: [Errno 111] Connection refused}}
> # I proceeded through the rest of the install wizard
> # Stop All
> # Start All and noticed the same error again
> I retried stop all/start all this time monitoring the Ambari start progess,
> the Hive Server2 logs, and a netstat of port 10000. What I noticed is that
> immediately after the start Hive is issued, the service check is run and
> fails. However, it takes about 55 seconds for HiveServer2 to actually start
> and claim port 10000.
> The start up sequence needs to be modified to wait for Hive to finish
> starting before running the service check.
> This issue is easily reproducible and has been seen by multiple people there.
--
This message was sent by Atlassian JIRA
(v6.2#6252)