[ 
https://issues.apache.org/jira/browse/HDDS-1506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16837712#comment-16837712
 ] 

Ajay Kumar commented on HDDS-1506:
----------------------------------

[~elek] thanks for filing this. Have you thought of a workaround for this? I 
think it is specific to container environments; maybe we can do special 
handling just for docker/k8s.

> Ozone Manager can't be started with existing reverse dns
> --------------------------------------------------------
>
>                 Key: HDDS-1506
>                 URL: https://issues.apache.org/jira/browse/HDDS-1506
>             Project: Hadoop Distributed Data Store
>          Issue Type: Sub-task
>    Affects Versions: 0.4.0
>            Reporter: Elek, Marton
>            Assignee: Elek, Marton
>            Priority: Major
>
> I tried to start the current ozone in kubernetes and ozone manager 
> initialization failed with the following error:
> {code}
> 2019-05-09 08:40:23 INFO  OzoneManager:51 - registered UNIX signal handlers 
> for [TERM, HUP, INT]
> 2019-05-09 08:40:23 WARN  ScmUtils:63 - ozone.om.db.dirs is not configured. 
> We recommend adding this setting. Falling back to ozone.metadata.dirs instead.
> 2019-05-09 08:40:24 INFO  OzoneManager:1029 - Initializing secure 
> OzoneManager.
> 2019-05-09 08:40:24 ERROR OMCertificateClient:209 - Default certificate 
> serial id is not set. Can't locate the default certificate for this client.
> 2019-05-09 08:40:24 INFO  OMCertificateClient:588 - Certificate client init 
> case: 0
> 2019-05-09 08:40:24 INFO  OMCertificateClient:55 - Creating keypair for 
> client as keypair and certificate not found.
> 2019-05-09 08:40:24 INFO  OzoneManager:1035 - Init response: GETCERT
> 2019-05-09 08:40:24 INFO  OzoneSecurityUtil:103 - Adding 
> ip:192.168.11.208,host:om-0.om.default.svc.cluster.local
> 2019-05-09 08:40:24 INFO  OzoneSecurityUtil:107 - ip:127.0.0.1,host:localhost 
> not returned.
> 2019-05-09 08:40:24 ERROR OzoneManager:1421 - Incorrect om rpc address. 
> omRpcAdd:om-0.om:9862
> 2019-05-09 08:40:24 ERROR OzoneManager:888 - Failed to start the OzoneManager.
> java.lang.RuntimeException: Can't get SCM signed certificate. omRpcAdd: 
> om-0.om:9862
>       at 
> org.apache.hadoop.ozone.om.OzoneManager.getSCMSignedCert(OzoneManager.java:1422)
>       at 
> org.apache.hadoop.ozone.om.OzoneManager.initializeSecurity(OzoneManager.java:1041)
>       at org.apache.hadoop.ozone.om.OzoneManager.omInit(OzoneManager.java:994)
>       at 
> org.apache.hadoop.ozone.om.OzoneManager.createOm(OzoneManager.java:951)
>       at org.apache.hadoop.ozone.om.OzoneManager.main(OzoneManager.java:882)
> 2019-05-09 08:40:24 INFO  ExitUtil:210 - Exiting with status 1: 
> java.lang.RuntimeException: Can't get SCM signed certificate. omRpcAdd: 
> om-0.om:9862
> 2019-05-09 08:40:24 INFO  OzoneManager:51 - SHUTDOWN_MSG: 
> {code}
> The root of the problem is this method in OzoneManager:
> {code}
>  private static void getSCMSignedCert(CertificateClient client,
>       OzoneConfiguration config, OMStorage omStore) throws IOException {
>    ...
>     omRpcAdd = OmUtils.getOmAddress(config);
>     if (omRpcAdd == null || omRpcAdd.getAddress() == null) {
>       LOG.error("Incorrect om rpc address. omRpcAdd:{}", omRpcAdd);
>       throw new RuntimeException("Can't get SCM signed certificate. " +
>           "omRpcAdd: " + omRpcAdd);
>     }
> {code}
> In my case omRpcAdd.getAddress() seems to be null at the time of 
> initialization, as the reverse dns entry becomes available only once om is 
> started. This is a classic chicken/egg problem: we need reverse dns for 
> initialization, but the reverse dns entry is added only when the container 
> is started. 
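
To illustrate the check quoted above and one possible container-only workaround, here is a minimal sketch. It relies only on standard java.net behaviour: an InetSocketAddress whose host name did not resolve reports isUnresolved() == true and returns null from getAddress(). The class OmAddressResolution, the method waitForResolution and its parameters are hypothetical names made up for this example; they are not part of Ozone, and the bounded retry is an assumption about what "special handling" could look like, not a proposed patch.

```java
import java.net.InetSocketAddress;
import java.util.function.Supplier;

// Hypothetical helper for illustration only; not part of the Ozone code base.
final class OmAddressResolution {

  /**
   * Rebuilds the socket address until it resolves or the attempts run out.
   * A fresh InetSocketAddress is requested on every attempt because an
   * existing instance never re-resolves its host name after construction.
   * The caller still has to check isUnresolved() on the returned value.
   */
  static InetSocketAddress waitForResolution(
      Supplier<InetSocketAddress> factory, int maxAttempts, long sleepMillis) {
    InetSocketAddress addr = factory.get();
    for (int attempt = 1;
         addr.isUnresolved() && attempt < maxAttempts;
         attempt++) {
      try {
        Thread.sleep(sleepMillis);
      } catch (InterruptedException e) {
        // Preserve the interrupt flag and give up early.
        Thread.currentThread().interrupt();
        return addr;
      }
      addr = factory.get();
    }
    return addr;
  }

  public static void main(String[] args) {
    // An unresolved address has no InetAddress; this is the condition that
    // the quoted getSCMSignedCert check trips over.
    InetSocketAddress unresolved =
        InetSocketAddress.createUnresolved("om-0.om", 9862);
    System.out.println(unresolved.getAddress()); // prints "null"

    // Retry for a short while; the outcome depends on the local DNS setup.
    InetSocketAddress addr = waitForResolution(
        () -> new InetSocketAddress("om-0.om", 9862), 5, 1000L);
    System.out.println(addr + " unresolved=" + addr.isUnresolved());
  }
}
```

Whether retrying helps depends on when the DNS entry is published relative to container start, so this only narrows the window described above. The JVM also caches failed lookups for a short time, which can make very short sleep intervals ineffective.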



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
