Elek, Marton created HDDS-1397:
----------------------------------

             Summary: Avoid the usage of signal handlers in datanodes of the 
MiniOzoneClusters
                 Key: HDDS-1397
                 URL: https://issues.apache.org/jira/browse/HDDS-1397
             Project: Hadoop Distributed Data Store
          Issue Type: Bug
            Reporter: Elek, Marton
            Assignee: Elek, Marton


[~arpaga] showed me a problem that TestQueryNode.testHealthyNodesCount is 
failed in the CI check of HDDS-1339.

According to the logs the test is timed out because only 4 datanodes are 
started out of the 5.

The log also contained an exception from one datanode:

{code}
2019-04-04 00:26:33,583 WARN  ozone.HddsDatanodeService 
(LogAdapter.java:warn(59)) - failed to register any UNIX signal loggers: 
java.lang.IllegalStateException: Can't re-install the signal handlers.
    at org.apache.hadoop.util.SignalLogger.register(SignalLogger.java:77)
    at 
org.apache.hadoop.util.StringUtils.startupShutdownMessage(StringUtils.java:718)
    at 
org.apache.hadoop.util.StringUtils.startupShutdownMessage(StringUtils.java:707)
    at 
org.apache.hadoop.ozone.HddsDatanodeService.createHddsDatanodeService(HddsDatanodeService.java:126)
    at 
org.apache.hadoop.ozone.HddsDatanodeService.createHddsDatanodeService(HddsDatanodeService.java:108)
    at 
org.apache.hadoop.ozone.MiniOzoneClusterImpl$Builder.createHddsDatanodes(MiniOzoneClusterImpl.java:552)
{code}

The code which requires the signal handler is the following (signal handler is 
registered in the startupShutdownMessage)

{code}
  /**
   * Create an Datanode instance based on the supplied command-line arguments.
   * <p>
   * This method is intended for unit tests only. It suppresses the
   * startup/shutdown message and skips registering Unix signal handlers.
   *
   * @param args        command line arguments.
   * @param conf        HDDS configuration
   * @param printBanner if true, then log a verbose startup message.
   * @return Datanode instance
   */
  private static HddsDatanodeService createHddsDatanodeService(
      String[] args, Configuration conf, boolean printBanner) {
    if (args.length == 0 && printBanner) {
      StringUtils
          .startupShutdownMessage(HddsDatanodeService.class, args, LOG);
      return new HddsDatanodeService(conf);
    } else {
      new HddsDatanodeService().run(args);
      return null;
   }
{code}

As you can read from the comment it's expected to be called with 
printBanner=false to avoid the creation of the signal handler. 

Note: In the startupShutdownMessage method a new signal handler is registered 
and signal handlers can be registered only once:

{code}
//SignalLogger
  void register(final LogAdapter LOG) {
   if (registered) {
      throw new IllegalStateException("Can't re-install the signal handlers.");
    }
 ....
{code}

We have a dedicated method to create datanode service for the unit tests. The 
only thing what we need is to turn OFF the signal handler registration here. 
(The following code fragment shows the original state where the signal handler 
creation is requested with the true parameter value)

{code}
  @VisibleForTesting
  public static HddsDatanodeService createHddsDatanodeService(
      String[] args, Configuration conf) {
    return createHddsDatanodeService(args, conf, true);
  }
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org

Reply via email to