[helix] branch master updated: add tutorial for Helix cloud support (#1172)

jxue Thu, 30 Jul 2020 13:45:44 -0700

This is an automated email from the ASF dual-hosted git repository.

jxue pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/helix.git



The following commit(s) were added to refs/heads/master by this push:
     new f50f9c1  add tutorial for Helix cloud support (#1172)
f50f9c1 is described below

commit f50f9c1d56a2a00ad15454cff4fdf71168a466db
Author: Meng Zhang <[email protected]>
AuthorDate: Thu Jul 30 13:44:28 2020 -0700

    add tutorial for Helix cloud support (#1172)
    
    Add tutorial for how to use cloud support in Helix for participant auto 
registration.
---
 .../src/site/markdown/tutorial_cloud_support.md    | 228 +++++++++++++++++++++
 .../src/site/markdown/tutorial_customized_view.md  |   8 +-
 .../images/ParticipantAutoRegistrationLogic.png    | Bin 0 -> 29541 bytes
 3 files changed, 232 insertions(+), 4 deletions(-)

diff --git a/website/1.0.1/src/site/markdown/tutorial_cloud_support.md 
b/website/1.0.1/src/site/markdown/tutorial_cloud_support.md
new file mode 100644
index 0000000..3f63179
--- /dev/null
+++ b/website/1.0.1/src/site/markdown/tutorial_cloud_support.md
@@ -0,0 +1,228 @@
+<!---
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements.  See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership.  The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License.  You may obtain a copy of the License at
+
+  http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied.  See the License for the
+specific language governing permissions and limitations
+under the License.
+-->
+
+<head>
+  <title>Tutorial - Cloud Support</title>
+</head>
+
+## [Helix Tutorial](./Tutorial.html): Cloud Support
+
+There are emerging cases to use Helix in a cloud environment, especially in 
those well-known public cloud, e.g. Azure, AWS, GCP, etc. Compared to previous 
on premise use cases, Helix has faced both challenges and opportunities in 
providing cloud related support. 
+
+As a first step, Helix implemented the support for participant auto 
registration in a cloud environment, which leverages the common feature of 
public cloud and facilitates Helix users when they need to create a cluster and 
register the participants to the cluster.
+
+After a Helix cluster is created, there are two ways to add instances (or 
participants, we will use them interchangeably in this tutorial) to the 
cluster. One is manual add, where users call their own scripts to add instance 
config to the cluster for each participant; and the other is auto join, where 
users set the auto join config of the cluster to be true, and each participant 
populates its own hostname, port number, and other information in instance 
config during connection. However,  [...]
+        
+In a cloud environment, there is a good opportunity to achieve full automation 
as public cloud providers give domain information to each individual instance 
through a metadata endpoint. All of the above mentioned public cloud providers 
use a unique non-routable IP address (169.254.169.254) for this purpose. It can 
be accessed only from within the instance to retrieve the instance metadata 
information, which you can use to configure or manage the running instance. 
+
+More specifically, in AWS, Azure, and GCP, the query to the fixed IP address  
http://169.254.169.254/ inside each instance will return its metadata 
information that contains domain information. In AWS, the field is named as 
"placement"; in Azure, the field is named as "PlatformUpdateDomain"; and in 
GCP, the field is named as "zone". It is usually just an integer denoting which 
fault domain the instance belongs to. This particular feature of public cloud 
is leveraged by Helix in cluster c [...]
+
+### What Helix Provides for Cloud Environment
+* Provide definition of Helix cloud configs at cluster level and participant 
level
+* Provide enhanced REST and Java APIs for cluster creation and cloud config 
update
+* Provide generic interface for fetching and parsing cloud instance information
+* Provide the implementation of Azure cloud instance information processor
+* Provide the implementation of participant auto registration logic
+
+### How to Use Cloud Support
+First of all, you need to make sure that your environment is indeed a cloud 
environment. Otherwise, all the assumptions in this support will not hold. Then 
depending on what kind of cloud environment you are in, you will need to 
perform some or all of the following steps.
+  
+#### Define Cloud config at Cluster Level
+Helix provides cloud configs at two different levels, one is at cluster level, 
and the other is at participant level. We describe them separately.
+
+At cluster level, we have a new Znode called CloudConfig. It has a few fields 
that store the relatively static cloud information for the whole cluster. 
Similar to other existing configs, Helix provides cloud config builder, 
validation, get/set functions. As the following table shows, the first two 
fields are required, and the last three fields are optional. 
+
+`CLOUD_ENABLED` must be set to true if the user would like to use Helix cloud 
support. `CLOUD_PROVIDER` is the type of the cloud environment. Besides the few 
well-known public cloud providers, the user can also define his/her cloud 
environment as "CUSTOMIZED". `CLOUD_ID` is an optional field. It can be used to 
record any specific metadata information the user would like to record, e.g. 
the ID for the particular cluster inside a cloud environment, etc. If the user 
chooses to use the provi [...]
+          
+| Field | Meaning |
+| ------------------------- | ------ |
+| CLOUD_ENABLED | determine whether the cluster is inside cloud environment 
and use Helix cloud support |
+| CLOUD_PROVIDER | denote what kind of cloud environment the cluster is in, 
e.g. Azure, AWS, GCP, or CUSTOMIZED |
+| CLOUD_ID | the specific id in cloud environment that belongs to this cluster 
|
+| CLOUD_INFO_SOURCE | the source for retrieving the cloud information. |
+| CLOUD_INFO_PROCESSOR_NAME | the name of the function that processes the 
fetching and parsing of cloud information |
+
+Users could use either REST API or Java API to set these configs.
+
+##### REST API Examples
+Helix enhanced current cluster creation REST API as well as Java API with 
extra fields that represent cloud related input. For example, in the following 
modified `createCluster` API, `addCloudConfig` is a Boolean value denotes 
whether to create with cloud config, and the `cloudConfigManifest` is the cloud 
config string, which will be converted to a Znode record.
+
+```
+@PUT
+  @Path("{clusterId}")
+  public Response createCluster(@PathParam("clusterId") String clusterId, 
@DefaultValue("false") @QueryParam("recreate") String recreate,
+      @DefaultValue("false") @QueryParam("addCloudConfig") String 
addCloudConfig, String cloudConfigManifest)
+```
+
+Besides the enhanced cluster creation API, Helix also provides a set of cloud 
specific APIs in Java and REST that handles the get/add/update of cloud config. 
+   
+* Create cluster and add meanwhile cloud config related information to ZK.
+
+```
+$ curl -X PUT -H "Content-Type: application/json" 
http://localhost:1234/admin/v2/clusters/myCluster?addCloudConfig=true -d '
+{
+    "simpleFields" : 
+    {
+        "CLOUD_ENABLED" : "true",
+        "CLOUD_PROVIDER": "AWS",
+        "CLOUD_ID" : "12345"
+        "CLOUD_INFO_SOURCE": {"http://169.254.169.254/"}
+        "CLOUD_INFO_PROCESSOR_NAME": "AWSCloudInformationProcesser"
+    }
+}'
+```
+
+* Add cloud config to an existing cluster
+
+```
+$ curl -X PUT -H "Content-Type: application/json" 
http://localhost:1234/admin/v2/clusters/myCluster/cloudconfig -d '
+{
+    "simpleFields" : 
+    {
+        "CLOUD_ENABLED" : "true",
+        "CLOUD_PROVIDER": "AWS",
+        "CLOUD_ID" : "12345"
+        "CLOUD_INFO_SOURCE": {"http://169.254.169.254/"}
+        "CLOUD_INFO_PROCESSOR_NAME": "AWSCloudInformationProcesser"
+    }
+}'
+```
+
+* Delete the cloud config of a cluster
+
+```
+$ curl -X DELETE http://localhost:1234/admin/v2/clusters/myCluster/cloudconfig
+```
+
+* Get the cloud config of a cluster
+
+```
+$ curl -X GET http://localhost:1234/admin/v2/clusters/myCluster/cloudconfig
+```
+
+* Update the cloud config of a cluster
+
+```
+$ curl -X POST -H "Content-Type: application/json"  
http://localhost:1234/admin/v2/clusters/myCluster/cloudconfig?command=update -d 
'
+{
+    "simpleFields" : {
+        "CLOUD_ID" : "12345"
+    }
+}'
+```
+
+* Delete some of the fields in the cloud config 
+
+```
+$ curl -X POST -H "Content-Type: application/json" 
http://localhost:1234/admin/v2/clusters/myCluster/cloudconfig?command=delete -d 
'
+{
+    "simpleFields" : {
+        "CLOUD_ID" : "12345"
+    }
+}'
+```
+
+#### Define Cloud Configs at Participant Level
+At participant level, Helix allows users to provide detailed cloud properties, 
which is more related to the participant’s actions. For default value, Helix 
provides [Azure cloud 
properties](https://github.com/apache/helix/blob/master/helix-core/src/main/resources/azure-cloud.properties),
 including not only the default cluster level config values for Azure, like 
`CLOUD_INFO_SOURCE` and `CLOUD_INFO_PROCESSOR_NAME`, but also some participant 
specific values, such as http timeout when queryi [...]
+
+```
+cloud_info_source=http://169.254.169.254/metadata/instance?api-version=2019-06-04
+cloud_info_processor_name=AzureCloudInstanceInformationProcessor
+cloud_max_retry=5
+connection_timeout_ms=5000
+request_timeout_ms=5000
+```
+   
+If users would like to use their customized config for participants, they can 
input the specific properties through [Helix Manager 
Property](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/HelixManagerProperty.java)
 when composing [Zk Helix 
Manager](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/manager/zk/ZKHelixManager.java),
 which is passed into participants. [Helix Manager 
Property](https://github.com/apache [...]
+    
+```
+public ZKHelixManager(String clusterName, String instanceName, InstanceType 
instanceType,
+    String zkAddress, HelixManagerStateListener stateListener, 
HelixManagerProperty helixManagerProperty)
+```
+
+#### Implement Cloud Instance Information (if necessary)
+Helix has predefined a few fields in the interface of 
[CloudInstanceInformation](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/api/cloud/CloudInstanceInformation.java)
 that are commonly used in participant. Users are free to define other fields 
in the instance metadata that they think are useful. 
+  
+```
+public interface CloudInstanceInformation {
+  /**
+   * Get the the value of a specific cloud instance field by name
+   * @return the value of the field
+   */
+  String get(String key);
+
+  /**
+   * The enum contains all the required cloud instance field in Helix
+   */
+  enum CloudInstanceField {
+    INSTANCE_NAME,
+    FAULT_DOMAIN,
+    INSTANCE_SET_NAME
+  }
+}
+```
+
+#### Implement Cloud Instance Information Processor (if necessary)
+Helix provides an interface to fetch and parse cloud instance information as 
follows. Helix makes the interface generic enough so that users may implement 
their own fetching and parsing logic with maximum freedom. 
+
+```
+/**
+ * Generic interface to fetch and parse cloud instance information
+ */
+public interface CloudInstanceInformationProcessor<T extends Object> {
+
+  /**
+   * Get the raw cloud instance information
+   * @return raw cloud instance information
+   */
+  List<T> fetchCloudInstanceInformation();
+
+  /**
+   * Parse the raw cloud instance information in responses and compose 
required cloud instance information
+   * @return required cloud instance information
+   */
+  CloudInstanceInformation parseCloudInstanceInformation(List<T> responses);
+}
+```
+
+Helix also implements an example processor for Azure in 
[AzureCloudInstanceInformationProcessor](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/cloud/azure/AzureCloudInstanceInformationProcessor.java).
 The implementation includes fetching and parsing Azure cloud instance 
information. The fetching function retrieves instance information from Azure 
AIMS, and the parsing function validates the response, and selects the fields 
needed for participant aut [...]
+
+If users need to use Helix in another cloud environment, they can implement 
their own information processor. The implementation would be similar to Azure 
implementation, and the main difference would be in how to parse the response 
and retrieve interested fields. 
+   
+#### Config Properly for Participant Auto Registration to Work
+To make the participant auto registration work, users would need to make sure 
their cluster config is set properly. The most important one is the 
`allowParticipantAutoJoin` field in cluster config.
+
+```
+{
+    "id": "clusterName",
+    "listFields": {},
+    "mapFields": {},
+    "simpleFields": {
+        ......
+        "allowParticipantAutoJoin": "true"
+        ......
+    }
+}
+```
+
+This field is used in participant manager logic as a prerequisite for 
participants to do auto registration. The detailed logic is shown in the 
following flow chart. The related code is in [Participant 
Manager](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/manager/zk/ParticipantManager.java).
+
+![Participant Auto Registration 
Logic](./images/ParticipantAutoRegistrationLogic.png)
+
+If the participant decides that it should do auto registration based on the 
config, it will first query cloud config and decide what environment it is in. 
Based on this information, the participant will call the corresponding cloud 
instance information processor. Then with all the information, especially the 
domain information, the participant can auto register to the cluster without 
any manual effort.
diff --git a/website/1.0.1/src/site/markdown/tutorial_customized_view.md 
b/website/1.0.1/src/site/markdown/tutorial_customized_view.md
index 6ff9b11..1dad740 100644
--- a/website/1.0.1/src/site/markdown/tutorial_customized_view.md
+++ b/website/1.0.1/src/site/markdown/tutorial_customized_view.md
@@ -33,7 +33,7 @@ The following figure shows the high level architecture of 
customized view aggreg
 ### Terminologies
 * Customized state: A per partition state defined by users in a string format. 
Customized state exists under each participant. It may include different types 
of states. Each type of state is represented as a Znode itself and has 
different resources as its child Znode.
 * Customized state config: A cluster level config specifically used for 
customized state related config. For example, it can include a list of 
customized states that should be aggregated. 
-* Customized view: An aggregation result for customized states across all 
participants. It exists under the cluster and can also have a few different 
types of states depending on users' input. Each type of state is represented as 
a Znode itself and has different resources as it child Znode.
+* Customized view: An aggregation result for customized states across all 
participants. It exists under the cluster and can also have a few different 
types of states depending on users' input. Each type of state is represented as 
a Znode itself and has different resources as its child Znode.
 
 ### How to Use Customized View
 
@@ -42,7 +42,7 @@ Users are responsible for updating customized states in their 
application code.
 
 After instantiation, users should call the function in the factory with user 
defined parameters to build a [Customized State 
Provider](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/customizedstate/CustomizedStateProvider.java)
 object. 
     
-There are two ways to build [Customized State 
Provider](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/customizedstate/CustomizedStateProvider.java),
 and the difference is what kind of 
[HelixManager](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/HelixManager.java)
 is passed in. As the following code shows, the first one relies on Helix 
provided manager, while the second one needs a user-created Helix manager. 
+There are two ways to build [Customized State 
Provider](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/customizedstate/CustomizedStateProvider.java),
 and the difference is what kind of 
[HelixManager](https://github.com/apache/helix/blob/master/helix-core/src/main/java/org/apache/helix/HelixManager.java)
 is passed in. As the following code shows, the first one relies on a Helix 
provided manager, while the second one needs a user-created Helix manager. 
 
 ```
   public CustomizedStateProvider buildCustomizedStateProvider(String 
instanceName,
@@ -80,11 +80,11 @@ Here are some additional guidelines about how to use 
[Customized State Provider]
 
 * When a user would like to drop a certain instance by calling Helix delete 
instance API, Helix will delete the instance as well as all sub-paths under it 
with recursive deletion. Therefore, the customized state will also be deleted, 
and customized view will be updated when the instance is gone.
 
-* When a user would like to drop a certain resource by calling Helix delete 
resource API, customer will be responsible for deleting customized state of all 
partitions for that resource across all instances. This operation can be 
implemented in users' state transition logic.
+* When a user would like to drop a certain resource by calling Helix delete 
resource API, he/she will be responsible for deleting customized state of all 
partitions for that resource across all instances. This operation can be 
implemented in users' state transition logic.
 
 * When Helix rebalance happens, and a certain partition on a certain instance 
is moved to another instance, customers will need to handle the cleanup in the 
callback function currently provided by Helix in the state transition logic. 
 
-* When an unexpected disconnection happened in client side from Zookeeper, but 
does not trigger rebalance, Helix will still keep the customized state as it is 
and wait for the connection to be reset. 
+* When an unexpected disconnection happens in client side from Zookeeper, but 
does not trigger rebalance, Helix will still keep the customized state as it is 
and wait for the connection to be reset. 
 
 
 #### Enable Customized State Aggregation in Config
diff --git 
a/website/1.0.1/src/site/resources/images/ParticipantAutoRegistrationLogic.png 
b/website/1.0.1/src/site/resources/images/ParticipantAutoRegistrationLogic.png
new file mode 100644
index 0000000..cb0ac19
Binary files /dev/null and 
b/website/1.0.1/src/site/resources/images/ParticipantAutoRegistrationLogic.png 
differ

[helix] branch master updated: add tutorial for Helix cloud support (#1172)

Reply via email to