[ 
https://issues.apache.org/jira/browse/GOBBLIN-1837?focusedWorklogId=863328&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-863328
 ]

ASF GitHub Bot logged work on GOBBLIN-1837:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 01/Jun/23 21:43
            Start Date: 01/Jun/23 21:43
    Worklog Time Spent: 10m 
      Work Description: umustafi commented on code in PR #3700:
URL: https://github.com/apache/gobblin/pull/3700#discussion_r1213713080


##########
gobblin-runtime/src/main/java/org/apache/gobblin/runtime/api/MysqlSchedulerLeaseDeterminationStore.java:
##########
@@ -0,0 +1,207 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.gobblin.runtime.api;
+
+import java.io.IOException;
+import java.sql.Connection;
+import java.sql.PreparedStatement;
+import java.sql.ResultSet;
+import java.sql.SQLException;
+import java.sql.Timestamp;
+
+import com.google.inject.Inject;
+import com.typesafe.config.Config;
+
+import javax.sql.DataSource;
+
+import org.apache.gobblin.broker.SharedResourcesBrokerFactory;
+import org.apache.gobblin.configuration.ConfigurationKeys;
+import org.apache.gobblin.metastore.MysqlDataSourceFactory;
+import org.apache.gobblin.service.ServiceConfigKeys;
+import org.apache.gobblin.util.ConfigUtils;
+
+
+public class MysqlSchedulerLeaseDeterminationStore implements 
SchedulerLeaseDeterminationStore {
+  public static final String CONFIG_PREFIX = 
"MysqlSchedulerLeaseDeterminationStore";
+
+  protected final DataSource dataSource;
+  private final DagActionStore dagActionStore;
+  private final String tableName;
+  private final long epsilon;
+  private final long linger;
+  /* TODO:
+     - define retention on this table
+     - initialize table with epsilon and linger if one already doesn't exist 
using these configs
+     - join with table above to ensure epsilon/linger values are consistent 
across hosts (in case hosts are deployed with different configs)
+   */
+  protected static final String WHERE_CLAUSE_TO_MATCH_ROW = "WHERE 
flow_group=? AND flow_name=? AND flow_execution_id=? "
+      + "AND flow_action=? AND ABS(trigger_event_timestamp-?) <= %s";
+  protected static final String 
ATTEMPT_INSERT_AND_GET_PURSUANT_TIMESTAMP_STATEMENT = "INSERT INTO %s 
(flow_group, "
+      + "flow_name, flow_execution_id, flow_action, trigger_event_timestamp) 
VALUES (?, ?, ?, ?, ?) WHERE NOT EXISTS ("
+      + "SELECT * FROM %s " + WHERE_CLAUSE_TO_MATCH_ROW + "; SELECT 
ROW_COUNT() AS rows_inserted_count, "
+      + "pursuant_timestamp FROM %s " + WHERE_CLAUSE_TO_MATCH_ROW;
+
+  protected static final String UPDATE_PURSUANT_TIMESTAMP_STATEMENT = "UPDATE 
%s SET pursuant_timestamp = NULL "
+      + WHERE_CLAUSE_TO_MATCH_ROW;
+  private static final String CREATE_TABLE_STATEMENT = "CREATE TABLE IF NOT 
EXISTS %S (" + "flow_group varchar("
+      + ServiceConfigKeys.MAX_FLOW_GROUP_LENGTH + ") NOT NULL, flow_name 
varchar("
+      + ServiceConfigKeys.MAX_FLOW_GROUP_LENGTH + ") NOT NULL, " + 
"flow_execution_id varchar("
+      + ServiceConfigKeys.MAX_FLOW_EXECUTION_ID_LENGTH + ") NOT NULL, 
flow_action varchar(100) NOT NULL, "
+      + "trigger_event_timestamp TIMESTAMP DEFAULT CURRENT_TIMESTAMP, "
+      + "pursuant_timestamp TIMESTAMP DEFAULT CURRENT_TIMESTAMP,"
+      + "PRIMARY KEY 
(flow_group,flow_name,flow_execution_id,flow_action,trigger_event_timestamp)";

Review Comment:
   Removing it from the primary key so that events for the same flow action 
are consolidated, and adding an explanation to the JavaDoc.
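
   A minimal sketch of the consolidation idea, not the code in this PR. It 
assumes the intent is that two triggers for the same flow action whose 
timestamps differ by at most epsilon resolve to a single row, as the 
ABS(trigger_event_timestamp-?) <= %s clause above suggests. The class 
EventConsolidationSketch, the method recordTrigger, and the in-memory map that 
stands in for the MySQL table are all hypothetical names used only for 
illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical in-memory stand-in for the lease table; not Gobblin code. */
public class EventConsolidationSketch {
  private final long epsilonMillis;

  // Rows are grouped by (flow_group, flow_name, flow_execution_id, flow_action).
  // The trigger timestamp is deliberately NOT part of the key (assumption).
  private final Map<List<String>, List<Long>> timestampsByFlowAction = new HashMap<>();

  public EventConsolidationSketch(long epsilonMillis) {
    this.epsilonMillis = epsilonMillis;
  }

  /**
   * Records a trigger event.
   *
   * @return true if a new row was stored, false if the event was consolidated
   *         with an existing row whose timestamp is within epsilon
   */
  public boolean recordTrigger(String flowGroup, String flowName, String flowExecutionId,
      String flowAction, long triggerEventMillis) {
    List<String> key = Arrays.asList(flowGroup, flowName, flowExecutionId, flowAction);
    List<Long> existing = timestampsByFlowAction.computeIfAbsent(key, k -> new ArrayList<>());
    for (long storedMillis : existing) {
      // Mirrors ABS(trigger_event_timestamp - ?) <= epsilon from the WHERE clause.
      if (Math.abs(storedMillis - triggerEventMillis) <= epsilonMillis) {
        return false;
      }
    }
    existing.add(triggerEventMillis);
    return true;
  }
}
```

   With the timestamp in the key, two hosts reporting the same trigger a few 
milliseconds apart would produce two distinct rows. Matching on the flow action 
plus an epsilon window lets the second report fold into the first.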





Issue Time Tracking
-------------------

    Worklog Id:     (was: 863328)
    Time Spent: 7h 10m  (was: 7h)

> Implement multi-active, non blocking for leader host
> ----------------------------------------------------
>
>                 Key: GOBBLIN-1837
>                 URL: https://issues.apache.org/jira/browse/GOBBLIN-1837
>             Project: Apache Gobblin
>          Issue Type: Bug
>          Components: gobblin-service
>            Reporter: Urmi Mustafi
>            Assignee: Abhishek Tiwari
>            Priority: Major
>          Time Spent: 7h 10m
>  Remaining Estimate: 0h
>
> This task will include the implementation of non-blocking, multi-active 
> scheduler for each host. It will NOT include metric emission or unit tests 
> for validation. That will be done in a separate follow-up ticket. The work in 
> this ticket includes
>  * define a table to do scheduler lease determination for each flow's trigger 
> event and related methods to execute actions on this table
>  * update DagActionStore schema and DagActionStoreMonitor to act upon new 
> "LAUNCH" type events in addition to KILL/RESUME
>  * update scheduler/orchestrator logic to apply the non-blocking algorithm 
> when "multi-active scheduler mode" is enabled, otherwise submit events 
> directly to the DagManager after receiving a scheduler trigger
>  * implement the non-blocking algorithm, particularly handling reminder 
> events if another host is in the process of securing the lease for a 
> particular flow trigger



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
