Hello All,
I would like to get the community's opinion on the implementation of a
Kudu output operator. A first-cut implementation was made available in
November last year, but I guess we did not get time to discuss it
thoroughly on the mailing list, and hence the PR did not get merged.
This operator would allow Apex to stream data into Kudu. A brief
description of Kudu is here: https://kudu.apache.org/. At a high level,
this would enable the following use cases from an Apex point of view:
- Low-latency writes into the Kudu store, which supports SQL queries on
the stored data. This essentially means sub-second data updates become
available for SQL querying. As opposed to Parquet-style data dumps,
which typically need a few minutes to accumulate enough data to take
advantage of the Parquet format, this would enable same-second queries
on very large datasets in Kudu with Impala.
- Another very interesting use case would be to use Kudu as a source
store to stream from based on SQL queries. The Kudu input operator is
tracked in a separate JIRA
(https://issues.apache.org/jira/browse/APEXMALHAR-2472) and would cover
mechanisms to stream data from Kudu into Apex. This would bring in
interesting use cases like de-duplication, selective streaming, and
out-of-band data handling in a different way if Kudu is part of the
ecosystem in a given setup.
Here is the design of the Kudu output operator:
1. The operator would be an abstract operator and would allow concrete
implementations to set a few of its behavioral aspects.
2. The following are the major phases of the operator (a skeleton of
these phases is sketched after this item):
During the setup() phase of the operator: fetch the current window
information and use it to decide whether we are recovering from a
failure (see point 8 below).
During the activate() phase of the operator: establish a connection to
the cluster and get the metadata about the table that is being used as
the sink.
During process() of the input port: inspect the incoming Kudu Execution
Context tuple (see point 6 below) and perform one of the operations
(Insert/Update/Delete/Upsert).
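To make the lifecycle concrete, here is a minimal sketch of what the
abstract operator could look like, assuming the Apex ActivationListener
contract and the Kudu Java client. The class, field and method names
are illustrative, not the actual PR code:

  import com.datatorrent.api.Context.OperatorContext;
  import com.datatorrent.api.Operator;
  import com.datatorrent.common.util.BaseOperator;
  import java.util.Collections;
  import java.util.Map;
  import org.apache.kudu.client.KuduClient;
  import org.apache.kudu.client.KuduException;
  import org.apache.kudu.client.KuduSession;
  import org.apache.kudu.client.KuduTable;

  public abstract class AbstractKuduOutputOperator extends BaseOperator
      implements Operator.ActivationListener<OperatorContext>
  {
    private String masterAddresses; // e.g. "kudu-master-1:7051,kudu-master-2:7051"
    private String tableName;
    protected transient KuduClient kuduClient;
    protected transient KuduTable kuduTable;
    protected transient KuduSession kuduSession;
    protected transient long activationWindowId;

    @Override
    public void setup(OperatorContext context)
    {
      // A non-default activation window id means we are restarting after
      // a failure and may need the reconciling mode described in point 8.
      activationWindowId = context.getValue(OperatorContext.ACTIVATION_WINDOW_ID);
    }

    @Override
    public void activate(OperatorContext context)
    {
      // Establish the connection and cache the table metadata that
      // drives the POJO-field-to-column mapping (points 4 and 5).
      kuduClient = new KuduClient.KuduClientBuilder(masterAddresses).build();
      try {
        kuduTable = kuduClient.openTable(tableName);
        kuduSession = kuduClient.newSession();
      } catch (KuduException e) {
        throw new RuntimeException("Could not open Kudu table " + tableName, e);
      }
    }

    @Override
    public void deactivate()
    {
      try {
        kuduClient.close();
      } catch (KuduException e) {
        throw new RuntimeException(e);
      }
    }

    // Hook for point 5: concrete implementations may override the default
    // POJO-field-to-column-name mapping (method name is illustrative).
    protected Map<String, String> getFieldNameToColumnNameOverrides()
    {
      return Collections.emptyMap();
    }
  }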
3. The following parameters are tunable while establishing a Kudu
connection: table name, boss threads, worker threads, socket read
timeouts and the external consistency mode (see the sketch below).
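As an illustration, these knobs could map onto the Kudu Java client
roughly as follows; this is a sketch assuming the Kudu client's builder
and session API, and the values shown are arbitrary:

  import org.apache.kudu.client.ExternalConsistencyMode;
  import org.apache.kudu.client.KuduClient;
  import org.apache.kudu.client.KuduException;
  import org.apache.kudu.client.KuduSession;
  import org.apache.kudu.client.KuduTable;

  public class KuduConnectionExample
  {
    public static void main(String[] args) throws KuduException
    {
      KuduClient client = new KuduClient.KuduClientBuilder("kudu-master:7051")
          .bossCount(1)                        // boss threads
          .workerCount(4)                      // worker threads
          .defaultSocketReadTimeoutMs(10_000)  // socket read timeout
          .build();
      KuduSession session = client.newSession();
      session.setExternalConsistencyMode(ExternalConsistencyMode.CLIENT_PROPAGATED);
      KuduTable table = client.openTable("transactions"); // table name
      client.close();
    }
  }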
4. The user need not specify any schema outright. The POJO fields are
automatically mapped to the table column names as identified by the
schema parsed in the activate() phase.
5. Allow the concrete implementation of the operator to override the
mapping of a POJO field name to a table schema column name (see the
sketch below). This would allow flexibility in use cases where the
table schema column names are not compatible with Java bean frameworks,
or in situations where the field names cannot be controlled because the
POJO is coming from an upstream operator.
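For example, a concrete implementation might override the mapping like
this, using the hypothetical getFieldNameToColumnNameOverrides() hook
shown in the skeleton above:

  import java.util.HashMap;
  import java.util.Map;

  public class TransactionsKuduOutputOperator extends AbstractKuduOutputOperator
  {
    @Override
    protected Map<String, String> getFieldNameToColumnNameOverrides()
    {
      Map<String, String> overrides = new HashMap<>();
      // POJO field "txnId" persists to the Kudu column "transaction-id",
      // which is not a legal Java bean property name.
      overrides.put("txnId", "transaction-id");
      return overrides;
    }
  }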
6. The input tuple that is to be supplied to this operator is of type
"Kudu Execution Context". This tuple encompasses the actual POJO that
is going to be persisted to the Kudu store. Additionally, it allows the
upstream operator to specify the operation that needs to be performed.
One of the following operations is permitted as part of the context:
Insert, Upsert, Update or Delete on the POJO that is acting as the
payload in the execution context (see the sketch below).
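A minimal sketch of what this tuple could look like (names are
illustrative):

  public class KuduExecutionContext<T>
  {
    public enum KuduMutationType { INSERT, UPSERT, UPDATE, DELETE }

    private KuduMutationType mutationType;
    private T payload; // the POJO to persist (point 7)

    public KuduMutationType getMutationType() { return mutationType; }
    public void setMutationType(KuduMutationType type) { this.mutationType = type; }
    public T getPayload() { return payload; }
    public void setPayload(T payload) { this.payload = payload; }
  }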
7. The concrete implementation of the operator would allow the user to
specify the actual POJO class definition that would be used to write to
the table. The execution context would contain this POJO as well as the
metadata that defines the processing that needs to be done on that
tuple (see the dispatch sketch below).
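Putting points 2, 6 and 7 together, the input port's process() could
dispatch on the context roughly like this. It is a fragment that would
live inside AbstractKuduOutputOperator, with the POJO-to-row field
copying elided:

  // Needs imports for com.datatorrent.api.DefaultInputPort,
  // org.apache.kudu.client.Operation and org.apache.kudu.client.KuduException.
  public final transient DefaultInputPort<KuduExecutionContext<Object>> input =
      new DefaultInputPort<KuduExecutionContext<Object>>()
  {
    @Override
    public void process(KuduExecutionContext<Object> executionContext)
    {
      Operation op;
      switch (executionContext.getMutationType()) {
        case INSERT: op = kuduTable.newInsert(); break;
        case UPSERT: op = kuduTable.newUpsert(); break;
        case UPDATE: op = kuduTable.newUpdate(); break;
        case DELETE: op = kuduTable.newDelete(); break;
        default: throw new IllegalArgumentException("Unsupported mutation type");
      }
      // Copy the POJO fields from executionContext.getPayload() into
      // op.getRow(), honoring the column-name overrides from point 5.
      try {
        kuduSession.apply(op);
      } catch (KuduException e) {
        throw new RuntimeException(e);
      }
    }
  };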
8. The operator would allow for a special execution mode for the first
window that is processed as the operator gets activated. There are two
modes for the first window of processing by the operator:
a. Safe mode: the "happy path" execution, in which no extra
processing is required to perform the Kudu mutation.
b. Reconciling mode: an additional function is called to see
whether the user would like the tuple to be used for the mutation.
This mode is automatically set when OperatorContext.ACTIVATION_WINDOW_ID
!= Stateless.WINDOW_ID during the first window of processing by the
operator.
This feature is deemed useful when an operator is recovering from a
crashed instance of the application and we do not want to perform
multiple mutations of the same tuple, given that AT_LEAST_ONCE is the
default processing semantics. (A sketch of this follows.)
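A rough sketch of the mode decision and the reconciliation hook; the
hook name isEligibleForReplay() is hypothetical, and the window
comparison is an assumption about how replayed windows relate to the
activation window:

  // Inside AbstractKuduOutputOperator; Stateless is
  // com.datatorrent.api.annotation.Stateless.
  private transient boolean reconcilingMode;

  @Override
  public void beginWindow(long windowId)
  {
    // Reconciling mode applies only while re-processing windows up to
    // the activation window after a restart; afterwards the operator is
    // back on the safe "happy path".
    reconcilingMode = (activationWindowId != Stateless.WINDOW_ID)
        && (windowId <= activationWindowId);
  }

  // Consulted from process() for every tuple while in reconciling mode;
  // a concrete operator can decide that a mutation was already applied
  // before the crash and should be skipped.
  protected boolean isEligibleForReplay(KuduExecutionContext<?> executionContext)
  {
    return true; // default: replay everything, per AT_LEAST_ONCE semantics
  }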
9. The operator is stateless.
10. The operator would generate the following autometrics (see the
sketch below):
a. Counts of inserts, upserts, deletes and updates (a separate
counter for each mutation type) in a given window
b. Bytes written in a given window
c. Write RPCs in a given window
d. Total RPC errors in a given window
e. All of the above metrics accumulated over the entire lifetime
of the operator.
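For reference, a sketch of how the per-window metrics could be exposed
through Apex's @AutoMetric annotation; the field names are
illustrative:

  import com.datatorrent.api.AutoMetric;

  // Fields inside AbstractKuduOutputOperator; the per-window fields
  // would be reset in beginWindow(), alongside the mode decision from
  // point 8, while the lifetime fields keep accumulating.
  @AutoMetric
  private long insertsInWindow;
  @AutoMetric
  private long upsertsInWindow;
  @AutoMetric
  private long updatesInWindow;
  @AutoMetric
  private long deletesInWindow;
  @AutoMetric
  private long bytesWrittenInWindow;
  @AutoMetric
  private long writeRpcsInWindow;
  @AutoMetric
  private long rpcErrorsInWindow;
  @AutoMetric
  private long totalInserts; // lifetime counterpart; one per metric above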
Could you please provide your thoughts on whether the above design
looks good?
Regards,
Ananth