Hello All,

I was wondering if it would be worthwhile for the community to consider support for Apache Kudu as a store ( as a contrib operator inside Apache Malhar ) .

Here are some benefits I see:

1. Kudu is just declared 1.0 and has just been declared production ready.
2. Kudu as a store might a good a fit for many architectures in the
   years to come because of its capabilities to provide mutability of
   data ( unlike HDFS ) and optimized storage formats for scans.
3. It seems to also withstand high-throughput write patterns which
   makes it a stable sink for Apex workflows which operate at very high
   volumes.


Here are some links

 *  From the recent Strata conference
   https://kudu.apache.org/2016/09/26/strata-nyc-kudu-talks.html
 * https://kudu.apache.org/overview.html

I can implement this operator if the community feels it is worth adding it to our code base. If so, could someone please assign the JIRA to me. I have created this JIRA to track this : https://issues.apache.org/jira/browse/APEXMALHAR-2278


Regards,

Ananth

Reply via email to