Hello All,
I was wondering if it would be worthwhile for the community to consider
adding support for Apache Kudu as a store (as a contrib operator inside
Apache Malhar).
Here are some benefits I see:
1. Kudu has just reached 1.0 and has been declared production ready.
2. Kudu as a store might be a good fit for many architectures in the
years to come because of its ability to provide mutability of
data (unlike HDFS) and its storage formats optimized for scans.
3. It also seems to withstand high-throughput write patterns, which
would make it a stable sink for Apex workflows that operate at very high
volumes.
Here are some links:
* From the recent Strata conference
https://kudu.apache.org/2016/09/26/strata-nyc-kudu-talks.html
* https://kudu.apache.org/overview.html
I can implement this operator if the community feels it is worth adding
to our code base. If so, could someone please assign the JIRA to me?
I have created this JIRA to track this:
https://issues.apache.org/jira/browse/APEXMALHAR-2278
Regards,
Ananth