Repository: kudu Updated Branches: refs/heads/gh-pages e35030439 -> 9a2e15032
Blog: Getting Started with Kudu Change-Id: I9503dca5e6e565be902c4090bf2996bd1960d763 Reviewed-on: http://gerrit.cloudera.org:8080/11136 Reviewed-by: Jordan Birdsell <[email protected]> Tested-by: Jordan Birdsell <[email protected]> Project: http://git-wip-us.apache.org/repos/asf/kudu/repo Commit: http://git-wip-us.apache.org/repos/asf/kudu/commit/9a2e1503 Tree: http://git-wip-us.apache.org/repos/asf/kudu/tree/9a2e1503 Diff: http://git-wip-us.apache.org/repos/asf/kudu/diff/9a2e1503 Branch: refs/heads/gh-pages Commit: 9a2e150324b8f9ee93649d996b83c9067e4c8e47 Parents: e350304 Author: Brock Noland <[email protected]> Authored: Mon Aug 6 21:50:20 2018 -0500 Committer: Jordan Birdsell <[email protected]> Committed: Wed Aug 8 14:27:55 2018 +0000 ---------------------------------------------------------------------- ...etting-started-with-kudu-an-oreilly-title.md | 60 +++++++++++++++++++ ...tting-started-with-kudu-an-oreilly-title.gif | Bin 0 -> 17397 bytes 2 files changed, 60 insertions(+) ---------------------------------------------------------------------- http://git-wip-us.apache.org/repos/asf/kudu/blob/9a2e1503/_posts/2018-08-06-getting-started-with-kudu-an-oreilly-title.md ---------------------------------------------------------------------- diff --git a/_posts/2018-08-06-getting-started-with-kudu-an-oreilly-title.md b/_posts/2018-08-06-getting-started-with-kudu-an-oreilly-title.md new file mode 100644 index 0000000..a896e05 --- /dev/null +++ b/_posts/2018-08-06-getting-started-with-kudu-an-oreilly-title.md @@ -0,0 +1,60 @@ +--- +layout: post +title: Getting Started with Kudu an O'Reilly Title +author: Brock Noland +--- + +The following article by Brock Noland was reposted from the +[phData](https://www.phdata.io/getting-started-with-kudu/) +blog with their permission. + +Five years ago, enabling Data Science and Advanced Analytics on the +Hadoop platform was hard. Organizations required strong Software Engineering +capabilities to successfully implement complex Lambda architectures or even +simply implement continuous ingest. Updating or deleting data, were simplify +nightmare. General Data Protection Regulation (GDPR) would have been an extreme +challenge at that time. + +<!-- more --> + +In that context, on October 11th 2012 Todd Lipcon perform Apache Kudu's initial +commit. The commit message was: + + Code for writing cfiles seems to basically work + Need to write code for reading cfiles, still + +And Kudu development was off and running. Around this same time Todd, on his +internal Wiki page, started listing out the papers he was reading to develop +the theoretical background for creating Kudu. I followed along, reading as many +as I could, understanding little, because I knew Todd was up to something +important. About a year after that initial commit, I got my +[Kudu first commit](https://github.com/apache/kudu/commit/1d7e6864b4a31d3fe6897e4cb484dfcda6608d43), +documenting the upper bound of a library. This is a small contribution of which I am still +proud. + +In the meantime, I was lucky enough to be a founder of a Hadoop Managed Services +and Consulting company known as [phData](http://phdata.io/). We found that a majority +of our customers had use cases which Kudu vastly simplified. Whether it's Change Data +Capture (CDC) from thousands of source tables to Internet of Things (IoT) ingest, Kudu +makes life much easier as both an operator of a Hadoop cluster and a developer providing +business value on the platform. + +Through this work, I was lucky enough to be a co-author of +[Getting Started with Kudu(http://shop.oreilly.com/product/0636920065739.do). +The book is a summation of mine and our co-authors, Jean-Marc Spaggiari, Mladen +Kovacevic, and Ryan Bosshart, learnings while cutting our teeth on early versions +of Kudu. Specifically you will learn: + +* Theoretical understanding of Kudu concepts in simple plain spoken words and simple diagrams +* Why, for many use cases, using Kudu is so much easier than other ecosystem storage technologies +* How Kudu enables Hybrid Transactional/Analytical Processing (HTAP) use cases +* How to design IoT, Predictive Modeling, and Mixed Platform Solutions using Kudu +* How to design Kudu Schemas + +{: .img-responsive} + +Looking forward, I am excited to see Kudu gain additional features and adoption +and eventually the second revision of this title. In the meantime, if you have +feedback or questions, please reach out on the `#getting-started-kudu` channel of +the [Kudu Slack](https://getkudu-slack.herokuapp.com/) or if you prefer non-real-time +communication, please use the user@ mailing list! http://git-wip-us.apache.org/repos/asf/kudu/blob/9a2e1503/img/2018-08-06-getting-started-with-kudu-an-oreilly-title.gif ---------------------------------------------------------------------- diff --git a/img/2018-08-06-getting-started-with-kudu-an-oreilly-title.gif b/img/2018-08-06-getting-started-with-kudu-an-oreilly-title.gif new file mode 100644 index 0000000..2f533d5 Binary files /dev/null and b/img/2018-08-06-getting-started-with-kudu-an-oreilly-title.gif differ
