Repository: incubator-gobblin Updated Branches: refs/heads/master 2621f5e6b -> bd991d781
[GOBBLIN-0] Second Apache repo commit Project: http://git-wip-us.apache.org/repos/asf/incubator-gobblin/repo Commit: http://git-wip-us.apache.org/repos/asf/incubator-gobblin/commit/bd991d78 Tree: http://git-wip-us.apache.org/repos/asf/incubator-gobblin/tree/bd991d78 Diff: http://git-wip-us.apache.org/repos/asf/incubator-gobblin/diff/bd991d78 Branch: refs/heads/master Commit: bd991d7816d3aa2089905cc1f74ae3564493d1ff Parents: 2621f5e Author: Abhishek Tiwari <[email protected]> Authored: Wed Jul 26 15:58:58 2017 -0700 Committer: Abhishek Tiwari <[email protected]> Committed: Wed Jul 26 15:58:58 2017 -0700 ---------------------------------------------------------------------- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) ---------------------------------------------------------------------- http://git-wip-us.apache.org/repos/asf/incubator-gobblin/blob/bd991d78/README.md ---------------------------------------------------------------------- diff --git a/README.md b/README.md index 7671a7f..a60f1e1 100644 --- a/README.md +++ b/README.md @@ -1,6 +1,6 @@ # Apache Gobblin [](https://travis-ci.org/linkedin/gobblin) [](http://gobblin.readthedocs.org/en/latest/?badge=latest) -Gobblin is a universal data ingestion framework for extracting, transforming, and loading large volume of data from a variety of data sources, e.g., databases, rest APIs, FTP/SFTP servers, filers, etc., onto Hadoop. Gobblin handles the common routine tasks required for all data ingestion ETLs, including job/task scheduling, task partitioning, error handling, state management, data quality checking, data publishing, etc. Gobblin ingests data from different data sources in the same execution framework, and manages metadata of different sources all in one place. This, combined with other features such as auto scalability, fault tolerance, data quality assurance, extensibility, and the ability of handling data model evolution, makes Gobblin an easy-to-use, self-serving, and efficient data ingestion framework. +Apache Gobblin is a universal data ingestion framework for extracting, transforming, and loading large volume of data from a variety of data sources, e.g., databases, rest APIs, FTP/SFTP servers, filers, etc., onto Hadoop. Gobblin handles the common routine tasks required for all data ingestion ETLs, including job/task scheduling, task partitioning, error handling, state management, data quality checking, data publishing, etc. Gobblin ingests data from different data sources in the same execution framework, and manages metadata of different sources all in one place. This, combined with other features such as auto scalability, fault tolerance, data quality assurance, extensibility, and the ability of handling data model evolution, makes Gobblin an easy-to-use, self-serving, and efficient data ingestion framework. # Quick Links
