Repository: incubator-gobblin Updated Branches: refs/heads/master 375c2918b -> 30921bf5c
[GOBBLIN-170] Update documentation to include Apache with Gobblin Project: http://git-wip-us.apache.org/repos/asf/incubator-gobblin/repo Commit: http://git-wip-us.apache.org/repos/asf/incubator-gobblin/commit/21853386 Tree: http://git-wip-us.apache.org/repos/asf/incubator-gobblin/tree/21853386 Diff: http://git-wip-us.apache.org/repos/asf/incubator-gobblin/diff/21853386 Branch: refs/heads/master Commit: 218533860c468788f4b70c20a26977e53454e2b2 Parents: 2580b28 Author: Abhishek Tiwari <[email protected]> Authored: Thu Jul 27 15:45:03 2017 -0700 Committer: Abhishek Tiwari <[email protected]> Committed: Thu Jul 27 15:45:03 2017 -0700 ---------------------------------------------------------------------- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) ---------------------------------------------------------------------- http://git-wip-us.apache.org/repos/asf/incubator-gobblin/blob/21853386/README.md ---------------------------------------------------------------------- diff --git a/README.md b/README.md index a60f1e1..b513c70 100644 --- a/README.md +++ b/README.md @@ -1,6 +1,6 @@ # Apache Gobblin [](https://travis-ci.org/linkedin/gobblin) [](http://gobblin.readthedocs.org/en/latest/?badge=latest) -Apache Gobblin is a universal data ingestion framework for extracting, transforming, and loading large volume of data from a variety of data sources, e.g., databases, rest APIs, FTP/SFTP servers, filers, etc., onto Hadoop. Gobblin handles the common routine tasks required for all data ingestion ETLs, including job/task scheduling, task partitioning, error handling, state management, data quality checking, data publishing, etc. Gobblin ingests data from different data sources in the same execution framework, and manages metadata of different sources all in one place. This, combined with other features such as auto scalability, fault tolerance, data quality assurance, extensibility, and the ability of handling data model evolution, makes Gobblin an easy-to-use, self-serving, and efficient data ingestion framework. +Apache Gobblin is a universal data ingestion framework for extracting, transforming, and loading large volume of data from a variety of data sources, e.g., databases, rest APIs, FTP/SFTP servers, filers, etc., onto Hadoop. Apache Gobblin handles the common routine tasks required for all data ingestion ETLs, including job/task scheduling, task partitioning, error handling, state management, data quality checking, data publishing, etc. Gobblin ingests data from different data sources in the same execution framework, and manages metadata of different sources all in one place. This, combined with other features such as auto scalability, fault tolerance, data quality assurance, extensibility, and the ability of handling data model evolution, makes Gobblin an easy-to-use, self-serving, and efficient data ingestion framework. # Quick Links
