This is an automated email from the ASF dual-hosted git repository.
aplex pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/gobblin.git
The following commit(s) were added to refs/heads/master by this push:
new 7d7bf14 [GOBBLIN-1363] Fix minor mistakes in the Getting-Started
guide (#3204)
7d7bf14 is described below
commit 7d7bf14b192a41bea4daad7d86e69821a52906d3
Author: Kengo Seki <[email protected]>
AuthorDate: Fri Jul 16 04:13:27 2021 +0900
[GOBBLIN-1363] Fix minor mistakes in the Getting-Started guide (#3204)
Fixed the following mistakes/obsolete descriptions in the Getting Started
document.
JDK version that Gobblin requires
Gradle task name for Findbugs
Broken links to other pages
Wrong download URL for avro-tools
---
gobblin-docs/Getting-Started.md | 21 +++++++++++----------
1 file changed, 11 insertions(+), 10 deletions(-)
diff --git a/gobblin-docs/Getting-Started.md b/gobblin-docs/Getting-Started.md
index 9f1005d..8bfeeba 100644
--- a/gobblin-docs/Getting-Started.md
+++ b/gobblin-docs/Getting-Started.md
@@ -6,13 +6,13 @@ Table of Contents
# Introduction
-This guide will help you setup Gobblin, and run your first job. Currently,
Gobblin requires JDK 7 or later to run.
+This guide will help you setup Gobblin, and run your first job. Currently,
Gobblin requires JDK 8 or later to run.
# Getting a Gobblin Release
-All steps in this page assume you are using a Apache Gobblin source
distribution.
+All steps in this page assume you are using a Apache Gobblin source
distribution.
-Download source distribution from [here](/download/).
+Download source distribution from [here](/download/).
## Building a Distribution
@@ -23,7 +23,7 @@ cd /path/to/gobblin/source
./gradlew :gobblin-distribution:buildDistributionTar
```
-Note: A full build takes time because it runs other tasks like test, javadoc,
findMainBugs, etc, which impacts the build performance.
+Note: A full build takes time because it runs other tasks like test, javadoc,
findbugsMain, etc, which impacts the build performance.
For a quick usage, building distribution is good enough. However a full build
can be easily made by running:
```bash
./gradlew build
@@ -31,13 +31,13 @@ For a quick usage, building distribution is good enough.
However a full build ca
The generated distribution contains the binary in a specific directory
structure, which is different from source.
-After the build is done, there should be a tarball (if there are multiple, use
the newest one) at
+After the build is done, there should be a tarball (if there are multiple, use
the newest one) at
`build/gobblin-distribution/distributions/`
Distributions built from source are generated as `*.tar.gz` files. After
getting the tarball, unpackage it locally:
-`tar -xvf gobblin-distribution-[VERSION].tar.gz`.
+`tar -xvf gobblin-distribution-[VERSION].tar.gz`.
# Run Your First Job
@@ -85,19 +85,20 @@ Each Gobblin job minimally involves several constructs,
e.g. [Source](https://gi
Some of the classes relevant to this example include
[WikipediaSource](https://github.com/apache/gobblin/blob/master/gobblin-example/src/main/java/org/apache/gobblin/example/wikipedia/WikipediaSource.java),
[WikipediaExtractor](https://github.com/apache/gobblin/blob/master/gobblin-example/src/main/java/org/apache/gobblin/example/wikipedia/WikipediaExtractor.java),
[WikipediaConverter](https://github.com/apache/gobblin/blob/master/gobblin-example/src/main/java/org/apache/gobblin/example/w
[...]
+
To run Gobblin in standalone daemon mode we need a Gobblin configuration file
(such as uses
[application.conf](https://github.com/apache/gobblin/blob/master/conf/standalone/application.conf)).
And for each job we wish to run, we also need a job configuration file (such
as
[wikipedia.pull](https://github.com/apache/gobblin/blob/master/gobblin-example/src/main/resources/wikipedia.pull)).
The Gobblin configuration file, which is passed to Gobblin as a command line
argument, should contain a [...]
-A list of commonly used configuration properties can be found here:
[Configuration Properties
Glossary](user-guide/Configuration-Properties-Glossary).
+A list of commonly used configuration properties can be found here:
[Configuration Properties
Glossary](/user-guide/Configuration-Properties-Glossary).
## Steps
* Create a folder to store the job configuration file. Put
[wikipedia.pull](https://github.com/apache/gobblin/blob/master/gobblin-example/src/main/resources/wikipedia.pull)
in this folder, and set environment variable `GOBBLIN_JOB_CONFIG_DIR` to point
to this folder. Also, make sure that the environment variable `JAVA_HOME` is
set correctly.
-* Create a folder as Gobblin's working directory. Gobblin will write job
output as well as other information there, such as locks and state-store (for
more information, see the [Standalone
Deployment](user-guide/Gobblin-Deployment#Standalone-Deployment) page). Set
environment variable `GOBBLIN_WORK_DIR` to point to that folder.
+* Create a folder as Gobblin's working directory. Gobblin will write job
output as well as other information there, such as locks and state-store (for
more information, see the [Standalone
Deployment](/user-guide/Gobblin-Deployment#standalone-architecture) page). Set
environment variable `GOBBLIN_WORK_DIR` to point to that folder.
* Unpack Gobblin distribution:
-* Launch Gobblin in one of the execution mode [for more info refer:
[Gobblin-CLI](/gobblin-docs/user-guide/Gobblin-CLI.md)] :
+* Launch Gobblin in one of the execution mode [for more info refer:
[Gobblin-CLI](/user-guide/Gobblin-CLI)] :
```bash
gobblin service standalone start
@@ -141,7 +142,7 @@ The job output is written in `GOBBLIN_WORK_DIR/job-output`
folder as an Avro fil
To see the content of the job output, use the Avro tools to convert Avro to
JSON. Download the latest version of Avro tools (e.g. avro-tools-1.8.1.jar):
```bash
-curl -O
http://central.maven.org/maven2/org/apache/avro/avro-tools/1.8.1/avro-tools-1.8.1.jar
+curl -O
https://repo.maven.apache.org/maven2/org/apache/avro/avro-tools/1.8.1/avro-tools-1.8.1.jar
```
and run