This is an automated email from the ASF dual-hosted git repository.
kirs pushed a commit to branch dev
in repository https://gitbox.apache.org/repos/asf/incubator-seatunnel.git
The following commit(s) were added to refs/heads/dev by this push:
new 2b84996 [doc] Move introduction.md from website (#1563)
2b84996 is described below
commit 2b849966de47638803b176941ca1030b86743367
Author: Jiajie Zhong <[email protected]>
AuthorDate: Fri Mar 25 16:47:28 2022 +0800
[doc] Move introduction.md from website (#1563)
* [doc] Move introduction.md from website
Doc introduction.md is in repository seatunnel-website,
and this patch move it to main repo to keep our docs
together which make more sense and maintainable.
* fix dead link
---
docs/en/images/seatunnel-workflow.svg | 4 +
docs/en/introduction.md | 169 ++++++++++++++++++++++++++++++++++
2 files changed, 173 insertions(+)
diff --git a/docs/en/images/seatunnel-workflow.svg
b/docs/en/images/seatunnel-workflow.svg
new file mode 100644
index 0000000..7280e4a
--- /dev/null
+++ b/docs/en/images/seatunnel-workflow.svg
@@ -0,0 +1,4 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!-- Do not edit this file with editors other than diagrams.net -->
+<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"
"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
+<svg xmlns="http://www.w3.org/2000/svg"
xmlns:xlink="http://www.w3.org/1999/xlink" version="1.1" width="622px"
height="718px" viewBox="-0.5 -0.5 622 718" content="<mxfile
host="Electron" modified="2021-12-30T15:17:57.852Z"
agent="5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML,
like Gecko) draw.io/15.4.0 Chrome/91.0.4472.164 Electron/13.5.0
Safari/537.36" etag="y11mgoacIhryQ4lqCp5C"
version="15.4.0" type="device& [...]
\ No newline at end of file
diff --git a/docs/en/introduction.md b/docs/en/introduction.md
new file mode 100644
index 0000000..7cfff2f
--- /dev/null
+++ b/docs/en/introduction.md
@@ -0,0 +1,169 @@
+---
+title: Introduction
+sidebar_position: 1
+---
+
+# SeaTunnel
+
+<img src="https://seatunnel.apache.org/image/logo.png" alt="seatunnel logo"
width="200px" height="200px" align="right" />
+
+[](https://join.slack.com/t/apacheseatunnel/shared_invite/zt-123jmewxe-RjB_DW3M3gV~xL91pZ0oVQ)
+[](https://twitter.com/ASFSeaTunnel)
+
+---
+
+SeaTunnel was formerly named Waterdrop , and renamed SeaTunnel since October
12, 2021.
+
+---
+
+SeaTunnel is a very easy-to-use ultra-high-performance distributed data
integration platform that supports real-time
+synchronization of massive data. It can synchronize tens of billions of data
stably and efficiently every day, and has
+been used in the production of nearly 100 companies.
+
+## Why do we need SeaTunnel
+
+SeaTunnel will do its best to solve the problems that may be encountered in
the synchronization of massive data:
+
+- Data loss and duplication
+- Task accumulation and delay
+- Low throughput
+- Long cycle to be applied in the production environment
+- Lack of application running status monitoring
+
+## SeaTunnel use scenarios
+
+- Mass data synchronization
+- Mass data integration
+- ETL with massive data
+- Mass data aggregation
+- Multi-source data processing
+
+## Features of SeaTunnel
+
+- Easy to use, flexible configuration, low code development
+- Real-time streaming
+- Offline multi-source data analysis
+- High-performance, massive data processing capabilities
+- Modular and plug-in mechanism, easy to extend
+- Support data processing and aggregation by SQL
+- Support Spark structured streaming
+- Support Spark 2.x
+
+## Workflow of SeaTunnel
+
+
+
+```
+Source[Data Source Input] -> Transform[Data Processing] -> Sink[Result Output]
+```
+
+The data processing pipeline is constituted by multiple filters to meet a
variety of data processing needs. If you are
+accustomed to SQL, you can also directly construct a data processing pipeline
by SQL, which is simple and efficient.
+Currently, the filter list supported by SeaTunnel is still being expanded.
Furthermore, you can develop your own data
+processing plug-in, because the whole system is easy to expand.
+
+## Plugins supported by SeaTunnel
+
+- Input plugin Fake, File, Hdfs, Kafka, Druid, InfluxDB, S3, Socket,
self-developed Input plugin
+
+- Filter plugin Add, Checksum, Convert, Date, Drop, Grok, Json, Kv, Lowercase,
Remove, Rename, Repartition, Replace,
+ Sample, Split, Sql, Table, Truncate, Uppercase, Uuid, Self-developed Filter
plugin
+
+- Output plugin Elasticsearch, File, Hdfs, Jdbc, Kafka, Druid, InfluxDB,
Mysql, S3, Stdout, self-developed Output plugin
+
+## Environmental dependency
+
+1. java runtime environment, java >= 8
+
+2. If you want to run SeaTunnel in a cluster environment, any of the following
Spark cluster environments is usable:
+
+- Spark on Yarn
+- Spark Standalone
+
+If the data volume is small, or the goal is merely for functional
verification, you can also start in local mode without
+a cluster environment, because SeaTunnel supports standalone operation. Note:
SeaTunnel 2.0 supports running on Spark
+and Flink.
+
+## Downloads
+
+Download address for run-directly software package
:https://github.com/apache/incubator-seatunnel/releases
+
+## Quick start
+
+**Spark**
+https://seatunnel.apache.org/docs/spark/quick-start
+
+**Flink**
+https://seatunnel.apache.org/docs/flink/quick-start
+
+Detailed documentation on SeaTunnel
+https://seatunnel.apache.org/docs/introduction
+
+## Application practice cases
+
+- Weibo, Value-added Business Department Data Platform
+
+Weibo business uses an internal customized version of SeaTunnel and its
sub-project Guardian for SeaTunnel On Yarn task
+monitoring for hundreds of real-time streaming computing tasks.
+
+- Sina, Big Data Operation Analysis Platform
+
+Sina Data Operation Analysis Platform uses SeaTunnel to perform real-time and
offline analysis of data operation and
+maintenance for Sina News, CDN and other services, and write it into
Clickhouse.
+
+- Sogou, Sogou Qiqian System
+
+Sogou Qiqian System takes SeaTunnel as an ETL tool to help establish a
real-time data warehouse system.
+
+- Qutoutiao, Qutoutiao Data Center
+
+Qutoutiao Data Center uses SeaTunnel to support mysql to hive offline ETL
tasks, real-time hive to clickhouse backfill
+technical support, and well covers most offline and real-time tasks needs.
+
+- Yixia Technology, Yizhibo Data Platform
+
+- Yonghui Superstores Founders' Alliance-Yonghui Yunchuang Technology, Member
E-commerce Data Analysis Platform
+
+SeaTunnel provides real-time streaming and offline SQL computing of e-commerce
user behavior data for Yonghui Life, a
+new retail brand of Yonghui Yunchuang Technology.
+
+- Shuidichou, Data Platform
+
+Shuidichou adopts SeaTunnel to do real-time streaming and regular offline
batch processing on Yarn, processing 3~4T data
+volume average daily, and later writing the data to Clickhouse.
+
+- Tencent Cloud
+
+Collecting various logs from business services into Apache Kafka, some of the
data in Apache Kafka is consumed and extracted through Seatunnel, and then
store into Clickhouse.
+
+For more use cases, please refer to: https://seatunnel.apache.org/blog
+
+## Code of conduct
+
+This project adheres to the Contributor Covenant [code of
conduct](https://www.apache.org/foundation/policies/conduct).
+By participating, you are expected to uphold this code. Please follow
+the [REPORTING
GUIDELINES](https://www.apache.org/foundation/policies/conduct#reporting-guidelines)
to report
+unacceptable behavior.
+
+## Developer
+
+Thanks to all developers!
+
+[](https://github.com/apache/incubator-seatunnel/graphs/contributors)
+
+## Contact Us
+
+* Mail list: **[email protected]**. Mail to
`[email protected]`, follow the reply to subscribe
+ the mail list.
+* Slack:
https://join.slack.com/t/apacheseatunnel/shared_invite/zt-123jmewxe-RjB_DW3M3gV~xL91pZ0oVQ
+* Twitter: https://twitter.com/ASFSeaTunnel
+* [Bilibili](https://space.bilibili.com/1542095008) (for Chinese users)
+
+## Landscapes
+
+<p align="center">
+<br/><br/>
+<img src="https://landscape.cncf.io/images/left-logo.svg" width="150"
alt=""/> <img src="https://landscape.cncf.io/images/right-logo.svg"
width="200" alt=""/>
+<br/><br/>
+SeaTunnel enriches the <a
href="https://landscape.cncf.io/landscape=observability-and-analysis&license=apache-license-2-0">CNCF
CLOUD NATIVE Landscape.</a >
+</p >