[incubator-devlake-website] branch main updated: docs: update scope and connection definition (#559)

klesh Wed, 14 Jun 2023 19:50:42 -0700

This is an automated email from the ASF dual-hosted git repository.

klesh pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/incubator-devlake-website.git



The following commit(s) were added to refs/heads/main by this push:
     new 422da1cbe2 docs: update scope and connection definition (#559)
422da1cbe2 is described below

commit 422da1cbe2e84c3f7ea17ad378a2d9f2e37cb332
Author: Louis.z <[email protected]>
AuthorDate: Thu Jun 15 10:50:21 2023 +0800

    docs: update scope and connection definition (#559)
    
    * docs: update scope and connection definition
    
    * docs: update data scope definition
    
    ---------
    
    Co-authored-by: Startrekzky <[email protected]>
---
 docs/Overview/KeyConcepts.md                         | 20 ++++++++------------
 versioned_docs/version-v0.17/Overview/KeyConcepts.md | 18 +++++++-----------
 2 files changed, 15 insertions(+), 23 deletions(-)

diff --git a/docs/Overview/KeyConcepts.md b/docs/Overview/KeyConcepts.md
index b33dc16c9a..dcd4739760 100644
--- a/docs/Overview/KeyConcepts.md
+++ b/docs/Overview/KeyConcepts.md
@@ -1,12 +1,12 @@
 ---
 sidebar_position: 4
 title: "Key Concepts"
-linkTitle: "KeyConepts"
+linkTitle: "KeyConcepts"
 description: >
   DevLake Key Concepts
 ---
 
-*Last updated: May 16 2022*
+*Last updated: June 13, 2023*
 
 
 ## In Configuration UI (Regular Mode)
@@ -27,30 +27,26 @@ For example, when a user associates 'Jenkins Job A' and  
'Jira board B' with pro
 3. **Adding [Transformation Rules](#transformation-rules) (optional)**: You 
can optionally apply transformation for the data scope you have just selected, 
in order to view more advanced metrics.
 3. **Setting the Sync Frequency**: You can specify the sync frequency for your 
blueprint to achieve recurring data syncs and transformation. Alternatively, 
you can set the frequency to manual if you wish to run the tasks in the 
blueprint manually.
 
-The relationship among Blueprint, Data Connections, Data Scope and 
Transformation Rules is explained as follows:
+The relationship between Blueprint, Data Connections, Data Scope and 
Transformation Rules is explained as follows:
 
 ![Blueprint ERD](/img/Glossary/blueprint-erd.svg)
 - Each blueprint can have multiple data connections.
-- Each data connection can have multiple sets of data scope.
+- Each data connection can have multiple data scopes.
 - Each set of data scope only consists of one GitHub/GitLab project or Jira 
board, along with their corresponding data entities.
 - Each set of data scope can only have one set of transformation rules.
 
 ### Data Sources
 **A data source is a specific DevOps tool from which you wish to sync your 
data, such as GitHub, GitLab, Jira and Jenkins.**
 
-DevLake normally uses one [data plugin](#data-plugins) to pull data for a 
single data source. However, in some cases, DevLake uses multiple data plugins 
for one data source for the purpose of improved sync speed, among many other 
advantages. For instance, when you pull data from GitHub or GitLab, aside from 
the GitHub or GitLab plugin, Git Extractor is also used to pull data from the 
repositories. In this case, DevLake still refers GitHub or GitLab as a single 
data source.
+DevLake normally uses one [data plugin](#data-plugins) to pull data for a 
single data source. However, in some cases, DevLake uses multiple data plugins 
for one data source for a better sync speed, among many other advantages. For 
instance, when you pull data from GitHub or GitLab, aside from the GitHub or 
GitLab plugin, Git Extractor is also used to pull data from the repositories. 
In this case, DevLake still recognizes GitHub or GitLab as a single data source.
 
 ### Data Connections
 **A data connection is a specific instance of a data source that stores 
information such as `endpoint` and `auth`.** A single data source can have one 
or more data connections (e.g. two Jira instances). Currently, DevLake supports 
one data connection for GitHub, GitLab and Jenkins, and multiple connections 
for Jira.
 
-You can set up a new data connection either during the first step of creating 
a blueprint, or in the Connections page that can be accessed from the 
navigation bar. Because one single data connection can be reused in multiple 
blueprints, you can update the information of a particular data connection in 
Connections, to ensure all its associated blueprints will run properly. For 
example, you may want to update your GitHub token in a data connection if it 
goes expired.
+The recommended way to set up a new data connection is via the Data 
Connections page. You can then add the data connection to a DevLake project to 
measure metrics later. A data connection can be used in multiple projects.
 
-### Data Scope
-**In a blueprint, each data connection can have multiple sets of data scope 
configurations, including GitHub or GitLab projects, Jira boards and their 
corresponding [data entities](#data-entities).** The fields for data scope 
configuration vary according to different data sources.
-
-Each set of data scope refers to one GitHub or GitLab project, or one Jira 
board and the data entities you would like to sync for them, for the 
convenience of applying transformation in the next step. For instance, if you 
wish to sync 5 GitHub projects, you will have 5 sets of data scope for GitHub.
-
-To learn more about the default data scope of all data sources and data 
plugins, please refer to [Supported Data Sources](./SupportedDataSources.md).
+### Data Scope(s)
+**A data scope is the top-level "container" in a data source**. For example, a 
data scope for Jira is a Jira board, for TAPD is a TAPD workspace, for 
GitHub/GitLab/BitBucket is a repo, for Jenkins is a Jenkins job, etc. You can 
add multiple data scopes to a data connection to determine which data to 
collect. Data scopes vary for different data sources.
 
 ### Data Entities
 **Data entities refer to the data fields from one of the five data domains: 
Issue Tracking, Source Code Management, Code Review, CI/CD and Cross-Domain.**
diff --git a/versioned_docs/version-v0.17/Overview/KeyConcepts.md 
b/versioned_docs/version-v0.17/Overview/KeyConcepts.md
index b33dc16c9a..63ddd72b63 100644
--- a/versioned_docs/version-v0.17/Overview/KeyConcepts.md
+++ b/versioned_docs/version-v0.17/Overview/KeyConcepts.md
@@ -6,7 +6,7 @@ description: >
   DevLake Key Concepts
 ---
 
-*Last updated: May 16 2022*
+*Last updated: June 13, 2023*
 
 
 ## In Configuration UI (Regular Mode)
@@ -27,30 +27,26 @@ For example, when a user associates 'Jenkins Job A' and  
'Jira board B' with pro
 3. **Adding [Transformation Rules](#transformation-rules) (optional)**: You 
can optionally apply transformation for the data scope you have just selected, 
in order to view more advanced metrics.
 3. **Setting the Sync Frequency**: You can specify the sync frequency for your 
blueprint to achieve recurring data syncs and transformation. Alternatively, 
you can set the frequency to manual if you wish to run the tasks in the 
blueprint manually.
 
-The relationship among Blueprint, Data Connections, Data Scope and 
Transformation Rules is explained as follows:
+The relationship between Blueprint, Data Connections, Data Scope and 
Transformation Rules is explained as follows:
 
 ![Blueprint ERD](/img/Glossary/blueprint-erd.svg)
 - Each blueprint can have multiple data connections.
-- Each data connection can have multiple sets of data scope.
+- Each data connection can have multiple data scopes.
 - Each set of data scope only consists of one GitHub/GitLab project or Jira 
board, along with their corresponding data entities.
 - Each set of data scope can only have one set of transformation rules.
 
 ### Data Sources
 **A data source is a specific DevOps tool from which you wish to sync your 
data, such as GitHub, GitLab, Jira and Jenkins.**
 
-DevLake normally uses one [data plugin](#data-plugins) to pull data for a 
single data source. However, in some cases, DevLake uses multiple data plugins 
for one data source for the purpose of improved sync speed, among many other 
advantages. For instance, when you pull data from GitHub or GitLab, aside from 
the GitHub or GitLab plugin, Git Extractor is also used to pull data from the 
repositories. In this case, DevLake still refers GitHub or GitLab as a single 
data source.
+DevLake normally uses one [data plugin](#data-plugins) to pull data for a 
single data source. However, in some cases, DevLake uses multiple data plugins 
for one data source for a better sync speed, among many other advantages. For 
instance, when you pull data from GitHub or GitLab, aside from the GitHub or 
GitLab plugin, Git Extractor is also used to pull data from the repositories. 
In this case, DevLake still recognizes GitHub or GitLab as a single data source.
 
 ### Data Connections
 **A data connection is a specific instance of a data source that stores 
information such as `endpoint` and `auth`.** A single data source can have one 
or more data connections (e.g. two Jira instances). Currently, DevLake supports 
one data connection for GitHub, GitLab and Jenkins, and multiple connections 
for Jira.
 
-You can set up a new data connection either during the first step of creating 
a blueprint, or in the Connections page that can be accessed from the 
navigation bar. Because one single data connection can be reused in multiple 
blueprints, you can update the information of a particular data connection in 
Connections, to ensure all its associated blueprints will run properly. For 
example, you may want to update your GitHub token in a data connection if it 
goes expired.
+The recommended way to set up a new data connection is via the Data 
Connections page. You can then add the data connection to a DevLake project to 
measure metrics later. A data connection can be used in multiple projects.
 
-### Data Scope
-**In a blueprint, each data connection can have multiple sets of data scope 
configurations, including GitHub or GitLab projects, Jira boards and their 
corresponding [data entities](#data-entities).** The fields for data scope 
configuration vary according to different data sources.
-
-Each set of data scope refers to one GitHub or GitLab project, or one Jira 
board and the data entities you would like to sync for them, for the 
convenience of applying transformation in the next step. For instance, if you 
wish to sync 5 GitHub projects, you will have 5 sets of data scope for GitHub.
-
-To learn more about the default data scope of all data sources and data 
plugins, please refer to [Supported Data Sources](./SupportedDataSources.md).
+### Data Scope(s)
+**A data scope is the top-level "container" in a data source**. For example, a 
data scope for Jira is a Jira board, for TAPD is a TAPD workspace, for 
GitHub/GitLab/BitBucket is a repo, for Jenkins is a Jenkins job, etc. You can 
add multiple data scopes to a data connection to determine which data to 
collect. Data scopes vary for different data sources.
 
 ### Data Entities
 **Data entities refer to the data fields from one of the five data domains: 
Issue Tracking, Source Code Management, Code Review, CI/CD and Cross-Domain.**

[incubator-devlake-website] branch main updated: docs: update scope and connection definition (#559)

Reply via email to