This is an automated email from the ASF dual-hosted git repository.
kassiez pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/doris-website.git
The following commit(s) were added to refs/heads/master by this push:
new fef73cb2fc0 [fix] Fix picture of blog (#2410)
fef73cb2fc0 is described below
commit fef73cb2fc098bfdbc01211370db30cb9c2d5f52
Author: KassieZ <[email protected]>
AuthorDate: Fri May 23 20:36:34 2025 +0800
[fix] Fix picture of blog (#2410)
## Versions
- [ ] dev
- [ ] 3.0
- [ ] 2.1
- [ ] 2.0
## Languages
- [ ] Chinese
- [ ] English
## Docs Checklist
- [ ] Checked by AI
- [ ] Test Cases Built
---
...-data-reporting-tagging-and-data-lake-analytics.md | 13 +++++++------
.../average-query-response-time.png | Bin 0 -> 488086 bytes
.../data-synchronization-size-and-duration.png | Bin 0 -> 1067487 bytes
.../from-clickHouse-mysql-to-apache-doris.png | Bin 0 -> 151906 bytes
.../query-response-time-percentile.png | Bin 0 -> 565356 bytes
static/images/blogs/speed-up-lakehouse/skein.png | Bin 0 -> 271309 bytes
.../blogs/speed-up-lakehouse/tagging-services.png | Bin 0 -> 158447 bytes
7 files changed, 7 insertions(+), 6 deletions(-)
diff --git
a/blog/apache-doris-speeds-up-data-reporting-tagging-and-data-lake-analytics.md
b/blog/apache-doris-speeds-up-data-reporting-tagging-and-data-lake-analytics.md
index 6759dca3428..d4285f33aec 100644
---
a/blog/apache-doris-speeds-up-data-reporting-tagging-and-data-lake-analytics.md
+++
b/blog/apache-doris-speeds-up-data-reporting-tagging-and-data-lake-analytics.md
@@ -41,19 +41,20 @@ Before migrating to Apache Doris, they used ClickHouse,
MySQL, and Elasticsearch
Data reporting is one of the major services they provide to their customers
and they are bound by an SLA. They used to support such service with a
combination of ClickHouse and MySQL, but they found significant fluctuations in
their data synchronization duration, making it hard for them to meet the
service levels outlined in their SLA. Diagnosis showed that it was because the
multiple components add to the complexity and instability of data
synchronization tasks. To fix that, they have u [...]
-<div style={{textAlign:'center'}}><img
src="https://cdn.selectdb.com/static/from_clickhouse_mysql_to_apache_doris_6387c0363a.png"
alt="from-clickhouse-mysql-to-apache-doris" width="840" style={{display:
'inline-block'}} /></div >
+
### Performance improvements
With Apache Doris, they ingest data via the [Broker
Load](https://doris.apache.org/docs/1.2/data-operate/import/import-way/broker-load-manual)
method and reach an SLA compliance rate of over 99% in terms of data
synchronization performance.
-<div style={{textAlign:'center'}}><img
src="https://cdn.selectdb.com/static/data_synchronization_size_and_duration_327e4dc1fe.png"
alt="data-synchronization-size-and-duration" width="640" style={{display:
'inline-block'}} /></div >
+
As for data queries, the Doris-based architecture maintains an **average query
response time** of less than **10s** and a **P90 response time** of less than
**30s**. This is a 50% speedup compared to the old architecture.
-<div style={{textAlign:'center'}}><img
src="https://cdn.selectdb.com/static/average_query_response_time_372d71ef16.png"
alt="average-query-response-time" width="840" style={{display:
'inline-block'}} /></div >
+
+
+
-<div style={{textAlign:'center'}}><img
src="https://cdn.selectdb.com/static/query_response_time_percentile_756c6f6a71.png"
alt="query-response-time-percentile" width="840" style={{display:
'inline-block'}} /></div >
## Tagging
@@ -64,7 +65,7 @@ In the old processing architecture where Elasticsearch was
the processing engine
- Any problematic data in any of the tags could spoil the entire merging
operation and thus interrupt the data services.
- The merging operation was implemented based on Spark and MapReduce and took
up to 4 hours. Such a long time frame could encroach on marketing opportunities
and lead to unseen losses.
-<div style={{textAlign:'center'}}><img
src="https://cdn.selectdb.com/static/tagging_services_3263e21c36.png"
alt="tagging-services" width="840" style={{display: 'inline-block'}} /></div >
+
Then Apache Doris takes this over. Apache Doris arranges tag data with its
data models, which process data fast and smoothly. The aforementioned merging
step can be done by the [Aggregate Key
model](https://doris.apache.org/docs/data-table/data-model#aggregate-model),
which aggregates tag data based on the specified Aggregate Key upon data
ingestion. The [Unique Key
model](https://doris.apache.org/docs/data-table/data-model#unique-model) is
handy for partial column updates. Again, all yo [...]
@@ -84,7 +85,7 @@ The user introduces Compute Nodes into their cluster and
deploys them with other
For easier deployment, they have also optimized their Deploy on Yarn process
via Skein. As is shown below, they define the number of Compute nodes and the
required resources in the YAML file, and then pack the installation file,
configuration file, and startup script into the distributed file system. In
this way, they can start or stop the entire cluster of over 100 nodes within
minutes using one simple line of code.
-<div style={{textAlign:'center'}}><img
src="https://cdn.selectdb.com/static/skein_3516ba1a83.png" alt="skein"
width="560" style={{display: 'inline-block'}} /></div >
+
## Conclusion
diff --git
a/static/images/blogs/speed-up-lakehouse/average-query-response-time.png
b/static/images/blogs/speed-up-lakehouse/average-query-response-time.png
new file mode 100644
index 00000000000..bc81fe04730
Binary files /dev/null and
b/static/images/blogs/speed-up-lakehouse/average-query-response-time.png differ
diff --git
a/static/images/blogs/speed-up-lakehouse/data-synchronization-size-and-duration.png
b/static/images/blogs/speed-up-lakehouse/data-synchronization-size-and-duration.png
new file mode 100644
index 00000000000..b3e4f84051f
Binary files /dev/null and
b/static/images/blogs/speed-up-lakehouse/data-synchronization-size-and-duration.png
differ
diff --git
a/static/images/blogs/speed-up-lakehouse/from-clickHouse-mysql-to-apache-doris.png
b/static/images/blogs/speed-up-lakehouse/from-clickHouse-mysql-to-apache-doris.png
new file mode 100644
index 00000000000..3f1c2810511
Binary files /dev/null and
b/static/images/blogs/speed-up-lakehouse/from-clickHouse-mysql-to-apache-doris.png
differ
diff --git
a/static/images/blogs/speed-up-lakehouse/query-response-time-percentile.png
b/static/images/blogs/speed-up-lakehouse/query-response-time-percentile.png
new file mode 100644
index 00000000000..3106633a41f
Binary files /dev/null and
b/static/images/blogs/speed-up-lakehouse/query-response-time-percentile.png
differ
diff --git a/static/images/blogs/speed-up-lakehouse/skein.png
b/static/images/blogs/speed-up-lakehouse/skein.png
new file mode 100644
index 00000000000..560441c46a7
Binary files /dev/null and b/static/images/blogs/speed-up-lakehouse/skein.png
differ
diff --git a/static/images/blogs/speed-up-lakehouse/tagging-services.png
b/static/images/blogs/speed-up-lakehouse/tagging-services.png
new file mode 100644
index 00000000000..564bd25817d
Binary files /dev/null and
b/static/images/blogs/speed-up-lakehouse/tagging-services.png differ
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]