boluor opened a new pull request, #3701:
URL: https://github.com/apache/doris-website/pull/3701

   ## Summary
   
   Batch fix for **45 issues** found in a documentation audit (audit findings 
#111–#200, the "simple" tier — typos, broken anchors, missing punctuation, 
wrong frontmatter, etc.). Each fix is applied consistently across `docs/`, the 
affected `versioned_docs/`, and the `i18n/zh-CN/` / `ja-source/` copies where 
the bug actually exists.
   
   Four flagged findings were intentionally **excluded** from this PR after 
verification:
   - **#116/#117** — anchor target was deleted in a recent refactor of 
`install-doris-cluster.md`; correct replacement is ambiguous, needs author 
input.
   - **#120** — already correct in the current English doc, no fix needed.
   - **#159** — `COUNT(DISTINCT uv)` on an HLL column is the intentional 
canonical example of HLL approximate-distinct (not a bug).
   
   ### Issues fixed by area
   
   **install / deploy-manually + deploy-on-kubernetes**
   - `separating-storage-compute-deploy-manually.md`: Step 7 sub-steps 
renumbered 1–4 (had 1,3,4,5); `[PROTERTIES propertires]` → `[PROPERTIES 
properties]`
   - k8s `install-config-cluster.md`: `syg_level` / `syg_mode` → 
`sys_log_level` / `sys_log_mode`
   - k8s `intro.mdx`: card link dropped trailing `.md` to match sibling cards
   - k8s `config-ms.md` / `install-doris-cluster.md`: fix broken FoundationDB 
anchor
   - k8s `config-cluster.md`: Chinese anchor `#权限类型` → `#types-of-permissions`
   - k8s `config-fe.md`: mysql host `doriscluster-sample-fe-service` → 
`doriscluster-sample-fe`
   
   **lakehouse best-practices / catalogs / metastores / storages / 
file-formats**
   - `spill-to-disk.mdx`: link target `workload_groups` → `backend_active_tasks`
   - `doris-hudi.md`: swap mislabeled `-- Doris` / `-- Spark` comments above 
login scripts
   - `doris-paimon.md`: translate stray Chinese comment in English copies
   - `doris-onelake.md`: `<talent_id>` → `<tenant_id>`
   - `jdbc-catalog-overview.md`: fix `'type' =='jdbc'`; the 
PG/Oracle/SQLServer/DB2 rows all linked to `jdbc-mysql-catalog.md`, now point 
to the right per-DB doc
   - `jdbc-ibmdb2-catalog.md`: frontmatter `language: "zh-CN"` → `"en"`
   - `jdbc-saphana-catalog.md`: `smalling` → `smallint`
   - `hudi-catalog.md`: `Java SKD` → `Java SDK`
   - `hive-catalog.mdx` (EN + ja): close the Glue IAM-role code fence
   - `hive-catalog.mdx` / `iceberg-catalog.mdx` / `paimon-catalog.mdx` / 
`kerberos.md`: trim trailing space inside `'hive.metastore.sasl.enabled '`
   - `iceberg-catalog.mdx` / `hive-metastore.md` / `kerberos.md`: collapse 
duplicated `keytabs/keytabs/`
   - `iceberg-catalog.mdx`: add missing comma after `'warehouse' = 
'<warehouse>'`
   - `catalog-overview.md`: SHOW CATALOGS sample shows `iceberg_catalog` type 
`hms`, fixed to `iceberg`
   - `parquet.md` / `text.md` / `gcs.md`: frontmatter `language: "zh-CN"` → 
`"en"` in English copies
   - `lakehouse/statistics.md`: anchor `#外表收集` → `#external-table-collection`
   - `minio.md` / `tencent-cos.md`: legacy name `s3.connection.timeout` → 
`s3.connection.request.timeout`
   
   **query-acceleration / query-data**
   - async-mv `overview.md`: `DELETE TABLE` → `DROP TABLE`
   - async-mv `use-guide.md`: INSERT block had trailing `,` and missing `;`; 
SELECT was missing a comma after `l_extendedprice * (1 - l_discount)`
   - async-mv `faq.md` / `use-guide.md`: replace Chinese anchors `#可选参数`, 
`#透明改写能力`, `#q12构建分区物化视图报错` with English slugs
   - `asof-join.md`: remove orphan `</content>` / `</invoke>` tags at EOF
   - `udf/python-user-defined-function.md`: SELECT was missing a comma after 
`id`
   - `query-data/window-function.md`: 4th stream-load curl missing `columns: ` 
prefix
   - `udf/java-user-defined-function.md`: DDL function name `java-utdf` → 
`java_utdf`
   - `tuning-plan/reordering-join-with-leading-hint.md`: `explain shape plan 
select from` → `explain shape plan select * from` (the `*` was missing)
   - `query-data/mysql-compatibility.md`: bitmap / inverted / N-Gram bullet 
said "MySQL supports", fixed to "Doris supports"
   
   **sql-manual basic-element data-types**
   - `AGG-STATE.md`: `agg_state<group_concat(string) generic` → add missing 
closing `>`
   - `date-time/DATE.md`: sentence wrongly said "TIME type", fixed to "DATE 
type"
   - `numeric/LARGEINT.md`: `### keywords` listed `BIGINT`, fixed to `LARGEINT`
   - `numeric/DECIMAL.md`: example began `elect` → `select`
   - `semi-structured/ARRAY.md`: remove trailing comma after last column 
definition
   - `semi-structured/JSON.md` (EN): translate Chinese section heading `## JSON 
的分组支持`
   - `semi-structured/JSON.md`: grammar `28% of rows is invalid` → `are invalid`
   
   ## Test plan
   
   - [ ] CI doc build passes
   - [ ] Spot-check the corrected anchor links resolve to existing headings
   - [ ] Verify code-block / table render fixes (Glue IAM block, async-mv 
INSERT/SELECT, MinIO/COS rows, etc.)
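
The anchor spot-check in the test plan could be partly automated. The sketch below is a hypothetical helper, not part of this PR: it flags same-page `#anchor` links that have no matching heading. The `slugify` rule is an assumed approximation of GitHub-style heading slugs and may differ from what the site generator actually produces. Cross-file links are not handled, and `#` lines inside code fences would be mistaken for headings.

```python
import re

# ATX headings ("## Title") and same-page links ("[text](#anchor)").
HEADING_RE = re.compile(r"^#{1,6}[ \t]+(.+?)[ \t]*$", re.MULTILINE)
LINK_RE = re.compile(r"\]\(#([^)\s]+)\)")


def slugify(heading: str) -> str:
    """Approximate a GitHub-style slug (assumption, not the site's exact rule)."""
    text = heading.strip().lower()
    text = re.sub(r"[^\w\s-]", "", text)  # drop punctuation, keep word chars and hyphens
    return re.sub(r"\s", "-", text)       # each whitespace character becomes a hyphen


def broken_same_page_anchors(markdown: str) -> list[str]:
    """Return anchors linked within the page that match no heading slug."""
    slugs = {slugify(h) for h in HEADING_RE.findall(markdown)}
    return [a for a in LINK_RE.findall(markdown) if a not in slugs]


# Made-up sample page: one valid English anchor, one stale Chinese anchor.
sample = """## Types of Permissions
See [permissions](#types-of-permissions) and [old link](#权限类型).
"""

if __name__ == "__main__":
    print(broken_same_page_anchors(sample))
```

Run against the made-up sample page, the helper reports only the stale Chinese anchor, which is the class of bug fixed in `config-cluster.md` and `lakehouse/statistics.md` above.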


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

