labuladong opened a new issue, #18026: URL: https://github.com/apache/pulsar/issues/18026
### Search before asking

- [X] I searched in the [issues](https://github.com/apache/pulsar/issues) and found nothing similar.

### What issue do you find in Pulsar docs?

After #17495 and #17599, there are many broken links in the Pulsar documentation, and more links may break as the docs evolve.

### What is your suggestion?

In #17599 I used a script, but it was not reliable enough, so I had to check the list of links manually.

I suggest developing a reliable script to automatically detect incorrect links in the Pulsar documentation. We could integrate this script into the CI process that runs on documentation changes.

Before development, I'd like to enumerate the kinds of broken links.

### 1. wrong markdown file reference

For example, the link on [this page](https://pulsar.apache.org/docs/next/admin-api-tenants/#get-configuration):

<img width="573" alt="image" src="https://user-images.githubusercontent.com/37220920/195486126-e8a3de23-4cc5-4319-a11f-7c222b2a1b4a.png">

The markdown content is `[configuration](reference-configuration.md)`, but the `reference-configuration.md` file does not exist.

### 2. 404 URL path

For example, the link on [this page](https://pulsar.apache.org/docs/next/client-libraries-websocket/#query-param):

<img width="827" alt="image" src="https://user-images.githubusercontent.com/37220920/195486601-aea69e74-005d-4c2c-8fb1-02fcc80d1f9c.png">

The markdown content is `[type](/api/client/index.html?org/apache/pulsar/client/api/CompressionType.html)`, but the Pulsar site doesn't have this path.

### 3. confusing URL path

For example, the link on [this page](https://pulsar.apache.org/docs/next/functions-debug-cli/):

The markdown content is `[Pulsar Functions CLI](/tools/pulsar-admin/)`, but this leads to a confusing page:

<img width="582" alt="image" src="https://user-images.githubusercontent.com/37220920/195488231-79bb5f13-e832-415b-81f5-36c0330908b4.png">

### 4. invalid title anchor

We can use `#` to refer to a specific section of a page, like this: `[dataDir](reference-configuration.md#zookeeper-dataDir)`. It would be better if the script could also detect whether the anchor exists.

The script should be able to detect all of these broken links and print warning messages to users.

cc @tisonkun @Anonymitaet @momo-jun @michaeljmarshall

### Any reference?

_No response_

### Are you willing to submit a PR?

- [X] I'm willing to submit a PR!
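
To make the proposal more concrete, here is a minimal sketch of how kinds 1 and 4 (missing markdown file, missing title anchor) might be detected. It is not the script used in #17599. The function names `find_broken_links` and `slugify` are hypothetical, and the slug rule (lowercase, punctuation dropped, spaces turned into hyphens, compared case-insensitively) is only an assumption about how the site generator derives anchors from headings.

```python
import re
from pathlib import Path

# Inline Markdown links of the form [text](target); links with a title are skipped.
LINK_RE = re.compile(r"\[[^\]]*\]\(([^)\s]+)\)")
# ATX headings such as "## Get configuration".
HEADING_RE = re.compile(r"^#{1,6}[ \t]+(.*?)[ \t]*$", re.MULTILINE)


def slugify(heading):
    """Approximate the anchor generated for a heading (assumed rule, see lead-in)."""
    slug = heading.strip().lower()
    slug = re.sub(r"[^\w\s-]", "", slug)  # drop punctuation
    return re.sub(r"\s+", "-", slug)      # spaces become hyphens


def find_broken_links(docs_dir):
    """Return (file name, link target, reason) tuples for broken relative .md links."""
    docs_dir = Path(docs_dir)
    problems = []
    for md_file in sorted(docs_dir.rglob("*.md")):
        text = md_file.read_text(encoding="utf-8")
        for target in LINK_RE.findall(text):
            path_part, _, anchor = target.partition("#")
            # Only relative references to markdown files are checked here;
            # external URLs and site-absolute paths (kinds 2 and 3) are skipped.
            if "://" in path_part or path_part.startswith("/"):
                continue
            if not path_part.endswith(".md"):
                continue
            ref = md_file.parent / path_part
            if not ref.is_file():
                problems.append((md_file.name, target, "missing file"))
                continue
            if anchor:
                headings = HEADING_RE.findall(ref.read_text(encoding="utf-8"))
                anchors = {slugify(h) for h in headings}
                # Case-insensitive comparison is an assumption of this sketch.
                if anchor.lower() not in anchors:
                    problems.append((md_file.name, target, "missing anchor"))
    return problems


if __name__ == "__main__":
    import sys

    # Print one warning per broken link for the docs directory given on the command line.
    for name, target, reason in find_broken_links(sys.argv[1] if len(sys.argv) > 1 else "."):
        print(f"WARNING: {name}: {target} ({reason})")
```

Kinds 2 and 3 are site-absolute paths, so checking them would probably require comparing against the built site or its route list rather than the markdown sources. This sketch deliberately leaves them out.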
