Hi,

On 9/3/22 03:33, Dušan Kreheľ wrote:
I want to share my idea (written up in the article) about the wikipage
parser: 
https://en.wikipedia.org/wiki/User:Du%C5%A1an_Krehe%C4%BE/Signpost_draft:My_idea_about_wikipage_parser

I wasn't able to fully follow your conclusion, but I do want to say that Parsoid HTML is actually really great for bots. It's pretty well documented[1], and it makes common bot tasks like parsing and manipulating templates really easy once you wrap your head around how to manipulate the DOM.
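
To make the template case concrete, here is a minimal sketch in Python (standard library only). It assumes the layout described in the spec[1], where a transclusion is marked with typeof="mw:Transclusion" and carries its template name and parameters as JSON in a data-mw attribute. The SAMPLE fragment and the find_templates helper are made up for illustration; real Parsoid output has more attributes and edge cases than this handles.

```python
import json
from html.parser import HTMLParser


class TransclusionFinder(HTMLParser):
    """Collect (template name, params) pairs from Parsoid-style HTML."""

    def __init__(self):
        super().__init__()
        self.templates = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        # typeof can hold several space-separated values.
        if "mw:Transclusion" not in (attrs.get("typeof") or "").split():
            return
        data_mw = json.loads(attrs.get("data-mw") or "{}")
        for part in data_mw.get("parts", []):
            # Parts can also be plain strings (wikitext between templates).
            if isinstance(part, dict) and "template" in part:
                tpl = part["template"]
                name = tpl["target"]["wt"].strip()
                params = {k: v["wt"] for k, v in tpl.get("params", {}).items()}
                self.templates.append((name, params))


# Hypothetical fragment, shaped like the output for {{Convert|5|km}}.
SAMPLE = '''<p>Distance: <span about="#mwt1" typeof="mw:Transclusion" data-mw='{"parts":[{"template":{"target":{"wt":"Convert","href":"./Template:Convert"},"params":{"1":{"wt":"5"},"2":{"wt":"km"}},"i":0}}]}'>5 km</span></p>'''


def find_templates(html):
    """Return every template call found in the given HTML string."""
    finder = TransclusionFinder()
    finder.feed(html)
    finder.close()
    return finder.templates


if __name__ == "__main__":
    print(find_templates(SAMPLE))  # [('Convert', {'1': '5', '2': 'km'})]
```

Editing a template works the same way in reverse: change the JSON in data-mw and write it back to the attribute before the HTML is converted back to wikitext.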

I developed a `parsoid` Rust crate[2] that lets you operate on Parsoid HTML at a very high level, so you don't really need to understand the DOM stuff; it's basically the mwparserfromhell API with a few tweaks. It's part of the mwbot-rs project[3], so bots are a first-class use case (as are interactive web tools).

I would recommend you give it a second try, and if you get stuck, the Parsoid developers are pretty responsive to questions; you can find them in #mediawiki-parsoid on Libera, which is bridged to #parsoid:matrix.org.

[1] https://www.mediawiki.org/wiki/Specs/HTML/2.5.0
[2] https://docs.rs/parsoid/0.7.0-alpha.4/parsoid/index.html
[3] https://www.mediawiki.org/wiki/mwbot-rs

-- Kunal / Legoktm
_______________________________________________
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
