Hello,
Persian Wikipedia is one of the largest wikis based on number of categories
but It's not very common that people consider adding interwiki of
categories (they think interwiki is just for articles) so we have tons of
tons (before writing my engine that was 30K out of 170K) categories without
any interwikis which is really bad. I wrote some codes to make it better
but It wasn't enough So I wrote an engine that gets two database: 1-list of
categories without interwiki 2-list of categories with interwiki to a
certain language (e.g. English) with the target interwiki and after that my
bot analyzes and "guess" what is the correct interwiki of category based on
patterns of naming them in the second database
and bot reports. After running this code on fa.wp there was a very huge
report [1] and we started to sort things out (merging duplicates [2],
deleting extra ones, adding the correct iw) and now it's less than 25K
categories without interwikis (and It's becoming less and less) we did the
same on templates namespace [3] and we interwikified more than 10K
templates after that.

And because this engine doesn't use any language-related analyses It can be
ran in any language and get interwiki from any language (we planned to run
this on Persian Wikipedia again but this time we use Dutch and German
languages as repo of interwiki)

So here is my question: Is there similar situation in your wiki? Do you
want to run this code in your wiki too? Do you have any suggestion?
[1]: https://fa.wikipedia.org/w/index.php?title=کاربر:Ladsgroup/
رده‌ها&oldid=10959457
[2]: One of the benefits of running this engine is we can find duplicates
[3]:
https://fa.wikipedia.org/wiki/%DA%A9%D8%A7%D8%B1%D8%A8%D8%B1:Ladsgroup/%D8%A7%D9%84%DA%AF%D9%88%D9%87%D8%A7%DB%8C
Best
---
Amir
_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to