Using a combination of pywiki and mwparserfromhell it shouldn’t be too much
of an issue for a wiki. It might be hard for such a bot to keep up on say
enwiki, but slower wikis shouldn’t be an issue. Pair that with a database
backend, and you should be able to do it without too much issues.

On Thu, Dec 27, 2018 at 11:42 AM Huji Lee <[email protected]> wrote:

> This is an idea that came up on fawiki, and there is some merit to it. I
> just want to figure out the best approach to implement it and would love
> your input.
>
> *TL;DR: *We want to sweep through the recent edits in articles, look at
> each diff, see if it contains the addition of a "{{cite book}}" template,
> and if so, set it aside for future processing by another code.
>
> I wonder if there are already scripts in pywikibot that would help
> initiate this. If not, I wonder what is the best strategy to implement this
> using MW API.
>
> Thanks,
> Huji
>
> ------------
>
> Long version:
>
> The idea is to identify users who probably have access to certain offline
> sources, so that if another user needs something to be checked in that
> source and they don't have access to it, they know who to ask. For
> instance, if I have access to a physical copy of Encyclopedia Britannica
> (let's say it is a book and is not available digitally), and you want me to
> check if it has an entry for  Sir Isaac Newton, it would be great if
> instead of or in addition to asking on the village pump (which I might not
> follow), you would ask me directly.
>
> The assumption is that if the same user keeps adding the same "{{cite
> book}}" template in many articles (e.g. if I add the {{cite book | title =
> Encyclopedia Britannica | ... }} in several edits across several articles),
> then that user most likely has access to that source. And if these edits
> are relatively recent and the user is still active, then chances are the
> user can still access that source if another user asks them to.
>
> So if we find all such edits, we probably can aggregate them into a table
> that shows "Huji" added a {{cite book}} for a book titled "Encyclopedia
> Britannica" 17 times, and so on and so forth. Sorting it by the frequency
> column, we might have a good list of user-source pairs.
>
> _______________________________________________
> Wikimedia Cloud Services mailing list
> [email protected] (formerly [email protected])
> https://lists.wikimedia.org/mailman/listinfo/cloud
_______________________________________________
Wikimedia Cloud Services mailing list
[email protected] (formerly [email protected])
https://lists.wikimedia.org/mailman/listinfo/cloud

Reply via email to