Using a combination of pywiki and mwparserfromhell it shouldn’t be too much of an issue for a wiki. It might be hard for such a bot to keep up on say enwiki, but slower wikis shouldn’t be an issue. Pair that with a database backend, and you should be able to do it without too much issues.
On Thu, Dec 27, 2018 at 11:42 AM Huji Lee <[email protected]> wrote: > This is an idea that came up on fawiki, and there is some merit to it. I > just want to figure out the best approach to implement it and would love > your input. > > *TL;DR: *We want to sweep through the recent edits in articles, look at > each diff, see if it contains the addition of a "{{cite book}}" template, > and if so, set it aside for future processing by another code. > > I wonder if there are already scripts in pywikibot that would help > initiate this. If not, I wonder what is the best strategy to implement this > using MW API. > > Thanks, > Huji > > ------------ > > Long version: > > The idea is to identify users who probably have access to certain offline > sources, so that if another user needs something to be checked in that > source and they don't have access to it, they know who to ask. For > instance, if I have access to a physical copy of Encyclopedia Britannica > (let's say it is a book and is not available digitally), and you want me to > check if it has an entry for Sir Isaac Newton, it would be great if > instead of or in addition to asking on the village pump (which I might not > follow), you would ask me directly. > > The assumption is that if the same user keeps adding the same "{{cite > book}}" template in many articles (e.g. if I add the {{cite book | title = > Encyclopedia Britannica | ... }} in several edits across several articles), > then that user most likely has access to that source. And if these edits > are relatively recent and the user is still active, then chances are the > user can still access that source if another user asks them to. > > So if we find all such edits, we probably can aggregate them into a table > that shows "Huji" added a {{cite book}} for a book titled "Encyclopedia > Britannica" 17 times, and so on and so forth. Sorting it by the frequency > column, we might have a good list of user-source pairs. > > _______________________________________________ > Wikimedia Cloud Services mailing list > [email protected] (formerly [email protected]) > https://lists.wikimedia.org/mailman/listinfo/cloud
_______________________________________________ Wikimedia Cloud Services mailing list [email protected] (formerly [email protected]) https://lists.wikimedia.org/mailman/listinfo/cloud
