Re: [datameet] plug -- data story on wikipedia abuse in India

Shijith Kunhitty Tue, 21 Dec 2021 23:40:36 -0800

Oh yeah, the fact that certain wikipedia pages come under protection status 
definitely lowers their revert counts. And I accept that as well in my 
blog.

Blog excerpt:* "This is more of a caveat. Some of these pages will have 
been placed under protection to stop getting abused. Protection could, for 
example, mean edits by anonymous users not appearing till they're approved, 
or if someone has been a registered user for less than 4 days old and has 
done less than 10 edits, they won't be able to edit certain pages. The page 
on Narendra Modi is under even stricter protection—only someone who's been 
a user for more than 30 days and has done over 500 edits can edit the page, 
and this has been the case since April. Such protections brings down the 
abuse levels for many pages, so that should be kept in mind."*

But then you could turn around and say, hey Shijith if you're aware of 
this, why didn't you factor it into your calculations? It's because I think 
there should be a limit to the level of complexity a journalist aims for in 
their story. I accept that I posted this in the datameet mailing list, but 
the story is aimed at a general audience.

Again from my blog: *"I've done work in the past that tries to be 
(conscientiously) rigorous, but I don't think journalism is the place for 
such work. Academic journals maybe, but not publications meant for the 
general public. In the pursuit of precision and conclusion validity, a lot 
of data journalism in India has become completely unreadable. I don't think 
there's anything wrong in admitting upfront that this post is a best-effort 
attempt, the conclusions may not be completely valid, but that hopefully 
this promotes the topic of wikipedia abuse in India as a legitimate avenue 
of research. And that some academic out there does a better job than I did 
in their paper. But I don't think a journalist should be the one doing that 
paper."*

As for the level to which abuse of certain wikipedia pages is organised, 
and how that coordination is done over Telegram and Whatsapp, because many 
of these groups are private groups, it's difficult to monitor that activity 
and find out which pages they are targeting.

But even if that's the case, I don't think I'll be missing out on any pages 
though. Right now the script goes through 150k pages (and once i've 
reworked the code, all the pages) which have been assessed by WikiProject 
India <https://en.wikipedia.org/wiki/Wikipedia:WikiProject_India>, the 
group of editors that maintains pages about India, and they cover *almost 
everything*. So no scope for missing out on disputed pages.

To your point about following OpIndia or other right-wing handles to find 
which wikipedia pages they are targeting, the only issue I would have with 
that is it won't capture abuse of pages that *aren't *a result of 
coordinated, organised campaigns. Abuse of pages that is a result of 
individual actors editing separately, but still devastating at an aggregate 
level, is also interesting to me. The right-wing agenda is pretty much all 
pervasive now, and people don't need to be prodded by politicians, media 
etc. to do their bidding, they pretty much act on their own volition now :(

On Wednesday, 22 December 2021 at 12:00:20 UTC+5:30 Shyamal wrote:

> Dear Shijith,
>
> This is very interesting work but I think it misses a lot of action due to 
> the rather simple approach to identifying dispute. An example is that calls 
> for action are often made from Whatsapp Group and on Twitter - such as 
> ("meat puppet") campaigns to make changes - for instance to alter the entry 
> on "Love Jihad" to read differently or for "Adam's Bridge" to be renamed 
> which have resulted in those pages being placed on protection - that in 
> turn drastically lowers edit revert counts. A script that follows 
> highly-followed twitter handles (OpIndia for instance) complaining about 
> Wikipedia pages might show up more disputed pages. 
>
> best wishes
> Shyamal
> https://muscicapa.blogspot.com
>

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/datameet/ff7d7d96-427f-4bf8-91c4-4e9d5df8b36fn%40googlegroups.com.

Re: [datameet] plug -- data story on wikipedia abuse in India

Reply via email to