Hi, my name is Shijith, and I'm a freelance data journalist. (Worked 
previously at Hindustan Times and IndiaSpend, have also contributed to 
datameet.org in the past <http://datameet.org/author/shijithpk/>.)

Just wanted to plug a data story I did recently about Wikipedia abuse in 
India. Such abuse is an old problem, but it's getting more media attention 
with users distorting facts on pages about the Delhi riots or farmer 
protests. Sometimes users engage in straight out vandalism where they 
delete whole sections from a page.

I tried to determine which wikipedia pages faced the most abuse this year, 
and I also introduce a twitter account that allows people to track 
wikipedia abuse weekly.

This is the link to the story: 
https://shijith.com/blog/wikipedia-page-abuse/

This is the twitter account for tracking wikipedia abuse every week: 
http://twitter.com/abuse_checker 

And here's the python code I used for the project: 
https://github.com/shijithpk/wikipedia_abuse_checker

(Am in the process of re-working the code. Right now it's querying the 
wikipedia API every week for the edit histories of over 150k articles, and 
the whole run is taking 2 days now. Discovered an API endpoint for recent 
changes that should make things more efficient.)

Have any questions or feedback, do let me know!
Thanks, Shijith

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/datameet/7d4d5e8b-2602-4e7d-a9ff-d8b07e216c0cn%40googlegroups.com.

Reply via email to