The new and improved version of the copy and detection bot that we at [[WP:
MED]] have been using for nearly a year [
https://en.wikipedia.org/wiki/User:EranBot/Copyright here] is nearly ready
to be expanded to other topic areas.
It can be found here [
Hi Peter,
The complete quote goes: There must be another way to work for the value
of free knowledge for the people but to destroy net neutrality and the
experience of an open web in the very beginning at the same time.
When it comes to schools and other educational organisations in developing
Still, in my assessment it is lacking on concrete details. There are many
terms that are coined and movements cited which are not definitively
explained, in some cases with hints that the departments doing the
reporting have not themselves yet arrived at precise meaning. I suppose
that, like the
Okay, but seriously, please stop resurrecting this thread. If you
think it's important that something be done, start a new one, and
*actually suggest something* rather than just copying articles from
somewhere else.
Austin
On Fri, Apr 3, 2015 at 1:58 AM, Andreas Kolbe jayen...@gmail.com wrote:
Hi, James.
Is the source code available anywhere?
IF you want to try your bot in other languages, I could help you with
testing in Russian Wikipedia :)
Best regards.
rubin16
2015-04-03 12:07 GMT+03:00 James Heilman jmh...@gmail.com:
The new and improved version of the copy and detection bot
Hi James
I often suspect copy-paste and find exact matches of the text
elsewhere. However, whereas one can painstakingly (unless there is a
trick that I am not aware of) ascertain when text was enetered into
an article, it is not always possible to know when the other text
first appeared on the
In focus: WMF's latest strategy document shows successes, vagueness, and the
need for better data
http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2015-04-01/In_focus
In the media: Wiki-PR duo bulldoze a piñata store; Wifione arbitration case;
French parliamentary plagiarism
1) Yes the source code is available. User:Eran has posted it here
https://github.com/valhallasw/plagiabot
2) This bot ONLY works on new edits within a couple of hours of them
occurring. This reducing the number of false positives. It DOES NOT look at
old edits.
3) This requires human follow up
Hi Brian,
2015-03-30 0:25 GMT+02:00 Brian reflect...@gmail.com:
Although the initial goal of the Netflix Prize was to design a
collaborative filtering algorithm, it became notorious when the data was
used to de-anonymize Netflix users. Researchers proved that given just a
user's movie ratings