Hi all,

some of you could find my recent experiments interesting. I posted them here:

http://morfologik.blogspot.com/2007/01/wikipedia-history-diff-as-revision.html

In short, it seems that Lars' idea was brilliant, and it is possible to filter out the edit wars using simple metrics. Prepare to buy larger disks, revision histories are big files :)

best,
Marcin

[email protected] napisał(a):
Lars wrote:

One idea for finding stats on errors is to compare changes made to Wikipedia articles. The complete text revision history is

That might make a good corpus.
Would it be possible to write a script that picks up just the spelling/grammar changes? If not, you'll be counting the effects of numerous edit wars.

xan

jonathon




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to