[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

Diederik van Liere changed:

           What    |Removed                        |Added
----------------------------------------------------------------------------
           Assignee|dvanli...@gmail.com            |wikibugs-l@lists.wikimedia.org

--
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

Nemo changed:

           What     |Removed                       |Added
----------------------------------------------------------------------------
           Component|Wikistats                     |Webstatscollector
           Product  |Analytics                     |Datasets
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

Andre Klapper changed:

           What          |Removed                  |Added
----------------------------------------------------------------------------
           Priority      |Normal                   |Low
           Status        |ASSIGNED                 |UNCONFIRMED
           CC            |                         |aklap...@wikimedia.org
           Ever Confirmed|1                        |0

--- Comment #7 from Andre Klapper 2012-12-03 16:58:29 UTC ---
So the only potential use case I've seen mentioned so far in this report is "to write a bot that will use such data to automatically fix links to disambig pages". Is that all?

However, better clickstream data is mentioned at https://www.mediawiki.org/wiki/Wikimedia_Engineering/2012-13_Goals#Analytics
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

Andre Klapper changed:

           What     |Removed                       |Added
----------------------------------------------------------------------------
           CC       |                              |wikibugs-l@lists.wikimedia.org
           Component|Statistics                    |Wikistats
           Product  |Wikimedia                     |Analytics

--- Comment #6 from Andre Klapper 2012-12-03 13:59:10 UTC ---
[mass-moving wikistats reports from Wikimedia→Statistics to Analytics→Wikistats to have stats issues under one Bugzilla product (see bug 42088) - sorry for the bugspam!]
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

Rob Lanphier changed:

           What      |Removed                      |Added
----------------------------------------------------------------------------
           AssignedTo|ro...@wikimedia.org          |dvanli...@gmail.com
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

howief changed:

           What      |Removed                      |Added
----------------------------------------------------------------------------
           AssignedTo|hf...@wikimedia.org          |ro...@wikimedia.org
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

howief changed:

           What    |Removed                        |Added
----------------------------------------------------------------------------
           CC      |                               |howiew...@gmail.com

--- Comment #5 from howief 2011-02-10 02:31:59 UTC ---
I'm not sure the benefits of fixing the disambiguation issue outweigh the potential privacy concerns. Yes, we do want better analytics, but we should think very carefully about what click data we want to track and/or publish. E.g., we may consider applying click-tracking to some types of pages but not others, if that's possible.

Robla is managing the priority list of analytics-related features, so I'm going to assign this to him.

Any other use cases for this data?
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

--- Comment #4 from Bawolff 2011-02-09 19:15:29 UTC ---
> Based on some Google research I did in response to your
> comment, I found that the Wikimedia Foundation already decided last year to get
> some better analytics tools.

My main concern was giving out such data to everyone who could potentially want it. Bot developers are a wide group of people, of varying levels of competency. I wouldn't really want such a group to have access to such data unless it was very well anonymized.

Such information could be sensitive. Say someone browsed through various articles on Wikipedia about sexual topics, followed by a browse through the Commons categories for sexual images (assuming such categories still exist after the recent controversies that I haven't really been following), followed by the user visiting his own user page (so one can identify who it is; if user pages aren't listed, perhaps followed by him accidentally making a typo and going to uer:/ whatever). That might be something that the user would not want to be published.

Anyway, I'm all for better analytics tools in general (I love the page stats), but we also have to be careful. Even anonymized data can be harmful to release (for example [[AOL search data scandal]]) if not done carefully.
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

--- Comment #3 from Diederik van Liere 2011-02-09 03:52:50 UTC ---
Maybe we should assign this to Rob Lanphier or Nimish.
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

Jason Spiro changed:

           What      |Removed                        |Added
----------------------------------------------------------------------------
           Status    |NEW                            |ASSIGNED
           CC        |hf...@wikimedia.org            |
           AssignedTo|wikibugs-l@lists.wikimedia.org |hf...@wikimedia.org

--- Comment #2 from Jason Spiro 2011-02-09 03:37:20 UTC ---
Sorry Bawolff. Based on some Google research I did in response to your comment, I found that the Wikimedia Foundation already decided last year to get some better analytics tools.[1] :) But remember that the Foundation has a privacy policy already. Also, they can do a few things if they so choose: they can limit who can see the data, and they can limit from whom they collect the data.

^ [1]. http://www.mediawiki.org/wiki/Analytics_upgrade

Also, if they decide to make clickstream data available to certain people (say, bot developers), they can further sanitize it by removing all records of clicks on user pages and user talk pages.

I just CC'ed all five members of the analytics upgrade team to this bug, and assigned this bug to Howie Fung. I hope both of those actions were OK.
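
[Editorial illustration] The sanitization step suggested in comment #2 (dropping every click record that touches a user page or user talk page) might look roughly like the sketch below. Nothing in it comes from the bug report: the Click record, the EXCLUDED_PREFIXES tuple, and the is_user_page and sanitize functions are all hypothetical names, and the record layout (a source title and a target title) is an assumption about what a clickstream entry could contain.

```python
from typing import Iterable, List, NamedTuple


class Click(NamedTuple):
    """Hypothetical clickstream record: one navigation between two pages."""
    source: str  # title of the page the reader was on
    target: str  # title of the page the reader clicked through to


# Assumption for this sketch: only these two namespaces are treated as identifying.
EXCLUDED_PREFIXES = ("user:", "user talk:")


def is_user_page(title: str) -> bool:
    """Return True if the title is in the User or User talk namespace."""
    # Treat underscores as spaces and ignore case before comparing prefixes.
    normalized = title.strip().replace("_", " ").lower()
    return normalized.startswith(EXCLUDED_PREFIXES)


def sanitize(clicks: Iterable[Click]) -> List[Click]:
    """Drop every record whose source or target is a user or user talk page."""
    return [
        c for c in clicks
        if not (is_user_page(c.source) or is_user_page(c.target))
    ]
```

A prefix filter like this only removes exact namespace matches. It would not catch the mistyped-title case raised in comment #4, and it does not by itself anonymize the remaining records.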
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

Bawolff changed:

           What    |Removed                        |Added
----------------------------------------------------------------------------
           CC      |                               |bawolff...@gmail.com

--- Comment #1 from Bawolff 2011-02-08 03:14:40 UTC ---
As a privacy nutjob (I think you can guess where my comment is going)
[Bug 12742] Collect enwiki clickstream data (we could use it to automatically fix links to disambiguation pages and more)
https://bugzilla.wikimedia.org/show_bug.cgi?id=12742

Diederik van Liere changed:

           What    |Removed                        |Added
----------------------------------------------------------------------------
           Keywords|                               |analytics