[Wikidata-bugs] [Maniphest] T261850: compare model accuracy with and without property suggester

2020-09-10 Thread Halfak
Halfak added a comment. It's not quite fair to compare the old and new feature sets. It does look like the property suggester was having a minor positive effect, but it does not seem to have been worth the additional API call. Everything that follows is just me nerding out about the stats

[Wikidata-bugs] [Maniphest] [Commented On] T257341: Add ORES article quality predictions to the WDQS

2020-07-07 Thread Halfak
Halfak added a comment. We already store article quality predictions in the `ores_classification` table on the wikis where we have support. We store some predictions related to topic in Elasticsearch (see the "articletopic:foo" keyword). I'm not sure about how much Elas

[Wikidata-bugs] [Maniphest] [Created] T257341: Add ORES article quality predictions to the WDQS

2020-07-07 Thread Halfak
Halfak created this task. Halfak added projects: Scoring-platform-team, articlequality-modeling, Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. Restricted Application added projects: Wikidata, artificial-intelligence. TASK DESCRIPTION From @SandraF_WMF >

[Wikidata-bugs] [Maniphest] [Updated] T139573: Simple html formatting within Wikidata labels

2020-05-09 Thread Halfak
Halfak added a parent task: T252280: Improve Wikilabels UI. TASK DETAIL https://phabricator.wikimedia.org/T139573 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Malore, Liuxinyu970226, Ash_Crow, daniel, VIGNERON, Moyogo, Bugreporter

[Wikidata-bugs] [Maniphest] [Unassigned] T141896: Impact of ORES on Wikidata: time-to-revert changes

2020-03-18 Thread Halfak
Halfak removed Halfak as the assignee of this task. TASK DETAIL https://phabricator.wikimedia.org/T141896 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Halfak, DarTar, Aklapper, darthmon_wmde, Nandana, Lahi, Gq86, Xinbenlv, Vacio

[Wikidata-bugs] [Maniphest] [Commented On] T225944: Explore using ShEx to support ORES in Wikidata

2019-11-06 Thread Halfak
Halfak added a comment. @WMDE-leszek, has any progress been made on item #1? TASK DETAIL https://phabricator.wikimedia.org/T225944 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: WMDE-leszek, Lucas_Werkmeister_WMDE, Halfak, Aklapper

[Wikidata-bugs] [Maniphest] [Closed] T226120: ORES is too slow for ORC tool

2019-11-06 Thread Halfak
Halfak closed this task as "Invalid". Halfak added a comment. Thanks for the notes @YMS. (Sorry to not respond sooner. Lost track of this ticket.) We'll close for now. Make sure that, when you do try again, you use https://ores.wikimedia.org as your base URL. I'm not
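A minimal sketch of pinning a client to the base URL named above. The `/v3/scores/{context}/` path and the `models`/`revids` query parameters reflect my recollection of the ORES v3 API and are not stated in this thread, so treat them as assumptions; `build_scores_url` is a hypothetical helper and makes no network request.

```python
from urllib.parse import urlencode

# Base URL taken from the comment above.
ORES_BASE_URL = "https://ores.wikimedia.org"


def build_scores_url(context, models, rev_ids, base_url=ORES_BASE_URL):
    """Build a scoring URL for one wiki (hypothetical helper).

    Assumes a v3-style path and pipe-separated `models`/`revids` parameters.
    """
    query = urlencode({
        "models": "|".join(models),
        "revids": "|".join(str(rev_id) for rev_id in rev_ids),
    })
    return f"{base_url}/v3/scores/{context}/?{query}"


url = build_scores_url("wikidatawiki", ["damaging"], [1, 2])
```

Keeping the base URL in one constant makes it easy to confirm that every request goes to the intended host.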

[Wikidata-bugs] [Maniphest] [Commented On] T195702: track quality of all/top 10000 Wikidata items over time

2019-09-25 Thread Halfak
Halfak added a comment. @abian, ORES directly models a measure of "completeness". However, it turns out that accuracy and consistency strongly correlate with these measures of "completeness", so it is also a //good and useful// proxy measure of "consistency" and "

[Wikidata-bugs] [Maniphest] [Commented On] T195702: track quality of all/top 10000 Wikidata items over time

2019-09-04 Thread Halfak
Halfak added a comment. For clarity, making millions of calls to ORES is totally feasible. We have a utility for doing just this. @GoranSMilovanovic has been using the `ores score_revisions` utility. If you create a json file with a field called "rev_id" containing the most rec
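As a sketch of preparing input for a bulk scoring run like the one described above: the comment is cut off, so only the "rev_id" field name comes from it. The one-JSON-object-per-line layout and the helper name `write_rev_id_file` are assumptions for illustration, and no command-line flags for the utility are implied.

```python
import json


def write_rev_id_file(rev_ids, path):
    """Write one JSON object per line, each carrying a "rev_id" field.

    The line-delimited layout is an assumption, not confirmed by the thread.
    """
    with open(path, "w", encoding="utf-8") as f:
        for rev_id in rev_ids:
            f.write(json.dumps({"rev_id": int(rev_id)}) + "\n")
```

For example, `write_rev_id_file([123, 456], "rev_ids.json")` would produce a two-line file that a batch tool could stream without loading everything into memory.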

[Wikidata-bugs] [Maniphest] [Triaged] T226120: ORES is too slow for ORC tool

2019-07-23 Thread Halfak
Halfak triaged this task as "Normal" priority. TASK DETAIL https://phabricator.wikimedia.org/T226120 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: YMS, Halfak, Groceryheist, Aklapper, darthmon_wmde, Nandana, Sario528, Lahi, G

[Wikidata-bugs] [Maniphest] [Commented On] T226120: ORES is too slow for ORC tool

2019-07-23 Thread Halfak
Halfak added a comment. Honestly, it is hard to believe that ORES is too slow. But it's important that I understand what the ORC tool *needs* before concluding that. @YMS, could you give us some insights? Anyone else who uses the ORC tool and knows what the issue might be is welcome too

[Wikidata-bugs] [Maniphest] [Commented On] T226120: ORES is too slow for ORC tool

2019-06-20 Thread Halfak
Halfak added a comment. @YMS, I just heard about this. I'd like to see how we might make ORES work better for the ORC tool. Can you let us know any more about what you'd like to use ORES for and how it isn't working for you? TASK DETAIL https://phabricator.wikimedia.org/T226120 EMAIL

[Wikidata-bugs] [Maniphest] [Created] T226120: ORES is too slow for ORC tool

2019-06-19 Thread Halfak
Halfak created this task. Halfak added projects: ORES, editquality-modeling, Wikidata. Restricted Application added a subscriber: Aklapper. Restricted Application added projects: Scoring-platform-team, artificial-intelligence. TASK DESCRIPTION @Groceryheist said that it was reported that ORES

[Wikidata-bugs] [Maniphest] [Updated] T225944: Explore using ShEx to support ORES in Wikidata

2019-06-18 Thread Halfak
Halfak edited projects, added Scoring-platform-team (Research); removed Scoring-platform-team. TASK DETAIL https://phabricator.wikimedia.org/T225944 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: WMDE-leszek, Lucas_Werkmeister_WMDE

[Wikidata-bugs] [Maniphest] [Commented On] T225944: Explore using ShEx to support ORES in Wikidata

2019-06-17 Thread Halfak
Halfak added a comment. My take-away is this: 1. There's no nice API that, given an item/edit, can tell us what constraints are violated. The current system processes the schemas in JavaScript. 2. There's no nice way to version the ShEx schemas. 3. It's still very early and people

[Wikidata-bugs] [Maniphest] [Updated] T225944: Explore using ShEx to support ORES in Wikidata

2019-06-17 Thread Halfak
Halfak added projects: ORES, Wikidata, Scoring-platform-team. TASK DETAIL https://phabricator.wikimedia.org/T225944 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Halfak, Aklapper, darthmon_wmde, Nandana, Lahi, Gq86, Vacio

[Wikidata-bugs] [Maniphest] [Updated] T225944: Explore using ShEx to support ORES in Wikidata

2019-06-17 Thread Halfak
Halfak edited projects, added Scoring-platform-team (Research); removed Scoring-platform-team. Restricted Application edited projects, added Scoring-platform-team; removed Scoring-platform-team (Research). TASK DETAIL https://phabricator.wikimedia.org/T225944 EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T224672: ORES/ChangesListHooksHandlerTest causing build failures in other repos (e.g. UploadWizard)

2019-06-04 Thread Halfak
Halfak added a subscriber: Ladsgroup. Halfak added a comment. I'm honestly not sure. It's likely that @Ladsgroup knows some history here. TASK DETAIL https://phabricator.wikimedia.org/T224672 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Updated] T223590: Display Wikidata item quality on Wikipedia article

2019-05-28 Thread Halfak
Halfak added a parent task: T223845: Generate quality score average and add to WikidataQuality gadget. TASK DETAIL https://phabricator.wikimedia.org/T223590 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Arybolab, Halfak Cc: Halfak, Aklapper

[Wikidata-bugs] [Maniphest] [Updated] T127467: Finding items on Wikidata that should be merged

2019-04-10 Thread Halfak
Halfak edited projects, added Scoring-platform-team (Research); removed Scoring-platform-team. TASK DETAIL https://phabricator.wikimedia.org/T127467 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Capankajsmilyo, mforns, Lahi

[Wikidata-bugs] [Maniphest] [Updated] T155560: Linked fact checker

2019-04-10 Thread Halfak
Halfak edited projects, added Scoring-platform-team (Research); removed Scoring-platform-team. TASK DETAIL https://phabricator.wikimedia.org/T155560 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Cirdan, Capankajsmilyo, PokestarFan

[Wikidata-bugs] [Maniphest] [Closed] T195703: quick overview of the quality of an item

2019-02-26 Thread Halfak
Halfak closed this task as "Resolved". Halfak edited projects, added Scoring-platform-team (Current); removed Scoring-platform-team. Halfak claimed this task. Halfak added a comment. Given the inactivity, we're going to mark this task as resolved. We'd be very happy to work

[Wikidata-bugs] [Maniphest] [Triaged] T194737: Improve features for wikibase vandalism detection model

2019-02-19 Thread Halfak
Halfak moved this task from Untriaged to New development on the Scoring-platform-team board. Halfak triaged this task as "Normal" priority. TASK DETAIL https://phabricator.wikimedia.org/T194737 WORKBOARD https://phabricator.wikimedia.org/project/board/1901/ EMAIL PREFER

[Wikidata-bugs] [Maniphest] [Retitled] T158430: Use suggested properties to get signal for completeness

2019-01-17 Thread Halfak
Halfak renamed this task from "[Spike] Use suggested properties to get signal for completeness" to "Use suggested properties to get signal for completeness". Halfak edited projects, added Scoring-platform-team (Current); removed Scoring-platform-team. TASK DETAIL https://phabr

[Wikidata-bugs] [Maniphest] [Commented On] T158430: [Spike] Use suggested properties to get signal for completeness

2018-11-13 Thread Halfak
Halfak added a comment. https://github.com/wikimedia/articlequality/blob/master/articlequality/feature_lists/wikidatawiki.py TASK DETAIL https://phabricator.wikimedia.org/T158430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: hoo, PokestarFan, daniel

[Wikidata-bugs] [Maniphest] [Commented On] T158430: [Spike] Use suggested properties to get signal for completeness

2018-10-19 Thread Halfak
Halfak added a comment. I just implemented https://github.com/wikimedia/revscoring/pull/414 which will give us a datasource on top of which to build the feature @hoo has been working on. TASK DETAIL https://phabricator.wikimedia.org/T158430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings

[Wikidata-bugs] [Maniphest] [Commented On] T199121: RFC: Spec for representing multiple content objects per revision (MCR) in XML dumps

2018-10-16 Thread Halfak
Halfak added a comment. For clarity, I was originally advocating that we didn't combine any hashes and that instead we provided a tag in each of the slots. I now see that we're going to make a mess in favor of backwards compatibility. So there will continue to be a tag and at the top

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T199121: RFC: Spec for representing multiple content objects per revision (MCR) in XML dumps

2018-10-12 Thread Halfak
Halfak added a subscriber: mako. Halfak added a comment. Aha! I don't think anyone is generally comparing the content text to a specific checksum for any reason (except some old studies by @mako to check to see if the checksums were consistent historically; they aren't). So, I'm a fan

[Wikidata-bugs] [Maniphest] [Commented On] T199121: RFC: Spec for representing multiple content objects per revision (MCR) in XML dumps

2018-10-10 Thread Halfak
Halfak added a comment. Re. the "revision hash", it seems that this has already been determined so I'm not sure what other insights I might give. But FWIW, the combined rev_sha1 seems very crazy :) If there is a combined rev_sha1 that is built by any strategy (crazy or not) from th

[Wikidata-bugs] [Maniphest] [Commented On] T199121: RFC: Spec for representing multiple content objects per revision (MCR) in XML dumps

2018-10-09 Thread Halfak
Halfak added a comment. where do we put the revision hash? What is the "revision hash"? TASK DETAIL https://phabricator.wikimedia.org/T199121 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: ArielGlenn, Halfak Cc: FaFlo, Halfak, vrandezo, Denny

[Wikidata-bugs] [Maniphest] [Commented On] T199121: RFC: Spec for representing multiple content objects per revision (MCR) in XML dumps

2018-10-09 Thread Halfak
Halfak added a comment. Oh. I guess my sense is to drop the existing in favor of the 's related to individual content slots. It doesn't make sense anymore, and we're breaking the schema anyway to add new content slots. TASK DETAIL https://phabricator.wikimedia.org/T199121 EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T199121: RFC: Spec for representing multiple content objects per revision (MCR) in XML dumps

2018-10-09 Thread Halfak
Halfak added a subscriber: FaFlo. Halfak added a comment. Seems to me that each slot should have its own sha1. There's a huge amount of research on non-article content in Wikipedia. I imagine that analysts will be interested in identity reverts (the most common type that are detectable with sha1
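To illustrate why a per-slot sha1 matters for analysts, here is a minimal sketch of identity-revert detection: a revision whose checksum matches an earlier revision's checksum restores that earlier content, and the revisions in between count as reverted. The function names and the `radius` window size are assumptions for illustration, not part of the proposed dump spec; with multiple content slots, one would presumably run this once per slot.

```python
import hashlib
from collections import deque


def sha1_hex(text):
    """Return the hex sha1 of a slot's text content."""
    return hashlib.sha1(text.encode("utf-8")).hexdigest()


def detect_identity_reverts(revisions, radius=15):
    """Yield (reverting_id, reverted_to_id, reverted_ids) tuples.

    `revisions` is an iterable of (rev_id, sha1) pairs in chronological
    order. `radius` (an illustrative default) bounds how far back to look.
    """
    recent = deque(maxlen=radius)
    for rev_id, checksum in revisions:
        window = list(recent)
        # Search from the newest previous revision back to the oldest.
        for i in range(len(window) - 1, -1, -1):
            if window[i][1] == checksum:
                reverted = [r for r, _ in window[i + 1:]]
                # An immediate duplicate has nothing in between: not a revert.
                if reverted:
                    yield rev_id, window[i][0], reverted
                break
        recent.append((rev_id, checksum))
```

A combined hash over all slots would only flag a revert when every slot returns to an earlier state at once, which is why per-slot checksums keep this kind of analysis possible.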

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T164994: Enable wbgetsuggestions API to get recommended properties even if they have existed in an item

2018-09-24 Thread Halfak
Halfak added a subscriber: Ladsgroup. Halfak added a comment. @Ladsgroup, could you take a look? TASK DETAIL https://phabricator.wikimedia.org/T164994 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: hoo, Halfak Cc: Ladsgroup, gerritbot, PokestarFan, Sjoerddebruin

[Wikidata-bugs] [Maniphest] [Commented On] T164994: Enable wbgetsuggestions API to get recommended properties even if they have existed in an item

2018-09-11 Thread Halfak
Halfak added a comment. @hoo, any updates? Seems like this task has been stagnant for a few weeks. TASK DETAIL https://phabricator.wikimedia.org/T164994 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: hoo, Halfak Cc: gerritbot, PokestarFan, Sjoerddebruin, daniel

[Wikidata-bugs] [Maniphest] [Commented On] T189962: Run analysis of revert time and number changes over time for wikidata

2018-08-01 Thread Halfak
Halfak added a comment. Oh! And to the point of reviewing this specific task, please limit your aggregate analysis to 12 months. This will help account for seasonality. E.g., December/January and September look weird and can appear twice in a 17-month sample. TASK DETAIL https
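One way to apply the 12-month limit is to keep only the trailing 12 calendar months, so each month of the year contributes exactly once to the aggregate. This is a sketch under the assumption that the data is a list of `(month_start, value)` pairs; the function names are hypothetical.

```python
from datetime import date


def month_index(d):
    """Map a date to a running month count so months can be subtracted."""
    return d.year * 12 + (d.month - 1)


def trailing_twelve_months(rows):
    """Keep rows that fall in the 12 calendar months ending at the latest row.

    `rows` is an iterable of (month_start: date, value) pairs.
    """
    rows = list(rows)
    if not rows:
        return []
    latest = max(month_index(d) for d, _ in rows)
    return [(d, v) for d, v in rows if latest - month_index(d) < 12]
```

With a 17-month sample, the five oldest months are dropped, so unusual months such as December or September no longer count twice.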

[Wikidata-bugs] [Maniphest] [Updated] T189962: Run analysis of revert time and number changes over time for wikidata

2018-08-01 Thread Halfak
Halfak added a comment. OK, I made an epic to cover other work we should do before we speak publicly about what we found. See T200898: Analyze the effects of ORES deployments on counter-vandalism behavior. I think this will make for a great follow-up paper to "When the Levee Breaks". But more

[Wikidata-bugs] [Maniphest] [Updated] T189962: Run analysis of revert time and number changes over time for wikidata

2018-08-01 Thread Halfak
Halfak added a parent task: T200898: Analyze the effects of ORES deployments on counter-vandalism behavior. TASK DETAIL https://phabricator.wikimedia.org/T189962 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc: Halfak, Aklapper, Ladsgroup

[Wikidata-bugs] [Maniphest] [Updated] T166427: Measure maturity/quality of Wikidata items in a way it can be queried

2018-07-30 Thread Halfak
Halfak added a comment. Great! I created T200716 to track that work. :) TASK DETAIL https://phabricator.wikimedia.org/T166427 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc: Lydia_Pintscher, gerritbot, Ladsgroup, Esc3300, PokestarFan

[Wikidata-bugs] [Maniphest] [Created] T200716: Include ORES predictions in RDF export

2018-07-30 Thread Halfak
Halfak created this task. Halfak triaged this task as "Normal" priority. Halfak added projects: Wikidata, ORES, Wikidata-Query-Service, Discovery, Scoring-platform-team (Current), User-Ladsgroup, Patch-For-Review. Restricted Application removed a project: Patch-For-Review. TASK DES

[Wikidata-bugs] [Maniphest] [Commented On] T189962: Run analysis of revert time and number changes over time for wikidata

2018-07-30 Thread Halfak
Halfak added a comment. Make a report! Start from https://meta.wikimedia.org/wiki/Research:New_project :) TASK DETAIL https://phabricator.wikimedia.org/T189962 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc: Halfak, Aklapper, Ladsgroup

[Wikidata-bugs] [Maniphest] [Commented On] T195703: quick overview of the quality of an item

2018-07-30 Thread Halfak
Halfak added a comment. F24333890: Screenshot from 2018-07-30 10-31-33.png Note the prediction right underneath the title. TASK DETAIL https://phabricator.wikimedia.org/T195703 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Jan_Dittrich, Aklapper

[Wikidata-bugs] [Maniphest] [Commented On] T166427: Measure maturity/quality of Wikidata items in a way it can be queried

2018-07-30 Thread Halfak
Halfak added a comment. It looks like the item quality model isn't loaded yet. See https://quarry.wmflabs.org/query/28639 Are there any data storage/scaling concerns with having a quality prediction for every single item in Wikidata? It looks like that would be about 51 million rows. Next Q

[Wikidata-bugs] [Maniphest] [Commented On] T164994: Enable wbgetsuggestions API to get recommended properties even if they have existed in an item

2018-07-30 Thread Halfak
Halfak added a comment. @hoo might take a look at this. Note that the discussion above covers quite a bit of the details necessary for implementing this :) TASK DETAIL https://phabricator.wikimedia.org/T164994 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Commented On] T189962: Run analysis of revert time and number changes over time for wikidata

2018-07-16 Thread Halfak
Halfak added a comment. This is great. Please graph the results, write a report, and give a description of the weirdness in Spanish's dump files. TASK DETAIL https://phabricator.wikimedia.org/T189962 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup

[Wikidata-bugs] [Maniphest] [Commented On] T195701: new ORES labeling campaign for Wikidata

2018-07-10 Thread Halfak
Halfak added a comment. Oohhh. Hmm. Yeah. I wonder if we can adjust for block reason. Or maybe let *is bot/was bot* take precedence. TASK DETAIL https://phabricator.wikimedia.org/T195701 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc

[Wikidata-bugs] [Maniphest] [Commented On] T195701: new ORES labeling campaign for Wikidata

2018-07-09 Thread Halfak
Halfak added a comment. Merged. Ready for loading into Wiki labels. TASK DETAIL https://phabricator.wikimedia.org/T195701 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc: Aklapper, Halfak, Ladsgroup, matej_suchanek, Lydia_Pintscher, Lahi

[Wikidata-bugs] [Maniphest] [Commented On] T195701: new ORES labeling campaign for Wikidata

2018-06-28 Thread Halfak
Halfak added a comment. I left some notes on the PR. I think it is more complicated than necessary. TASK DETAIL https://phabricator.wikimedia.org/T195701 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc: Halfak, Ladsgroup, matej_suchanek

[Wikidata-bugs] [Maniphest] [Commented On] T195701: new ORES labeling campaign for Wikidata

2018-06-25 Thread Halfak
Halfak added a comment. I think that should be the plan then. Query for a random sample of 500k. Then select *needs_review* from that set. TASK DETAIL https://phabricator.wikimedia.org/T195701 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc
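The two-step plan above (sample first, then filter) can be sketched as follows. The `needs_review` predicate is passed in because its exact definition is not given in this comment; the function name, `seed` parameter, and in-memory list are assumptions for illustration rather than the actual query.

```python
import random


def sample_then_filter(edits, sample_size, needs_review, seed=0):
    """Draw a uniform random sample, then keep the edits flagged for review.

    `needs_review` is a caller-supplied predicate (hypothetical here).
    """
    edits = list(edits)
    rng = random.Random(seed)  # fixed seed keeps the sample reproducible
    k = min(sample_size, len(edits))
    sample = rng.sample(edits, k)
    return [edit for edit in sample if needs_review(edit)]
```

Sampling before filtering keeps the proportion of flagged edits in the result representative of the full population, which filtering first would not.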

[Wikidata-bugs] [Maniphest] [Commented On] T195701: new ORES labeling campaign for Wikidata

2018-06-21 Thread Halfak
Halfak added a comment. How big of a sample do you think we would need in order to get enough "needs_review" samples? TASK DETAIL https://phabricator.wikimedia.org/T195701 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc: Halfak,

[Wikidata-bugs] [Maniphest] [Commented On] T195701: new ORES labeling campaign for Wikidata

2018-06-21 Thread Halfak
Halfak added a comment. We don't actually count all edits by people with 1000+ edits as good. We check whether each edit was reverted and, if it was, it is included in the needs_review dataset. TASK DETAIL https://phabricator.wikimedia.org/T195701 EMAIL PREFERENCES https
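A rough sketch of the rule described above: an edit from an experienced editor is only treated as good if it was not reverted. The field names (`user_edit_count`, `reverted`) are hypothetical, and treating edits from editors below the threshold as needing review is my assumption about the other half of the rule, not something this comment states.

```python
def needs_review(edit, trusted_edit_count=1000):
    """Return True if an edit should go to human review.

    `edit` is a dict with hypothetical keys "user_edit_count" and "reverted".
    Assumption: edits by editors under the threshold always need review.
    """
    trusted = edit["user_edit_count"] >= trusted_edit_count
    # A reverted edit needs review even when its author passes the threshold.
    return edit["reverted"] or not trusted
```

So `needs_review({"user_edit_count": 5000, "reverted": True})` is `True`, while the same editor's unreverted edit is skipped.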

[Wikidata-bugs] [Maniphest] [Commented On] T195701: new ORES labeling campaign for Wikidata

2018-06-20 Thread Halfak
Halfak added a comment. What's the purpose of the editcount restriction? TASK DETAIL https://phabricator.wikimedia.org/T195701 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc: Halfak, Ladsgroup, matej_suchanek, Aklapper, Lydia_Pintscher, Lahi

[Wikidata-bugs] [Maniphest] [Commented On] T195703: quick overview of the quality of an item

2018-06-20 Thread Halfak
Halfak added a comment. @Lydia_Pintscher, do you want some design work on this before we resolve or should we consider this done for now? TASK DETAIL https://phabricator.wikimedia.org/T195703 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Ladsgroup

[Wikidata-bugs] [Maniphest] [Updated] T184322: Enable fine grained lua tracking gradually in client wikis

2018-06-15 Thread Halfak
Halfak removed a project: Security. Halfak changed the visibility from "Custom Policy" to "Public (No Login Required)". TASK DETAIL https://phabricator.wikimedia.org/T184322 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Akl

[Wikidata-bugs] [Maniphest] [Commented On] T195703: quick overview of the quality of an item

2018-05-29 Thread Halfak
Halfak added a comment. https://wikidata.org/wiki/User:EpochFail/ItemQuality.js Enable via so that it only runs on Items: https://www.wikidata.org/wiki/User:EpochFail/ItemQuality-loader.js TASK DETAIL https://phabricator.wikimedia.org/T195703 EMAIL PREFERENCES https://phabricator.wikimedia.org

[Wikidata-bugs] [Maniphest] [Assigned] T194653: Ban clients of WDQS which don't follow throttling directives for some time

2018-05-18 Thread Halfak
Halfak assigned this task to Gehel. TASK DETAIL https://phabricator.wikimedia.org/T194653 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Gehel, Halfak Cc: gerritbot, Cirdan, Smalyshev, Gehel, Aklapper, Versusxo, Majesticalreaper22, Giuliamocci, Davinaclare77

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T194748: Research: How are bot edits different than human edits on Wikidata?

2018-05-16 Thread Halfak
Halfak added a subscriber: Hall1467. Halfak added a comment. @Hall1467 and I have been working on measuring how humans and bots drive the quality of Wikidata items differently. We're specifically looking at alignment between demand (views of articles that are sitelinked from Wikidata) and production

[Wikidata-bugs] [Maniphest] [Commented On] T189962: Run analysis of revert time and number changes over time for wikidata

2018-04-09 Thread Halfak
Halfak added a comment. We discussed getting the plots cleaned up and adding English/Spanish Wikipedia at our sync meeting. TASK DETAIL https://phabricator.wikimedia.org/T189962 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc: Halfak

[Wikidata-bugs] [Maniphest] [Commented On] T189962: Run analysis of revert time and number changes over time for wikidata

2018-03-28 Thread Halfak
Halfak added a comment. I think we need a plot of this data. I'd also suggest using the geometric mean for looking at time-to-revert. E.g. `geometric_mean = function(x){ exp(mean(log(x))) }` In Python, I'd do: >>> from statistics import mean >>> from math import log, exp >
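The Python snippet in the comment is cut off, so the following is a self-contained sketch of the same idea, mirroring the R one-liner `exp(mean(log(x)))` with the two imports the comment names. The input validation and the sample values are my additions for illustration; on Python 3.8+, `statistics.geometric_mean` offers the same calculation.

```python
from math import exp, log
from statistics import mean


def geometric_mean(values):
    """Return exp(mean(log(x))) over strictly positive values."""
    values = list(values)
    if not values:
        raise ValueError("geometric mean requires at least one value")
    if any(v <= 0 for v in values):
        raise ValueError("geometric mean is only defined for positive values")
    return exp(mean([log(v) for v in values]))


# Hypothetical time-to-revert samples in seconds: one day-long outlier
# dominates the arithmetic mean but has far less pull on the geometric mean.
seconds_to_revert = [30, 60, 120, 86400]
typical = geometric_mean(seconds_to_revert)
```

Time-to-revert is heavily right-skewed, which is why averaging in log space gives a more representative "typical" value than the arithmetic mean.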

[Wikidata-bugs] [Maniphest] [Unblock] T76230: [Epic] data quality and trust

2018-02-23 Thread Halfak
Halfak closed subtask T127470: Deploy item quality classification model for Wikidata as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T76230 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Herzi.Pinki, SandraF_WMF, Tbayer, Aschr

[Wikidata-bugs] [Maniphest] [Closed] T127470: Deploy item quality classification model for Wikidata

2018-02-23 Thread Halfak
Halfak edited projects, added Scoring-platform-team (Current); removed Scoring-platform-team. Halfak closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T127470 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Glorian_WD

[Wikidata-bugs] [Maniphest] [Unblock] T90870: selfcontained projects around Wikidata (tracking)

2018-02-23 Thread Halfak
Halfak closed subtask T127470: Deploy item quality classification model for Wikidata as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T90870 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: MichaelSchoenitzer_WMDE,

[Wikidata-bugs] [Maniphest] [Commented On] T180450: ORES thresholds for Wikidata is too strict

2017-11-25 Thread Halfak
Halfak added a comment. Merged! Please make an update to our deploy config. TASK DETAIL https://phabricator.wikimedia.org/T180450 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Halfak Cc: Halfak, Jklamo, Sjoerddebruin, Stashbot, gerritbot, Aklapper

[Wikidata-bugs] [Maniphest] [Commented On] T176312: Don’t check format constraint via SPARQL (safely evaluating user-provided regular expressions)

2017-10-03 Thread Halfak
Halfak added a comment. I'm not sure we'd make use of an external service. In our case, I think a more robust timeout and some testing gets us what we need. TASK DETAIL https://phabricator.wikimedia.org/T176312 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Commented On] T151717: Usage tracking: record which statement group is used

2017-09-11 Thread Halfak
Halfak added a comment. I think the idea is that we'll be able to include wbc_entity_usage to increase granularity in watchlists once this is solved. It will require some new work though :) Happy to see @Bawolff excited about this functionality. TASK DETAIL https://phabricator.wikimedia.org

[Wikidata-bugs] [Maniphest] [Updated] T127467: Finding items that should be merged

2017-09-06 Thread Halfak
Halfak edited projects, added Scoring-platform-team-Backlog; removed Scoring-platform-team-Backlog (Current). TASK DETAIL https://phabricator.wikimedia.org/T127467 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Esc3300, Liuxinyu970226, matej_suchanek

[Wikidata-bugs] [Maniphest] [Updated] T158025: Support ORES for propagated Wikidata edits

2017-09-06 Thread Halfak
Halfak edited projects, added Scoring-platform-team-Backlog; removed Scoring-platform-team-Backlog (Current). TASK DETAIL https://phabricator.wikimedia.org/T158025 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Catrope, jmatazzoni, Aklapper

[Wikidata-bugs] [Maniphest] [Updated] T166427: Measure maturity/quality of Wikidata items in a way it can be queried

2017-09-06 Thread Halfak
Halfak edited projects, added Scoring-platform-team-Backlog; removed Scoring-platform-team-Backlog (Current). TASK DETAIL https://phabricator.wikimedia.org/T166427 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Esc3300, PokestarFan, Smalyshev, Halfak

[Wikidata-bugs] [Maniphest] [Updated] T171953: ORES and Enhanced Recent Changes: Backgound color stucks after patrolling

2017-09-06 Thread Halfak
Halfak edited projects, added Scoring-platform-team-Backlog; removed Scoring-platform-team-Backlog (Current). TASK DETAIL https://phabricator.wikimedia.org/T171953 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper, PokestarFan, Raymond

[Wikidata-bugs] [Maniphest] [Updated] T171953: ORES and Enhanced Recent Changes: Backgound color stucks after patrolling

2017-08-16 Thread Halfak
Halfak added a project: Collaboration-Team-Triage. TASK DETAIL https://phabricator.wikimedia.org/T171953 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper, PokestarFan, Raymond, bkowshik, GoranSMilovanovic, QZanden, Avner, Mkdw, Izno, Wikidata

[Wikidata-bugs] [Maniphest] [Updated] T171953: ORES and Enhanced Recent Changes: Backgound color stucks after patrolling

2017-08-16 Thread Halfak
Halfak edited projects, added Scoring-platform-team; removed Scoring-platform-team-Backlog. TASK DETAIL https://phabricator.wikimedia.org/T171953 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper, PokestarFan, Raymond, bkowshik

[Wikidata-bugs] [Maniphest] [Updated] T151717: Usage tracking: record which statement group is used

2017-08-11 Thread Halfak
Halfak added a subtask: T172905: Lua: Make it possible to load only specific Statements using mw.wikibase. TASK DETAIL https://phabricator.wikimedia.org/T151717 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Ottomata, PokestarFan, Ladsgroup, Stashbot

[Wikidata-bugs] [Maniphest] [Updated] T172905: Lua: Make it possible to load only specific Statements using mw.wikibase

2017-08-11 Thread Halfak
Halfak added a parent task: T151717: Usage tracking: record which statement group is used. TASK DETAIL https://phabricator.wikimedia.org/T172905 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: eranroz, Halfak Cc: gerritbot, Aklapper, daniel, Lydia_Pintscher

[Wikidata-bugs] [Maniphest] [Updated] T151717: Usage tracking: record which statement group is used

2017-08-11 Thread Halfak
Halfak added a subtask: T172914: mw.wikibase.entity: Use __index to lazy register entity usages. TASK DETAIL https://phabricator.wikimedia.org/T151717 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Ottomata, PokestarFan, Ladsgroup, Stashbot

[Wikidata-bugs] [Maniphest] [Updated] T172914: mw.wikibase.entity: Use __index to lazy register entity usages

2017-08-11 Thread Halfak
Halfak added a parent task: T151717: Usage tracking: record which statement group is used. TASK DETAIL https://phabricator.wikimedia.org/T172914 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper, daniel, Lydia_Pintscher, aude, Liuxinyu970226

[Wikidata-bugs] [Maniphest] [Updated] T173032: Replace all current "X" usage with nuanced usage information (label/description/sitelink) via Lua table

2017-08-11 Thread Halfak
Halfak added a parent task: T173124: Improve wikidata usage tracking via Lua. TASK DETAIL https://phabricator.wikimedia.org/T173032 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: hoo, Doc_James, PokestarFan, eranroz, Izno, Oliv0, Ltrlg, Scott_WUaS

[Wikidata-bugs] [Maniphest] [Updated] T173124: Improve wikidata usage tracking via Lua

2017-08-11 Thread Halfak
Halfak added a subtask: T151717: Usage tracking: record which statement group is used. TASK DETAIL https://phabricator.wikimedia.org/T173124 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper, Halfak, PokestarFan, GoranSMilovanovic, QZanden

[Wikidata-bugs] [Maniphest] [Updated] T173124: Improve wikidata usage tracking via Lua

2017-08-11 Thread Halfak
Halfak added a subtask: T173032: Replace all current "X" usage with nuanced usage information (label/description/sitelink) via Lua table. TASK DETAIL https://phabricator.wikimedia.org/T173124 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc

[Wikidata-bugs] [Maniphest] [Updated] T151717: Usage tracking: record which statement group is used

2017-08-11 Thread Halfak
Halfak added a parent task: T173124: Improve wikidata usage tracking via Lua. TASK DETAIL https://phabricator.wikimedia.org/T151717 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Ottomata, PokestarFan, Ladsgroup, Stashbot, gerritbot, Halfak, jcrespo

[Wikidata-bugs] [Maniphest] [Created] T173124: Improve wikidata usage tracking via Lua

2017-08-11 Thread Halfak
Halfak created this task. Halfak added projects: Wikidata, MediaWiki-extensions-WikibaseClient. Herald added subscribers: PokestarFan, Aklapper. TASK DESCRIPTION Currently accessing Wikidata information via Lua results in writing a big confusing "X" in the wbc_entity_usage table. We shou

[Wikidata-bugs] [Maniphest] [Updated] T173123: Implement nuanced usage tracking for Wikidata

2017-08-11 Thread Halfak
Halfak added a subtask: T138409: [Task] Terms usage tracking: Only track parts of the language fallback chain that were actually used. TASK DETAIL https://phabricator.wikimedia.org/T173123 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper

[Wikidata-bugs] [Maniphest] [Updated] T138409: [Task] Terms usage tracking: Only track parts of the language fallback chain that were actually used

2017-08-11 Thread Halfak
Halfak added a parent task: T173123: Implement nuanced usage tracking for Wikidata. TASK DETAIL https://phabricator.wikimedia.org/T138409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: hoo, Halfak Cc: gerritbot, PokestarFan, Aklapper, daniel, aude

[Wikidata-bugs] [Maniphest] [Updated] T110528: [Task] Use JOIN to find changes relevant for a given wiki

2017-08-11 Thread Halfak
Halfak added a subtask: T173123: Implement nuanced usage tracking for Wikidata. TASK DETAIL https://phabricator.wikimedia.org/T110528 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Ladsgroup, PokestarFan, Lydia_Pintscher, hoo, Ricordisamoa

[Wikidata-bugs] [Maniphest] [Updated] T90436: Improve usage tracking granularity to avoid irrelevant changes showing in the watchlist.

2017-08-11 Thread Halfak
Halfak added a subtask: T173123: Implement nuanced usage tracking for Wikidata. TASK DETAIL https://phabricator.wikimedia.org/T90436 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Doc_James, hoo, PokestarFan, IKhitron, eranroz, Izno, Oliv0, Ltrlg

[Wikidata-bugs] [Maniphest] [Updated] T173123: Implement nuanced usage tracking for Wikidata

2017-08-11 Thread Halfak
Halfak added parent tasks: T90436: Improve usage tracking granularity to avoid irrelevant changes showing in the watchlist., T173121: Rank wikidata changes for patrolling by usage, T110528: [Task] Use JOIN to find changes relevant for a given wiki. TASK DETAIL https://phabricator.wikimedia.org

[Wikidata-bugs] [Maniphest] [Updated] T173121: Rank wikidata changes for patrolling by usage

2017-08-11 Thread Halfak
Halfak added a subtask: T173123: Implement nuanced usage tracking for Wikidata. TASK DETAIL https://phabricator.wikimedia.org/T173121 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: hoo, eranroz, PokestarFan, Aklapper, Halfak, GoranSMilovanovic

[Wikidata-bugs] [Maniphest] [Created] T173123: Implement nuanced usage tracking for Wikidata

2017-08-11 Thread Halfak
Halfak created this task. Halfak added projects: Wikidata, MediaWiki-extensions-WikibaseClient. Herald added subscribers: PokestarFan, Aklapper. TASK DESCRIPTION Currently Wikidata usage is very coarse and it is not tracked effectively when Lua templates are used. This task is a container

[Wikidata-bugs] [Maniphest] [Updated] T173121: Rank wikidata changes for patrolling by usage

2017-08-11 Thread Halfak
Halfak added a project: MediaWiki-extensions-WikibaseClient. TASK DETAIL https://phabricator.wikimedia.org/T173121 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: hoo, eranroz, PokestarFan, Aklapper, Halfak, GoranSMilovanovic, QZanden, Izno, Wikidata

[Wikidata-bugs] [Maniphest] [Updated] T173119: Implement more nuanced patrolling relevant changes in Wikidata based on usage

2017-08-11 Thread Halfak
Halfak added a project: MediaWiki-extensions-WikibaseClient. TASK DETAIL https://phabricator.wikimedia.org/T173119 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper, PokestarFan, eranroz, hoo, Halfak, GoranSMilovanovic, QZanden, Izno, Wikidata

[Wikidata-bugs] [Maniphest] [Edited] T173119: Implement more nuanced patrolling relevant changes in Wikidata based on usage

2017-08-11 Thread Halfak
Halfak updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION Once we have (LINKME: more nuanced usage tracking), we should be able to use it to direct patrollers' efforts more effectively. E.g. we can make sure that only changes that affect the rendering of a page show up

[Wikidata-bugs] [Maniphest] [Created] T173121: Rank wikidata changes for patrolling by usage

2017-08-11 Thread Halfak
Halfak created this task. Halfak added projects: Wikidata, MediaWiki-Watchlist. TASK DESCRIPTION Patrollers on client wikis will likely want to prioritize patrolling of changes to Wikidata entities that are most relevant to their client wiki (e.g. used a lot). Similarly, Wikidata patrollers

[Wikidata-bugs] [Maniphest] [Block] T173119: Implement more nuanced patrolling relevant changes in Wikidata based on usage

2017-08-11 Thread Halfak
Halfak created subtask T173121: Rank wikidata changes for patrolling by usage. TASK DETAIL https://phabricator.wikimedia.org/T173119 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper, PokestarFan, eranroz, hoo, Halfak, GoranSMilovanovic

[Wikidata-bugs] [Maniphest] [Updated] T173119: Implement more nuanced patrolling relevant changes in Wikidata based on usage

2017-08-11 Thread Halfak
Halfak added a subtask: T90436: Improve usage tracking granularity to avoid irrelevant changes showing in the watchlist.. TASK DETAIL https://phabricator.wikimedia.org/T173119 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper, PokestarFan

[Wikidata-bugs] [Maniphest] [Updated] T90436: Improve usage tracking granularity to avoid irrelevant changes showing in the watchlist.

2017-08-11 Thread Halfak
Halfak added a parent task: T173119: Implement more nuanced patrolling relevant changes in Wikidata based on usage. TASK DETAIL https://phabricator.wikimedia.org/T90436 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Doc_James, hoo, PokestarFan

[Wikidata-bugs] [Maniphest] [Updated] T173119: Implement more nuanced patrolling relevant changes in Wikidata based on usage

2017-08-11 Thread Halfak
Halfak added a parent task: T108944: [Epic] Improve change dispatching. TASK DETAIL https://phabricator.wikimedia.org/T173119 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: Aklapper, PokestarFan, eranroz, hoo, Halfak, GoranSMilovanovic, QZanden, Izno

[Wikidata-bugs] [Maniphest] [Updated] T108944: [Epic] Improve change dispatching

2017-08-11 Thread Halfak
Halfak added a subtask: T173119: Implement more nuanced patrolling relevant changes in Wikidata based on usage. TASK DETAIL https://phabricator.wikimedia.org/T108944 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc: PokestarFan, Sjoerddebruin, hoo

[Wikidata-bugs] [Maniphest] [Created] T173119: Implement more nuanced patrolling relevant changes in Wikidata based on usage

2017-08-11 Thread Halfak
Halfak created this task. Halfak added projects: Wikidata, MediaWiki-Watchlist. Herald added subscribers: PokestarFan, Aklapper. TASK DESCRIPTION Once we have (LINKME: more nuanced usage tracking), we should be able to use it to direct patrollers' efforts more effectively. E.g. we can make sure

[Wikidata-bugs] [Maniphest] [Commented On] T151717: Usage tracking: record which statement group is used

2017-08-11 Thread Halfak
Halfak added a comment. One more thing. Part of our test deployment to elwiki as described in this task is to help get a sense for the rate that these usage rows will be written to MariaDB, so we're likely to have a better answer for you soon. TASK DETAIL https://phabricator.wikimedia.org

[Wikidata-bugs] [Maniphest] [Commented On] T151717: Usage tracking: record which statement group is used

2017-08-11 Thread Halfak
Halfak added a comment. Hard to say. N for each page rendering where N is the number of wikidata "usages" on the page. A lua template or parser function can use labels, descriptions, statements, sitelinks, or ... other... ??? TASK DETAIL https://phabricator.wikimedia.org/T1

[Wikidata-bugs] [Maniphest] [Block] T172914: mw.wikibase.entity: Use __index to lazy register entity usages

2017-08-10 Thread Halfak
Halfak created subtask T173032: Replace all current "X" usage with nuanced usage information (label/description/sitelink) via Lua table. TASK DETAIL https://phabricator.wikimedia.org/T172914 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Halfak Cc

[Wikidata-bugs] [Maniphest] [Created] T173032: Replace all current "X" usage with nuanced usage information (label/description/sitelink) via Lua table

2017-08-10 Thread Halfak
Halfak created this task. Halfak added projects: MediaWiki-extensions-WikibaseClient, Wikidata. TASK DESCRIPTION We're already working on tracking statement usage within Lua tables (T151717). Let's extend that to all the other fun things you can find in a Wikidata entity. E.g. labels, descriptions
