GoranSMilovanovic added a comment.
Here is a new version of the report, with the Grading Scheme <https://www.wikidata.org/wiki/Wikidata:Item_quality#Grading_scheme> for Wikidata items included: F30434504: Wikidata Quality Report.nb.html <https://phabricator.wikimedia.org/F30434504>

@abian Thanks!

> what dimensions of data quality (completeness, accuracy, consistency...) are you guys considering when you speak of "quality" in this scope?

The Grading Scheme <https://www.wikidata.org/wiki/Wikidata:Item_quality#Grading_scheme> explains the criteria used.

> The term "quality" is a buzzword used by people to name things that sometimes have no relationship to each other

I agree with you up to .7 (I am a Bayesian, so consider `.7` to be a subjective measure of degree of belief).

> so I'm not sure what it means here in practical terms,

In practical terms, and in the scope of this Report, it signifies exactly what the Grading Scheme <https://www.wikidata.org/wiki/Wikidata:Item_quality#Grading_scheme> defines as item quality. I know that this answer provides a pure, maybe not so useful, operational definition <https://en.wikipedia.org/wiki/Operational_definition> (if not even a pure ostensive definition <https://en.wikipedia.org/wiki/Ostensive_definition>). However, data quality is a concept of immense complexity and a subject of immense discussion, and we still need to start reporting on Wikidata item quality, so this is the best answer that we can provide right now. I imagine that the quality assessment system will be redesigned one day, following tons of philosophical, methodological, and (hopefully) practical discussions. Until then, this Report is what we have.

> I don't know what factors are included in the equation (and which are excluded and should be measured separately)

This is a question for the ORES team: I am sure that @Halfak can provide additional information in that respect. I have only a conceptual understanding of ORES (i.e. what ML approach it takes), but the details of its feature engineering (and your question, @abian, seems to be pointing right there) are beyond my knowledge.

@abian At WikidataCon 2019 we will have a Data quality panel <https://www.wikidata.org/wiki/Wikidata:WikidataCon_2019/Program/Sessions/Data_quality_panel>, as well as a Data quality meetup <https://www.wikidata.org/wiki/Wikidata:WikidataCon_2019/Program/Sessions/Data_quality_meetup>. I also hope to learn more there about possible approaches to Wikidata quality assessment. See you in Berlin this October, maybe?

TASK DETAIL
  https://phabricator.wikimedia.org/T195702

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic
Cc: abian, WMDE-leszek, darthmon_wmde, Ladsgroup, elal, Halfak, RazShuty, hoo, Aklapper, Esc3300, Lydia_Pintscher, DannyS712, Nandana, Lahi, Gq86, Xinbenlv, Vacio, GoranSMilovanovic, Fz-29, QZanden, LawExplorer, _jensen, rosalieper, Mkdw, notconfusing, srodlund, Wikidata-bugs, aude, Alchimista, Mbch331, Rxy