GoranSMilovanovic added a comment.

  Here is a new version of the report with the Grading Scheme 
<https://www.wikidata.org/wiki/Wikidata:Item_quality#Grading_scheme> for 
Wikidata items included:
  
  F30434504: Wikidata Quality Report.nb.html 
<https://phabricator.wikimedia.org/F30434504>
  
  @abian Thanks!
  
  > what dimensions of data quality (completeness, accuracy, consistency...) 
are you guys considering when you speak of "quality" in this scope?
  
  The Grading Scheme 
<https://www.wikidata.org/wiki/Wikidata:Item_quality#Grading_scheme> explains 
the criteria used.
  
  > The term "quality" is a buzzword used by people to name things that 
sometimes have no relationship to each other
  
  I agree up to .7 with you (I am a Bayesian, so consider `.7` to be a 
subjective measure of degree of belief)
  
  > so I'm not sure what it means here in practical terms,
  
  In practical terms, and in the scope of this Report, it signifies exactly 
what the Grading Scheme 
<https://www.wikidata.org/wiki/Wikidata:Item_quality#Grading_scheme> defines as 
item quality. I know that this answer provides a pure, maybe not so useful 
operational definition <https://en.wikipedia.org/wiki/Operational_definition> 
(if not even a pure ostensive definition 
<https://en.wikipedia.org/wiki/Ostensive_definition>). However, faced with the 
fact that data quality is a concept of immense complexity, a subject of immense 
discussions, and on the other hand faced with a need to start reporting on 
Wikidata item quality, this is the best answer that we can provide right now.
  
  I image that the quality assessment system will be re-designed one day 
following tons of philosophical, methodological, and (hopefully) practical 
discussions. Until then, this Report is what we have.
  
  > I don't know what factors are included in the equation (and which are 
excluded and should be measured separately)
  
  This is a question for the ORES team: I am sure that @Halfak can provide 
additional information in that respect. I have only a conceptual understanding 
of ORES (i.e. what ML approach does it take), but the details of its feature 
engineering (and your question @abian seems to be pointing right there) are 
beyond my knowledge.
  
  @abian In WikidataCon 2019 we will have a Data quality panel 
<https://www.wikidata.org/wiki/Wikidata:WikidataCon_2019/Program/Sessions/Data_quality_panel>,
 as well as a Data quality meetup 
<https://www.wikidata.org/wiki/Wikidata:WikidataCon_2019/Program/Sessions/Data_quality_meetup>.
 I also hope to learn more about the possible ways of Wikidata quality 
assessment there. See you in Berlin this October maybe?

TASK DETAIL
  https://phabricator.wikimedia.org/T195702

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic
Cc: abian, WMDE-leszek, darthmon_wmde, Ladsgroup, elal, Halfak, RazShuty, hoo, 
Aklapper, Esc3300, Lydia_Pintscher, DannyS712, Nandana, Lahi, Gq86, Xinbenlv, 
Vacio, GoranSMilovanovic, Fz-29, QZanden, LawExplorer, _jensen, rosalieper, 
Mkdw, notconfusing, srodlund, Wikidata-bugs, aude, Alchimista, Mbch331, Rxy
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to