GoranSMilovanovic added a comment.

  @Lydia_Pintscher
  
  From what we now have on https://wd-ref-island.toolforge.org/stats.php 
(thanks @ItamarWMDE):
  
  - three .csv files are delivered here (see Files below):
    - `datatype_ratio.csv` - per datatype statistics (aggregated),
    - `property_ratio.csv` - per property statistics (aggregated), and finally
    - `item_property_value_ratio.csv` - statistics for each `Item` x `Property` x 
`Extracted Value` combination.
  
  Columns:
  
  - `accepted` - number of users who have accepted the suggested value;
  - `rejected` - number of users who have rejected the suggested value;
  - `ratio` - the ratio of `accepted` to `rejected`;
  - `percent_accepted` - % of users who have accepted the suggested value;
  - `total_decisions` - total number of users who have assessed the suggested 
value (i.e. `accepted` + `rejected`);
  - `is_accepted` (only in `item_property_value_ratio.csv`): see 
T253552#6227594 <https://phabricator.wikimedia.org/T253552#6227594>; the 
suggested value is accepted, for a given property and a given item, if (1) 
at least 5 `total_decisions` were made, and (2) `percent_accepted` is >= 
95%.
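
  As an illustration of how the columns above relate to each other, here is a 
minimal Python sketch. It is not the code that produced the files: the class 
name `DecisionStats` and the two constants are hypothetical, and returning 
`None` for `ratio` when nobody rejected is an assumption (how the delivered 
files encode that case is not stated here).

```python
from dataclasses import dataclass
from typing import Optional

# Thresholds taken from the acceptance rule described above.
MIN_DECISIONS = 5
MIN_PERCENT_ACCEPTED = 95.0


@dataclass
class DecisionStats:
    """Hypothetical container for one row's `accepted` and `rejected` counts."""
    accepted: int
    rejected: int

    @property
    def total_decisions(self) -> int:
        # accepted + rejected
        return self.accepted + self.rejected

    @property
    def ratio(self) -> Optional[float]:
        # Ratio of accepted to rejected; assumption: None when rejected == 0.
        return self.accepted / self.rejected if self.rejected else None

    @property
    def percent_accepted(self) -> Optional[float]:
        # Share of accepting users, on a 0-100 scale; None with no decisions.
        total = self.total_decisions
        return 100.0 * self.accepted / total if total else None

    @property
    def is_accepted(self) -> bool:
        # Both criteria must hold: (1) >= 5 decisions, (2) >= 95% accepted.
        pct = self.percent_accepted
        return (
            self.total_decisions >= MIN_DECISIONS
            and pct is not None
            and pct >= MIN_PERCENT_ACCEPTED
        )
```

  For example, 19 accepts and 1 reject pass both criteria, while 4 accepts 
and 0 rejects fail criterion (1) even though everyone accepted.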
  
  Notes:
  
  - only complete observations were analyzed (e.g. if a datatype could not be 
parsed, the observation was discarded; this happened only once in these datasets);
  - still no data on any external identifiers;
  - all datasets are sorted in decreasing order of the number of 
`total_decisions` made;
  - 1162 item x property x extracted value combinations were observed in 
`item_property_value_ratio.csv`;
  - only 26 of these combinations have received >= 5 decisions;
  - only 5 of these combinations satisfy both criteria (1, 2) to be accepted.
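
  The three counts in the notes above could be re-derived from 
`item_property_value_ratio.csv` along the following lines. This is only a 
sketch: the function name `summarize` is hypothetical, and it assumes the file 
has a header row with integer `accepted` and `rejected` columns named exactly 
as in the column list. The rows in the demo are made up and are not taken 
from the delivered files.

```python
import csv
import io
from typing import Dict, TextIO


def summarize(csv_file: TextIO,
              min_decisions: int = 5,
              min_percent: float = 95.0) -> Dict[str, int]:
    """Count all rows, rows with enough decisions, and rows meeting both criteria."""
    summary = {"combinations": 0, "enough_decisions": 0, "accepted": 0}
    for row in csv.DictReader(csv_file):
        accepted = int(row["accepted"])
        rejected = int(row["rejected"])
        total = accepted + rejected
        summary["combinations"] += 1
        if total >= min_decisions:            # criterion (1)
            summary["enough_decisions"] += 1
            if 100.0 * accepted / total >= min_percent:  # criterion (2)
                summary["accepted"] += 1
    return summary


if __name__ == "__main__":
    # Made-up sample rows, for illustration only.
    sample = io.StringIO("accepted,rejected\n10,0\n3,0\n5,5\n")
    print(summarize(sample))
```

  With the real file, one would pass `open("item_property_value_ratio.csv", 
newline="")` instead of the in-memory sample.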
  
  Files:
  
  F31942426: item_property_value_ratio.csv 
<https://phabricator.wikimedia.org/F31942426>
  
  F31942425: property_ratio.csv <https://phabricator.wikimedia.org/F31942425>
  
  F31942424: datatype_ratio.csv <https://phabricator.wikimedia.org/F31942424>

TASK DETAIL
  https://phabricator.wikimedia.org/T253552

To: GoranSMilovanovic
Cc: Ladsgroup, darthmon_wmde, ItamarWMDE, Tarrow, Aklapper, Lydia_Pintscher, 
Ferdinand0101, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331