GoranSMilovanovic added a comment.
@Lydia_Pintscher From what we now have on https://wd-ref-island.toolforge.org/stats.php (thanks @ItamarWMDE):

- three .csv files are delivered here (see below):
  - `datatype_ratio.csv` - per datatype statistics (aggregated),
  - `property_ratio.csv` - per property statistics (aggregated), and finally
  - `item_property_value_ratio.csv` - statistics for each `Item` x `Property` x `Extracted Value` combination.

Columns:

- `accepted` - number of users who have accepted the suggested value;
- `rejected` - number of users who have rejected the suggested value;
- `ratio` - the ratio of `accepted` to `rejected`;
- `percent_accepted` - % of users who have accepted the suggested value;
- `total_decisions` - total number of users who have assessed the suggested value (i.e. num.accepted + num.rejected observations);
- `is_accepted` (only in `item_property_value_ratio.csv`): see T253552#6227594 <https://phabricator.wikimedia.org/T253552#6227594>, we decide to accept the suggested value, for a given property and a given item, if (1) there were at least 5 `total_decisions` made, and (2) `percent_accepted` is >= 95%.

Notes:

- only complete observations were analyzed (e.g. if a datatype was not parsed, the observation was discounted; it happens only once in these datasets);
- still no data on any external identifiers;
- all datasets are sorted in a decreasing order of the number of `total_decisions` made;
- 1162 item x property x extracted value combinations were observed in `item_property_value_ratio.csv`;
- only 26 of these combinations have received >= 5 decisions;
- only 5 of these combinations satisfy both criteria (1, 2) to be accepted.
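For reference, the column definitions and the acceptance rule described above can be sketched roughly as follows. This is only an illustration, not the code that produced the .csv files: the function name `summarize` and the constant names are hypothetical, `percent_accepted` is assumed to be on a 0-100 scale, and returning `None` for `ratio` when there are no rejections is an assumption about how division by zero might be handled.

```python
from typing import Optional

# Thresholds taken from the criteria described in the comment.
MIN_DECISIONS = 5             # criterion (1): at least 5 total_decisions
MIN_PERCENT_ACCEPTED = 95.0   # criterion (2): percent_accepted >= 95%


def summarize(accepted: int, rejected: int) -> dict:
    """Derive the per-row columns from raw accept/reject counts (hypothetical helper)."""
    total_decisions = accepted + rejected

    # Assumption: ratio is undefined (None) when nobody rejected the value.
    ratio: Optional[float] = accepted / rejected if rejected else None

    # Assumption: percent_accepted is expressed on a 0-100 scale.
    percent_accepted: Optional[float] = (
        100.0 * accepted / total_decisions if total_decisions else None
    )

    # A suggested value is accepted only if both criteria hold.
    is_accepted = (
        total_decisions >= MIN_DECISIONS
        and percent_accepted is not None
        and percent_accepted >= MIN_PERCENT_ACCEPTED
    )

    return {
        "accepted": accepted,
        "rejected": rejected,
        "ratio": ratio,
        "percent_accepted": percent_accepted,
        "total_decisions": total_decisions,
        "is_accepted": is_accepted,
    }
```

For example, `summarize(19, 1)` gives 20 `total_decisions` and a `percent_accepted` of 95.0, so it passes both criteria, while `summarize(4, 0)` is unanimously accepted but fails criterion (1) because fewer than 5 decisions were made.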
Files:

F31942426: item_property_value_ratio.csv <https://phabricator.wikimedia.org/F31942426>
F31942425: property_ratio.csv <https://phabricator.wikimedia.org/F31942425>
F31942424: datatype_ratio.csv <https://phabricator.wikimedia.org/F31942424>

TASK DETAIL
https://phabricator.wikimedia.org/T253552

To: GoranSMilovanovic
Cc: Ladsgroup, darthmon_wmde, ItamarWMDE, Tarrow, Aklapper, Lydia_Pintscher, Ferdinand0101, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
