agray added a comment.
@GoranSMilovanovic thanks! Looking back at my tests for P.1614, these are the new numbers in the overlap data column (tables view, left hand column). They're a lot higher, but I think they're still incomplete. - P.1614/P.214 overlap - reported as 1654, should be ~2807 - P.1614/P.2015 overlap - reported as 1194, should be ~2369 - P.1614/P.1415 overlap - reported as 1650, should be ~3171 (SPARQL for all three <https://w.wiki/33B>) Checking some random other pairs: - P.1802/P.213 overlap - reported as 3035, should be ~5707 (SPARQL <https://w.wiki/35c>) - P.2042/P.1816 overlap - reported as 532, should be ~1113 (SPARQL <https://w.wiki/35d>) - P.2040/P.5037 overlap - reported as 8393, should be ~11163 (SPARQL <https://w.wiki/35f>) - P.402/P.1566 overlap - reported as 31675, should be ~62623 (SPARQL <https://w.wiki/35g>) The SPARQL for all of these *should* be ignoring multiple instances and only counting each item once, so I think this is still a real undercount. It's interesting that they're mostly around the same range (50-60%, one outlier at 75%). TASK DETAIL https://phabricator.wikimedia.org/T204440 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic, agray Cc: Jheald, agray, Envlh, Lea_Lacroix_WMDE, VIGNERON, Pintoch, Daniel_Mietchen, connorshea, Moebeus, Multichill, Hjfocs, RazShuty, GoranSMilovanovic, Aklapper, Lydia_Pintscher, alaa_wmde, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen, rosalieper, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
