agray added a comment.

  @GoranSMilovanovic - I can confirm that the numbers on the tables seem a bit 
off for some other properties. I've been looking at P1614 
<https://phabricator.wikimedia.org/P1614> (History of Parliament), which is 
complete and fairly stable. It currenty has 21428 IDs on 17942 items (there's a 
lot of items with two/three IDs).
  
  The totals in the "usage data" column are pretty good. The dashboard has 
17950, which is probably correct (I did some duplicate cleanup last month, so 
I'd expect the numbers to be a little different).  But the "overlap data" 
column has the same sort of problems @Envlh and @VIGNERON report.
  
  For VIAF (P214) the dashboard reports 440 items, against a SPARQL total of 
2807. For Hansard ID (P2015 <https://phabricator.wikimedia.org/P2015>), the 
dashboard has 110 and a SPARQL query has 2369. Most dramatically, for the 
Oxford DNB (P1415 <https://phabricator.wikimedia.org/P1415>), the dashboard has 
two items and SPARQL has 3171. Both Hansard and Oxford IDs should be reasonably 
constant - there hasn't been any substantical activity around these identifiers 
for at least a year - so it shouldn't be linked to the dump timings.
  
  Looking at P1415 <https://phabricator.wikimedia.org/P1415> specifically, 
since it's the weirdest one there, the "overlap data" for that property is even 
lower - the most frequent item is VIAF, but only 61 matches. In reality, this 
should be ~40,000 matches out of ~61,000 items. Perhaps some specific 
properties have worse data than others, for some reason?

TASK DETAIL
  https://phabricator.wikimedia.org/T204440

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic, agray
Cc: agray, Envlh, Lea_Lacroix_WMDE, VIGNERON, Pintoch, Daniel_Mietchen, 
connorshea, Moebeus, Multichill, Hjfocs, RazShuty, GoranSMilovanovic, Aklapper, 
Lydia_Pintscher, alaa_wmde, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen, 
rosalieper, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to