[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-07-10 Thread Lydia_Pintscher
Lydia_Pintscher removed a parent task: T90870: selfcontained projects around Wikidata (tracking). TASK DETAIL https://phabricator.wikimedia.org/T204440 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic, Lydia_Pintscher Cc: Jheald,

[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-04-19 Thread GoranSMilovanovic
GoranSMilovanovic added a subscriber: Jheald. GoranSMilovanovic added a comment. @Lydia_Pintscher - Everything else takes place once the WD JSON dump copy to HDFS (T209655 ) is in production, and the Analytics-Engineering tell me that is going

[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-04-19 Thread GoranSMilovanovic
GoranSMilovanovic added a comment. - Implementing changes in the WD external identifier class visualizations: **DONE**; - in relation to T204440#5097057 , a compromise was introduced: - the WD identifier class network is generated to

[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-04-15 Thread agray
agray added a comment. @GoranSMilovanovic - I can confirm that the numbers on the tables seem a bit off for some other properties. I've been looking at P1614 (History of Parliament), which is complete and fairly stable. It currenty has 21428 IDs on

[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-04-15 Thread Envlh
Envlh added a comment. FYI, I also have very different results for P380 (even though it is data from the dump of 2019-04-08). If you follow the link "Usage history" on

[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-04-15 Thread VIGNERON
VIGNERON added a comment. In T204440#5110497 , @GoranSMilovanovic wrote: > @VIGNERON The latest processed dump in Hadoop has a timestamp of 20190204, so February 4th this year I would say. > **Q.** If you have followed the usage of

[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-04-15 Thread VIGNERON
VIGNERON added a comment. Hi, I just tested this new dashboard. The visualisations are great, but I'm more a number cruncher myself. I've been to the "Tables" tab (gain great idea) but something seems a bit off with the numbers. I tried for P380

[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-04-09 Thread Pintoch
Pintoch added a comment. This is nice! However, when visualizing properties by category, it seems that subclasses are not taken into account: only the properties bearing that exact category as P31 value are listed. This gives a pretty inaccurate view:

[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-03-29 Thread GoranSMilovanovic
GoranSMilovanovic added a comment. - The updates for the WD External Identifiers Dashboard will be productionized once T209655 is settled. TASK DETAIL

[Wikidata-bugs] [Maniphest] [Updated] T204440: analyze and visualize the identifier landscape of Wikidata

2019-03-22 Thread GoranSMilovanovic
GoranSMilovanovic added a comment. @Lydia_Pintscher Something to begin with. F28446940: plot_zoom_png.png It will take some time before I have this thing sorted out perfectly - it's complicated. I cannot exclude the possibility that I