| Hall1467 added a comment. |
To follow up on @Halfak's database usage assessments, the estimate of 5 properties per entity/page relationship seems reasonable and conservative since the average number of statements per entity is in fact ~5 as seen here: https://grafana.wikimedia.org/dashboard/db/wikidata-datamodel-statements (its worth noting the average is increasing). This assumes that most of the time, a Wikipedia page is referencing data from just its corresponding Wikidata entity.
@jcrespo, do we have a reasonable estimate of the storage requirements? @Halfak's estimate was 333.8 MB for the table that I proposed.
@Halfak noted there are 3927150 entities with "X" or "O" as the aspect. If this number rose to be equal to the number of items in Wikidata (~16 million) as a conservative upper bound, we would see a total usage of roughly 333.8 * ~4 MB which is 1.34 GB. Although the average number of statements per entity is increasing, this upper bound still seems quite conservative.
Cc: Halfak, jcrespo, TomT0m, Hall1467, hoo, zhuyifei1999, Eloquence, Lydia_Pintscher, Sannita, Ainali, Liuxinyu970226, MZMcBride, Ricordisamoa, Micru, jayvdb, Daniel_Mietchen, Tobi_WMDE_SW, Legoktm, Abraham, Wikidata-bugs, liangent, jeremyb, aude, Candalua, Bianjiang, Aklapper, DixonD, daniel, D3r1ck01, Izno, Mbch331
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
