ArielGlenn added a subscriber: MarkTraceur.
ArielGlenn added a comment.
From email from @MarkTraceur
Database needs
--------------
- 54 million files on Commons
- Estimated average of 10-20 statements per file
- Estimated 1 revision per statement
- Therefore, (very) roughly 1 billion estimated rows added to revisions table
External storage needs
----------------------
- Each file will have its own MediaInfo entity, which will be analogous to
Wikidata items
- So, given Wikidata has about 57 million items, the storage needs should be
about the same
- Obviously that would need to be additional storage, not including the
existing Wikitext
Rates
-----
- We expect multiple bots to run over Commons very shortly after release
(within the next few months)
- Don't anticipate these will be drastically faster than normal bot runs
- Could see Multichill's bots for examples - I believe he's rate-limited
them aggressively
- There will likely be micro-contributions as well
- Think Magnus's "Wikidata game" style, likely similar rates
- Also sanctioned on-wiki machine-aided work (for depicts statements)
- By the end of the calendar year, we expect at least 5 million files to have
structured data
- We're currently sitting in the low six figures (100-300k)
TASK DETAIL
https://phabricator.wikimedia.org/T226093
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: ArielGlenn
Cc: MarkTraceur, ArielGlenn, Aklapper, darthmon_wmde, Legado_Shulgin, Nandana,
JKSTNK, thifranc, AndyTan, Davinaclare77, Qtn1293, Lahi, PDrouin-WMF, Gq86,
E1presidente, Ramsey-WMF, Cparle, Anooprao, SandraF_WMF, GoranSMilovanovic,
Lunewa, Th3d3v1ls, Hfbn0, QZanden, Tramullas, Acer, LawExplorer, Salgo60,
Zppix, Silverfish, _jensen, rosalieper, Susannaanas, Wong128hk, gnosygnu,
Jane023, Wikidata-bugs, Base, matthiasmullie, aude, Ricordisamoa, Wesalius,
Lydia_Pintscher, Fabrice_Florin, Raymond, faidon, Steinsplitter, Mbch331,
Jay8g, fgiunchedi
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs