faidon added a comment.
I don't think it makes sense to perpetuate a vertical scaling model. Both of the options listed here (adding disks, RAID 0) are things that we generally do not do, due to the hidden costs and burdens for everyone involved. Taking machines offline and rebuilding them from scratch just because a disk failed or because we need more storage is really something that we need to avoid, and something that the data center operations team cannot really support with its existing staffing (esp. taking into account the failure rate of disks). PoC/MVPs with vertical scaling are obviously OK and we can be somewhat flexible in HW needs for those, but for a service with the maturity and popularity of WDQS I think it makes sense to start designing it for scale at this point. It's clear that it's here to stay, and that going down a vertical scaling route today would only mean that we're deferring the problem until the next storage expansion, i.e. creating more tech debt. Horizontal scaling and quick/cheap operations for expansion (= adding boxes over time, staggering them over multiple FY) is really the way to go here. Budget-wise we can be flexible indeed! We can allocate some funds without defining exactly how we would spend them (within reason), and defer that decision until there is more clarity on the technical front. Does that make sense? TASK DETAIL https://phabricator.wikimedia.org/T221632 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Gehel, faidon Cc: faidon, Smalyshev, Aklapper, Gehel, alaa_wmde, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
