| Gehel added a comment. |
Coming back to this discussion, I'll try to make my point more clear:
wdqs public endpoint is by nature a service more fragile than most of our other services. The update lag is a good example of a problem we don't seem to be able to get under control on the public endpoint. The consequence of that is that we are starting to ignore those lag alerts. And we don't see any major consequences to this lag on the public cluster, which is an indication that our current alerting threshold does not match the reality of what is needed by the clients of that service. I might be wrong here and maybe this lag is an important issue, and if it is, we need to address it with a high priority.
In T199228#4420685, @Smalyshev wrote:WDQS public endpoint is not expected to have high availability / stability guarantees.
Well, this sounds a bit like giving up on availability (even if it's not the intention), so I think we want to have something. Let's think/brainstorm on what this something could be and how we could measure it.
Yes, it is giving up on at least some level of availability. Or matching expectations with the reality of that availability. Or taking a strong product decision that this public sparql endpoint is expected to have higher availability than it has and acting on that decision.
I'm not sure how much we should formalize an SLO on this endpoint, but expecting the same level of service as from other endpoint does not match the reality and never will unless we take drastic actions. This influences how we should react to failures on this endpoint, so it should be defined in some way.
Cc: Stashbot, Lydia_Pintscher, EBjune, debt, Joe, Smalyshev, Gehel, Aklapper, Nandana, AndyTan, Davinaclare77, Qtn1293, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Th3d3v1ls, Hfbn0, QZanden, merbst, LawExplorer, Zppix, Jonas, Xmlizer, Wong128hk, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, faidon, Mbch331, Jay8g, fgiunchedi
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
