Smalyshev created this task.
Smalyshev added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Wikidata.

TASK DESCRIPTION

To avoid issues like the Updater reading stale data (T210901: Stale reads for WDQS Updater), we may want to enable ChronologyProtector functionality for the RDF exports consumed by the Updater.

Per advice from @aaron, this is what we can do:

<AaronSchulz>	so, in preOutputCommit(), the main DB commit happens, deferred updates run, CP positions are saved, then post-send deferred updates. I suppose if the code that enqueues to kafka put the ChronologyProtector::getClientId() value in the message, and made sure to enqueue post-send, then the updater could relay that client ID as a header for the RDF HTTP request.
<AaronSchulz>	so the updater would want to grab values from kafka (themselves from MW) to use for the ChronologyClientId HTTP header to Special:EntityData

This would require a patch to:

  1. The code that generates Kafka change events for Wikidata, to add the ChronologyProtector client ID to the event data
  2. The code in the Updater that sends RDF requests, to add the ChronologyClientId header
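
The two changes above could be sketched roughly as follows. This is only an illustration in Python, not the actual MediaWiki or Updater code. The event field names ("chronology_client_id", "entity_id") and both helper functions are hypothetical. Only the ChronologyClientId header name and the Special:EntityData endpoint come from the discussion above.

```python
import json
from typing import Dict, Tuple
from urllib.parse import quote

# Hypothetical field name: the task does not say how the ID is stored in the event.
CLIENT_ID_FIELD = "chronology_client_id"
# Header name as given in the discussion above.
CLIENT_ID_HEADER = "ChronologyClientId"


def add_client_id(event: dict, client_id: str) -> dict:
    """Producer side (step 1): return a copy of the change event carrying the client ID."""
    enriched = dict(event)
    enriched[CLIENT_ID_FIELD] = client_id
    return enriched


def build_rdf_request(raw_message: bytes, base_url: str) -> Tuple[str, Dict[str, str]]:
    """Consumer side (step 2): derive the URL and headers for the RDF fetch."""
    event = json.loads(raw_message)
    # "entity_id" is an assumed field name for the changed entity.
    url = f"{base_url}/wiki/Special:EntityData/{quote(event['entity_id'])}.ttl"
    headers: Dict[str, str] = {}
    client_id = event.get(CLIENT_ID_FIELD)
    if client_id:
        # Relay the ID only when the event carries one, so older events still work.
        headers[CLIENT_ID_HEADER] = client_id
    return url, headers
```

For example, a message produced with add_client_id({"entity_id": "Q42"}, "abc123") would yield a request to .../wiki/Special:EntityData/Q42.ttl carrying the ChronologyClientId header. Omitting the header when the field is absent is a design choice here, so that events enqueued before the change can still be processed.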

TASK DETAIL
https://phabricator.wikimedia.org/T212550

To: Smalyshev
Cc: Aklapper, Smalyshev, aaron, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, D3r1ck01, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331