[Wikidata-bugs] [Maniphest] T349147: Follow up on rdf-streaming-updater failure 2023-10-17

2023-10-17 Thread bking
bking created this task. bking added projects: Wikidata-Query-Service, Data-Platform-SRE. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION We had a brief outage of the RDF streaming updater today. Users could have been impacted from about ~2022 - 2050 UTC. Specifically

[Wikidata-bugs] [Maniphest] T349095: Migrate staging rdf-streaming-updater to flink operator

2023-10-17 Thread bking
bking added a comment. We could also deploy via a new namespace, but I wonder what implications that would have for our monitoring/tooling etc. Open to feedback/suggestions on this one. TASK DETAIL https://phabricator.wikimedia.org/T349095 EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-10-17 Thread bking
bking added a subtask: T349095: Migrate staging rdf-streaming-updater to flink operator. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena, Ottomata

[Wikidata-bugs] [Maniphest] T349095: Migrate staging rdf-streaming-updater to flink operator

2023-10-17 Thread bking
bking added a parent task: T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model. TASK DETAIL https://phabricator.wikimedia.org/T349095 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking

[Wikidata-bugs] [Maniphest] T349095: Migrate staging rdf-streaming-updater to flink operator

2023-10-17 Thread bking
bking created this task. bking added projects: Wikidata-Query-Service, Data-Platform-SRE. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION We have tested the flink operator mode in dse-k8s <https://phabricator.wikimedia.org/T342149> . Our next step is to m

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-10-17 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena, Ottomata, bking, Aklapper, dcausse, Danny_Benjafield_WMDE

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-10-16 Thread bking
bking added a comment. Apologies for not catching this earlier: - to be aligned with the dumps that are imported into hdfs we must select a particular set of files, when selecting a "-all" file the preceding "lexemes" one must be taken, e.g. if wikidata-20230

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-10-16 Thread bking
bking added a subtask: T349011: Improve data-reload cookbook based on graph split needs. TASK DETAIL https://phabricator.wikimedia.org/T347504 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: RKemper, dcausse, Aklapper, bking

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-10-12 Thread bking
bking reopened subtask T342149: Test common operations in the flink operator/k8s/Flink ZK environment as In Progress. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-10-12 Thread bking
bking added a comment. I started a data reload for hosts`wdqs1022-1024`. These are running in a tmux window under my user on`cumin1001`. Based on T323096 <https://phabricator.wikimedia.org/T323096> , we expect this process to fail multiple times, which is why we're running in on

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-10-05 Thread bking
bking closed subtask T342149: Test common operations in the flink operator/k8s/Flink ZK environment as Resolved. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-10-05 Thread bking
bking changed the status of subtask T342149: Test common operations in the flink operator/k8s/Flink ZK environment from Open to In Progress. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking

[Wikidata-bugs] [Maniphest] T347505: Prepare new WDQS hosts for graph splitting

2023-10-04 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T347505 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: dcausse, Aklapper, bking, Danny_Benjafield_WMDE, Isabelladantes1983, Themindcoder, Adamm71, Jersione

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-10-04 Thread bking
bking claimed this task. bking moved this task from Incoming to In Progress on the Data-Platform-SRE board. TASK DETAIL https://phabricator.wikimedia.org/T347504 WORKBOARD https://phabricator.wikimedia.org/project/board/6524/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-10-04 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T347504 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: RKemper, dcausse, Aklapper, bking, Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, BTullis

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-10-04 Thread bking
bking added a comment. More details related to dump loading in T325114 <https://phabricator.wikimedia.org/T325114> . TASK DETAIL https://phabricator.wikimedia.org/T347504 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: RKemper, d

[Wikidata-bugs] [Maniphest] T347505: Prepare new WDQS hosts for graph splitting

2023-10-04 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T347505 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: dcausse, Aklapper, bking, Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, BTullis, karapayneWMDE

[Wikidata-bugs] [Maniphest] T347605: Document process for getting JNL files/consider automation

2023-10-04 Thread bking
bking added a comment. Checksum does not match the version from `wdqs1016`, which is: sha1sum wikidata.jnl.zst e3197eb5177dcd1aa0956824cd8dc4afc2d8796c wikidata.jnl.zst I also downloaded the file locally after putting it up in Cloudflare, which has a different checksum

[Wikidata-bugs] [Maniphest] T326914: Migrate the WDQS streaming updater from FlinkKafkaConsumer/Producer to KafkaSource/Sink

2023-10-03 Thread bking
bking added a parent task: T340548: [EPIC] Deployment of the Search Update Pipeline on Flink / k8s. TASK DETAIL https://phabricator.wikimedia.org/T326914 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse, bking Cc: dr0ptp4kt, bking, dcausse

[Wikidata-bugs] [Maniphest] T326914: Migrate the WDQS streaming updater from FlinkKafkaConsumer/Producer to KafkaSource/Sink

2023-10-03 Thread bking
bking added a parent task: T342149: Test common operations in the flink operator/k8s/Flink ZK environment. TASK DETAIL https://phabricator.wikimedia.org/T326914 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse, bking Cc: dr0ptp4kt, bking

[Wikidata-bugs] [Maniphest] T347605: Document process for getting JNL files/consider automation

2023-10-02 Thread bking
bking added a comment. A few notes on this process: - I used zstd compression <https://facebook.github.io/zstd/> to compress the JNL file, as it supposedly offers the best speed. I used `zstd -T0 -19 wikidata.jnl` as my compression command (all cores, maximum compression),

[Wikidata-bugs] [Maniphest] T344905: Publish WDQS JNL files to dumps.wikimedia.org

2023-09-28 Thread bking
bking added a comment. @dr0ptp4kt and I were looking at this today and it occurred to me that the JNL file is uncompressed. Thus, I gzipped the main wikidata JNL file from `wdqs1016`, which takes ~4 hours using pigz at maximum compression rate, and we end up with a ~400 GB file

[Wikidata-bugs] [Maniphest] T347605: Document process for getting JNL files/consider automation

2023-09-28 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T347605 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Addshore, dr0ptp4kt, Aklapper, bking, Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, BTullis

[Wikidata-bugs] [Maniphest] T344905: Publish WDQS JNL files to dumps.wikimedia.org

2023-09-28 Thread bking
bking added a project: Data-Platform-SRE. TASK DETAIL https://phabricator.wikimedia.org/T344905 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: bking, Krinkle, dr0ptp4kt, Abbe98, Gehel, Addshore, Aklapper, Danny_Benjafield_WMDE, Mohamed

[Wikidata-bugs] [Maniphest] T347605: Document process for getting JNL files/consider automation

2023-09-28 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T347605 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Addshore, dr0ptp4kt, Aklapper, bking, AWesterinen, BTullis, Namenlos314, Gq86, Lucas_Werkmeister_WMDE

[Wikidata-bugs] [Maniphest] T347605: Document process for getting JNL files/consider automation

2023-09-28 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T347605 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, AWesterinen, BTullis, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst

[Wikidata-bugs] [Maniphest] T347605: Document process for getting JNL files/consider automation

2023-09-28 Thread bking
bking created this task. bking added projects: Wikidata-Query-Service, Data-Platform-SRE. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION Blazegraph (the application that serves WDQS) stores all its data in a single JNL file. The WDQS file is very large (~1.2TB) so moving

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-09-27 Thread bking
bking added a subscriber: dcausse. bking added a comment. @dcausse couple of questions: - Are we OK to start the data load as soon as these hosts are in production? - Does each host need its data loaded, or can we load on one and data-transfer to the others? TASK DETAIL https

[Wikidata-bugs] [Maniphest] T347505: Prepare new WDQS hosts for graph splitting

2023-09-27 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T347505 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: dcausse, Aklapper, bking, Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, BTullis, karapayneWMDE

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-09-27 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T347504 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, BTullis, karapayneWMDE, Invadibot

[Wikidata-bugs] [Maniphest] T337013: [Epic] Splitting the graph in WDQS

2023-09-27 Thread bking
bking added a subtask: T347505: Prepare new WDQS hosts for graph splitting. TASK DETAIL https://phabricator.wikimedia.org/T337013 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: dr0ptp4kt, RKemper, bking, tfmorris, elal, karapayneWMDE

[Wikidata-bugs] [Maniphest] T347505: Prepare new WDQS hosts for graph splitting

2023-09-27 Thread bking
bking added a parent task: T337013: [Epic] Splitting the graph in WDQS. TASK DETAIL https://phabricator.wikimedia.org/T347505 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, AWesterinen, BTullis, Namenlos314, Gq86

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-09-27 Thread bking
bking added a parent task: T347505: Prepare new WDQS hosts for graph splitting. TASK DETAIL https://phabricator.wikimedia.org/T347504 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, Danny_Benjafield_WMDE, Astuthiodit_1

[Wikidata-bugs] [Maniphest] T347505: Prepare new WDQS hosts for graph splitting

2023-09-27 Thread bking
bking added a subtask: T347504: WDQS graph split: load data from dumps into new hosts. TASK DETAIL https://phabricator.wikimedia.org/T347505 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, AWesterinen, BTullis, Namenlos314

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-09-27 Thread bking
bking removed a subtask: T347505: Prepare new WDQS hosts for graph splitting. TASK DETAIL https://phabricator.wikimedia.org/T347504 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, Danny_Benjafield_WMDE, Astuthiodit_1

[Wikidata-bugs] [Maniphest] T347505: Prepare new WDQS hosts for graph splitting

2023-09-27 Thread bking
bking removed a parent task: T347504: WDQS graph split: load data from dumps into new hosts. TASK DETAIL https://phabricator.wikimedia.org/T347505 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, AWesterinen, BTullis

[Wikidata-bugs] [Maniphest] T337013: [Epic] Splitting the graph in WDQS

2023-09-27 Thread bking
bking removed a subtask: T345475: Service implementation for wdqs202[3-5].codfw.wmnet. TASK DETAIL https://phabricator.wikimedia.org/T337013 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: dr0ptp4kt, RKemper, bking, tfmorris, elal

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-09-27 Thread bking
bking added a subtask: T347505: Prepare new WDQS hosts for graph splitting. TASK DETAIL https://phabricator.wikimedia.org/T347504 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, Danny_Benjafield_WMDE, Astuthiodit_1

[Wikidata-bugs] [Maniphest] T347505: Prepare new WDQS hosts for graph splitting

2023-09-27 Thread bking
bking added a parent task: T347504: WDQS graph split: load data from dumps into new hosts. TASK DETAIL https://phabricator.wikimedia.org/T347505 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, AWesterinen, BTullis

[Wikidata-bugs] [Maniphest] T347505: Prepare new WDQS hosts for graph splitting

2023-09-27 Thread bking
bking created this task. bking added projects: Data-Platform-SRE, Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION wdqs1017-1024 are newly-deployed WDQS hosts. Let's claim 2022-2024 for the graph splitting. These will be a new tier of WDQS

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-09-27 Thread bking
bking added a parent task: T337013: [Epic] Splitting the graph in WDQS. TASK DETAIL https://phabricator.wikimedia.org/T347504 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, AWesterinen, BTullis, Namenlos314, Gq86

[Wikidata-bugs] [Maniphest] T337013: [Epic] Splitting the graph in WDQS

2023-09-27 Thread bking
bking added a subtask: T347504: WDQS graph split: load data from dumps into new hosts. TASK DETAIL https://phabricator.wikimedia.org/T337013 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: dr0ptp4kt, RKemper, bking, tfmorris, elal

[Wikidata-bugs] [Maniphest] T347504: WDQS graph split: load data from dumps into new hosts

2023-09-27 Thread bking
bking created this task. bking added projects: Wikidata-Query-Service, Data-Platform-SRE. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION In order to proceed on T337013 <https://phabricator.wikimedia.org/T337013> , we need to do a full data reload on three hosts, s

[Wikidata-bugs] [Maniphest] T347284: Restore service for https://query.wikidata.org/bigdata/ldf

2023-09-25 Thread bking
bking added a subtask: T347355: Create alerts for https://query.wikidata.org/bigdata/ldf. TASK DETAIL https://phabricator.wikimedia.org/T347284 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: RKemper, bking, MisterSynergy, dcausse

[Wikidata-bugs] [Maniphest] T347355: Create alerts for https://query.wikidata.org/bigdata/ldf

2023-09-25 Thread bking
bking added a parent task: T347284: Restore service for https://query.wikidata.org/bigdata/ldf. TASK DETAIL https://phabricator.wikimedia.org/T347355 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking, AWesterinen, BTullis

[Wikidata-bugs] [Maniphest] T347355: Create alerts for https://query.wikidata.org/bigdata/ldf

2023-09-25 Thread bking
bking created this task. bking added projects: Data-Platform-SRE, Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION Per T347284 <https://phabricator.wikimedia.org/T347284> , we lost the LDF endpoint for a few days. Creating this ticket to add

[Wikidata-bugs] [Maniphest] T347284: Restore service for https://query.wikidata.org/bigdata/ldf

2023-09-25 Thread bking
bking closed this task as "Resolved". bking moved this task from Incoming to Done on the Data-Platform-SRE board. bking claimed this task. bking added a comment. @MisterSynergy After applying the last patch, https://query.wikidata.org/bigdata/ldf seems to be back up. Thus, I'm

[Wikidata-bugs] [Maniphest] T347284: Restore service for https://query.wikidata.org/bigdata/ldf

2023-09-25 Thread bking
bking added a project: Data-Platform-SRE. TASK DETAIL https://phabricator.wikimedia.org/T347284 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: RKemper, bking, MisterSynergy, dcausse, Aklapper, Danny_Benjafield_WMDE, Astuthiodit_1

[Wikidata-bugs] [Maniphest] T347284: Restore service for https://query.wikidata.org/bigdata/ldf

2023-09-25 Thread bking
bking renamed this task from "https://query.wikidata.org/bigdata/ldf is broken" to "Restore service for https://query.wikidata.org/bigdata/ldf;. TASK DETAIL https://phabricator.wikimedia.org/T347284 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/e

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-09-22 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena, Ottomata, bking, Aklapper, dcausse, Danny_Benjafield_WMDE

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-09-21 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena, Ottomata, bking, Aklapper, dcausse, Danny_Benjafield_WMDE

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-09-21 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena, Ottomata, bking, Aklapper, dcausse, Danny_Benjafield_WMDE

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-09-20 Thread bking
bking added a subtask: T342149: Test common operations in the flink operator/k8s/Flink ZK environment. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-09-19 Thread bking
bking closed subtask T341792: Provision Zookeeper Cluster for storing Flink HA data as Resolved. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena

[Wikidata-bugs] [Maniphest] T346456: Improve concurrency limits configuration of the wdqs updater

2023-09-15 Thread bking
bking added a parent task: T342149: Test common operations in the flink operator/k8s/Flink ZK environment. TASK DETAIL https://phabricator.wikimedia.org/T346456 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, bking

[Wikidata-bugs] [Maniphest] T337013: [Epic] Splitting the graph in WDQS

2023-09-15 Thread bking
bking added a subscriber: RKemper. bking added a comment. When T345475 <https://phabricator.wikimedia.org/T345475> is done, we should have 3 new WDQS hosts in CODFW that could be used for the graph splitting experiment. @RKemper let us know if you have any objections to this plan.

[Wikidata-bugs] [Maniphest] T337013: [Epic] Splitting the graph in WDQS

2023-09-15 Thread bking
bking added a subtask: T345475: Service implementation for wdqs202[3-5].codfw.wmnet. TASK DETAIL https://phabricator.wikimedia.org/T337013 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: RKemper, bking, tfmorris, elal, karapayneWMDE

[Wikidata-bugs] [Maniphest] T344284: Rename usages of whitelist to allowlist in query service rdf repo

2023-09-14 Thread bking
bking moved this task from Ready for Work to Done on the Data-Platform-SRE board. bking added a comment. We successfully deployed this yesterday; moving to "Done" on the workboard. TASK DETAIL https://phabricator.wikimedia.org/T344284 WORKBOARD https://phabricator.wikimedia.o

[Wikidata-bugs] [Maniphest] T326914: Migrate the WDQS streaming updater from FlinkKafkaConsumer/Producer to KafkaSource/Sink

2023-09-14 Thread bking
bking added a project: Data-Platform-SRE. TASK DETAIL https://phabricator.wikimedia.org/T326914 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: bking, dcausse, Aklapper, Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, BTullis

[Wikidata-bugs] [Maniphest] T344882: Some servers for the Commons query service (WCQS) are missing data

2023-09-05 Thread bking
bking moved this task from Ready for Work to Done on the Data-Platform-SRE board. bking added a comment. Hello, I've fixed wdqs1003 as well and it appears we have the same amount of triples now <https://grafana.wikimedia.org/d/00489/wikidata-query-service?orgId=1=1m=now-24h=now_n

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-09-05 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena, Ottomata, bking, Aklapper, dcausse, Danny_Benjafield_WMDE

[Wikidata-bugs] [Maniphest] T337296: Allow federated queries with the NLG endpoint (data.nlg.gr)

2023-08-30 Thread bking
bking added a comment. Thanks for the quick response. It could very well be our fault, as we just changed how we deploy our federation allowlist <https://phabricator.wikimedia.org/T343856> . Thus, we'll continue to troubleshoot from our side as well. TASK DETAIL

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-08-30 Thread bking
bking claimed this task. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena, Ottomata, bking, Aklapper, dcausse, Danny_Benjafield_WMDE, fbalicchia

[Wikidata-bugs] [Maniphest] T337296: Allow federated queries with the NLG endpoint (data.nlg.gr)

2023-08-30 Thread bking
bking added a comment. @Epidosis sorry for the delay on this ticket. We've added your endpoint, can you please test it and let us know if it works? Thanks for your patience. TASK DETAIL https://phabricator.wikimedia.org/T337296 EMAIL PREFERENCES https://phabricator.wikimedia.org

[Wikidata-bugs] [Maniphest] T337296: Allow federated queries with the NLG endpoint (data.nlg.gr)

2023-08-28 Thread bking
bking claimed this task. bking updated Other Assignee, added: RKemper. TASK DETAIL https://phabricator.wikimedia.org/T337296 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Aklapper, EBernhardson, dcausse, Epidosis, Danny_Benjafield_WMDE

[Wikidata-bugs] [Maniphest] T344882: Some servers for the Commons query service (WCQS) are missing data

2023-08-24 Thread bking
bking added a comment. I've repooled wcqs1001 and wcqs1002 after verifying they have the correct amount of triples. We've left wcqs1003 depooled as we troubleshoot further. Let us know if you notice any other issues. TASK DETAIL https://phabricator.wikimedia.org/T344882 EMAIL PREFERENCES

[Wikidata-bugs] [Maniphest] T344882: Some servers for the Commons query service (WCQS) are missing data

2023-08-24 Thread bking
bking added a comment. Thanks for bringing this to our attention, and sorry for the inconvenience. I have depooled the eqiad datacenter while we work to address the issue. Please test our your queries and let us know if you're still getting inconsistent results. TASK DETAIL https

[Wikidata-bugs] [Maniphest] T343856: Move whitelist.txt from WDQS deploy repo into puppet and rename it to "allow list"

2023-08-22 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T343856 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: RKemper, bking Cc: Gehel, Reedy, bking, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, Themindcoder

[Wikidata-bugs] [Maniphest] T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model

2023-08-21 Thread bking
bking added a subtask: T344614: Add Zookeeper config to 'flink-app' test service. TASK DETAIL https://phabricator.wikimedia.org/T326409 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, BTullis, JMeybohm, gmodena, Ottomata, bking

[Wikidata-bugs] [Maniphest] T343856: Move whitelist.txt from WDQS deploy repo into puppet and rename it to "allow list"

2023-08-15 Thread bking
bking updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T343856 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, Reedy, bking, Aklapper, Danny_Benjafield_WMDE, Isabelladantes1983, Themindcoder, Adamm71

[Wikidata-bugs] [Maniphest] T336134: wdqs2*** lagged for more than one day

2023-08-14 Thread bking
bking closed subtask T337801: WDQS: Document procedure for switching between Kubernetes and Yarn Streaming Updater as Resolved. TASK DETAIL https://phabricator.wikimedia.org/T336134 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc

[Wikidata-bugs] [Maniphest] T336134: wdqs2*** lagged for more than one day

2023-08-11 Thread bking
bking added a subtask: T337801: WDQS: Document procedure for switching between Kubernetes and Yarn Streaming Updater. TASK DETAIL https://phabricator.wikimedia.org/T336134 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: karapayneWMDE

[Wikidata-bugs] [Maniphest] T339347: qlever dblp endpoint for wikidata federated query nomination

2023-08-08 Thread bking
bking added a comment. @WolfgangFahl We've whitelisted the endpoints, but the query you linked above <https://w.wiki/6q2i> still does not work. Can you verify that is it working as expected? My teammate mentioned "it's returning application/sparql-results+xml but we on

[Wikidata-bugs] [Maniphest] T332314: Configure new WDQS servers in codfw (wdqs20[13-22])

2023-08-07 Thread bking
bking closed subtask T330714: Document SRE steps for deploying a new WDQS (and WCQS) host as Resolved. TASK DETAIL https://phabricator.wikimedia.org/T332314 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: RKemper, Gehel, Aklapper

[Wikidata-bugs] [Maniphest] T332314: Configure new WDQS servers in codfw (wdqs20[13-22])

2023-07-17 Thread bking
bking added a comment. Update: `wdqs2016.codfw.wmnet` is the last host that needs to be configured for production. `wdqs2020.codfw.wmnet` has been receiving production traffic for a week now, with no observed issues. We should be able to finish the rest pretty soon and start

[Wikidata-bugs] [Maniphest] T332314: Configure new WDQS servers in codfw (wdqs20[13-22])

2023-07-10 Thread bking
bking added a comment. Update: I forgot to target 2013 in my last command, here is the latest list of hosts that need a data transfer and a deploy: (4) wdqs[2013-2016].codfw.wmnet - OUTPUT of 'du -hcxs /srv/de...6756ebe194261756' - 132M /srv/deployment/wdqs/wdqs

[Wikidata-bugs] [Maniphest] T339347: qlever dblp endpoint for wikidata federated query nomination

2023-07-10 Thread bking
bking set the point value for this task to "1". TASK DETAIL https://phabricator.wikimedia.org/T339347 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: bking, Aklapper, WolfgangFahl, Astuthiodit_1, AWesterinen, BTullis, kar

[Wikidata-bugs] [Maniphest] T332314: Configure new WDQS servers in codfw (wdqs20[13-22])

2023-07-07 Thread bking
bking added a comment. Update: wdqs[2017-2021].codfw.wmnet are now production ready: = NODE GROUP = (4) wdqs[2014-2016,2022].codfw.wmnet - OUTPUT of 'du -hcxs /srv/de...6756ebe194261756' - 132M /srv/deployment/wdqs/wdqs-cache/revs

[Wikidata-bugs] [Maniphest] T332314: Configure new WDQS servers in codfw (wdqs20[13-22])

2023-07-07 Thread bking
bking added a comment. Current state: 2019 and 2020 are production-ready. The others need a data transfer and/or scap deploy to be complete. The command below checks the deployment directory size. If the directory size is smaller than 471M, that means `git-fat` isn't working and the host

[Wikidata-bugs] [Maniphest] T332314: Configure new WDQS servers in codfw (wdqs20[13-22])

2023-07-06 Thread bking
bking added a subtask: T330714: Document SRE steps for deploying a new WDQS (and WCQS) host. TASK DETAIL https://phabricator.wikimedia.org/T332314 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: RKemper, Gehel, Aklapper, Astuthiodit_1

[Wikidata-bugs] [Maniphest] T321605: Make WCQS/WDQS data transfer cookbook more reliable

2023-06-22 Thread bking
bking reopened this task as "In Progress". TASK DETAIL https://phabricator.wikimedia.org/T321605 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Vgutierrez, RKemper, Volans, Aklapper, bking, Astuthiodit_1, AWesterinen, kar

[Wikidata-bugs] [Maniphest] T332314: Configure new WDQS servers in codfw (wdqs20[13-22])

2023-06-21 Thread bking
bking claimed this task. TASK DETAIL https://phabricator.wikimedia.org/T332314 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, Aklapper, Astuthiodit_1, AWesterinen, BTullis, karapayneWMDE, Invadibot, MPhamWMF, maantietaja

[Wikidata-bugs] [Maniphest] T321605: Make WCQS/WDQS data transfer cookbook more reliable

2023-06-13 Thread bking
bking added a project: Discovery-Search (Current work). TASK DETAIL https://phabricator.wikimedia.org/T321605 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Vgutierrez, RKemper, Volans, Aklapper, bking, Isabelladantes1983, Themindcoder

[Wikidata-bugs] [Maniphest] T321605: Make WCQS/WDQS data transfer cookbook more reliable

2023-06-13 Thread bking
bking moved this task from monitoring to in progress on the Wikidata board. bking closed this task as "Resolved". bking claimed this task. bking added a comment. I believe this is complete; moving to 'needs review' status. TASK DETAIL https://phabricator.wikimedia.org/T321605

[Wikidata-bugs] [Maniphest] T336709: Allow federated queries with the BNCF SPARQL endpoint

2023-06-13 Thread bking
bking added a comment. @Epidosis sorry for the long turnaround on this. Can you try with 'https://' as opposed to 'http://' ? Here <https://w.wiki/6puk> is the result we get when using 'https://'. TASK DETAIL https://phabricator.wikimedia.org/T336709 EMAIL PREFERENCES

[Wikidata-bugs] [Maniphest] T321605: Make WCQS/WDQS data transfer cookbook more reliable

2023-05-31 Thread bking
bking added a comment. I've been working on this a bit more lately. The Transfer.py documentation <https://doc.wikimedia.org/transferpy/master/transferpy/transferpy.html#module-transferpy.Firewall> mentions "remote_execution" but does not mention that it's a required argumen

[Wikidata-bugs] [Maniphest] T336574: Review alerting around Wikidata Query Service update pipeline

2023-05-31 Thread bking
bking added a comment. Per today's SRE meeting, the larger SRE org is working on a comprehensive alert review <https://etherpad.wikimedia.org/p/alert-review-may-2023> . We should work with the SREs to help out and use their methods to review our own alerts. TASK DETAIL

[Wikidata-bugs] [Maniphest] T336577: Update WDQS Runbook following update lag incident

2023-05-30 Thread bking
bking added projects: Sustainability, SRE-OnFire. TASK DETAIL https://phabricator.wikimedia.org/T336577 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: bking, dcausse, Gehel, Aklapper, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot

[Wikidata-bugs] [Maniphest] T336574: Review alerting around Wikidata Query Service update pipeline

2023-05-30 Thread bking
bking edited projects, added Sustainability, SRE-OnFire; removed Sustainability (Incident Followup). TASK DETAIL https://phabricator.wikimedia.org/T336574 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: bking, Aklapper, Gehel

[Wikidata-bugs] [Maniphest] T337230: Include LiLa Linking Latin SPARQL endpoint in whitelist for federated queries

2023-05-30 Thread bking
bking added a comment. @DL2204 Thanks for your patience. We've added https://lila-erc.eu/sparql/lila_knowledge_base/sparql as an allowed SPARQL endpoint for WDQS. Please test this out and respond here with your results-good or bad. TASK DETAIL https://phabricator.wikimedia.org

[Wikidata-bugs] [Maniphest] T335994: Allow federated queries to the UNESCO SPARQL endpoint

2023-05-30 Thread bking
bking added a comment. @Nikki Thanks for your patience. We've added https://vocabularies.unesco.org/sparql as an allowed SPARQL endpoint for WDQS. Please test this out and respond here with your results-good or bad. TASK DETAIL https://phabricator.wikimedia.org/T335994 EMAIL

[Wikidata-bugs] [Maniphest] T336709: Allow federated queries with the BNCF SPARQL endpoint

2023-05-30 Thread bking
bking added a comment. @Epidosis Thanks for your patience. We've added https://digitale.bncf.firenze.sbn.it/openrdf-workbench/repositories/NS/query as an allowed SPARQL endpoint for WDQS. Please test this out and respond here with your results-good or bad. TASK DETAIL https

[Wikidata-bugs] [Maniphest] T336577: Update WDQS Runbook following update lag incident

2023-05-25 Thread bking
bking added a comment. Other action items: - Add link to new WDQS superset dashboard to WDQS runbook page. - Fix dead logstash link on WDQS runbook page <https://wikitech.wikimedia.org/wiki/Wikidata_Query_Service/Runbook#Timeouts> - Better documentation of throttling be

[Wikidata-bugs] [Maniphest] T336577: Update WDQS Runbook following update lag incident

2023-05-22 Thread bking
bking claimed this task. TASK DETAIL https://phabricator.wikimedia.org/T336577 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: bking, dcausse, Gehel, Aklapper, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, Zabe, MPhamWMF

[Wikidata-bugs] [Maniphest] T336577: Update WDQS Runbook following update lag incident

2023-05-22 Thread bking
bking added subscribers: dcausse, bking. bking added a comment. Updated the Streaming Updater operations docs <https://wikitech.wikimedia.org/wiki/Wikidata_Query_Service/Streaming_Updater#The_consumers_are_backlogged> after today's pairing session with @dcausse . We'll continue to

[Wikidata-bugs] [Maniphest] T336574: Review alerting around Wikidata Query Service update pipeline

2023-05-22 Thread bking
bking added a comment. Revised totals for alerts in the last year after looking at Logstash: `RdfStreamingUpdaterHighConsumerUpdateLag` 373 `RdfStreamingUpdaterFlinkProcessingLatencyIsHigh` 63 `RdfStreamingUpdaterFlinkJobUnstable` 125 The majority of all three alert types fired

[Wikidata-bugs] [Maniphest] T336574: Review alerting around Wikidata Query Service update pipeline

2023-05-18 Thread bking
bking added a comment. Quick notes here before I forget. Checking my "alerts" email folder for the past year (not the most reliable source), I have: - 89 alerts with title RdfStreamingUpdaterHighConsumerUpdateLag - 72 alerts with title RdfStreamingUpdaterFlinkProcessingLat

[Wikidata-bugs] [Maniphest] T325602: Decide whether or not to keep wdqs-heavy-queries and wdqs-ssl PyBal pools

2023-05-15 Thread bking
bking removed a project: Discovery-Search (Current work). TASK DETAIL https://phabricator.wikimedia.org/T325602 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: Gehel, Aklapper, RKemper, bking, Astuthiodit_1, AWesterinen, karapayneWMDE

[Wikidata-bugs] [Maniphest] T193473: Add HTTPS support to wdqs-internal service

2023-05-15 Thread bking
bking removed a project: Discovery-Search (Current work). TASK DETAIL https://phabricator.wikimedia.org/T193473 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: bking, Aklapper, Smalyshev, Gehel, Astuthiodit_1, AWesterinen, karapayneWMDE

[Wikidata-bugs] [Maniphest] T274270: WDQS servers taking up to 30 minutes to reboot

2023-05-15 Thread bking
bking edited projects, added Discovery-Search; removed Discovery-Search (Current work). TASK DETAIL https://phabricator.wikimedia.org/T274270 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking Cc: bking, RKemper, Gehel, Aklapper, Astuthiodit_1

<    1   2   3   4   >