[Wikidata-bugs] [Maniphest] T316882: RdfStreamingUpdaterHighConsumerUpdateLag alert is not fired

2022-09-01 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T316882 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: dcausse, Aklapper, AWesterinen, MPhamWMF, CBogen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE

[Wikidata-bugs] [Maniphest] T293063: Write and adapt Runbooks and cookbooks related to the WDQS Streaming Updater and kubernetes

2022-09-01 Thread dcausse
dcausse added a comment. @JMeybohm thanks for the write-up! I added few more notes. TASK DETAIL https://phabricator.wikimedia.org/T293063 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: RKemper, Gehel, bking, JMeybohm, Jelto, Aklapper

[Wikidata-bugs] [Maniphest] T293063: Write and adapt Runbooks and cookbooks related to the WDQS Streaming Updater and kubernetes

2022-09-01 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T293063 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: RKemper, Gehel, bking, JMeybohm, Jelto, Aklapper, jijiki, dcausse, Astuthiodit_1, AWesterinen

[Wikidata-bugs] [Maniphest] T316882: RdfStreamingUpdaterHighConsumerUpdateLag alert is not fired

2022-09-01 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION RdfStreamingUpdaterHighConsumerUpdateLag is an alert that should be fired when WDQS or WCQS machine has a lag greater than 10 minutes. Looking

[Wikidata-bugs] [Maniphest] T316031: Clean up the rdf-streaming-updater-codfw container from thanos-swift.

2022-08-29 Thread dcausse
dcausse added a comment. @bking thanks for running the cleanup! I can confirm that the `wikidata` and `commons` pseudo-folders are empty, the `flink_ha_storage` folder also needs to be emptied. Something I don't fully understand yet is why https://thanos.wikimedia.org/graph?g0

[Wikidata-bugs] [Maniphest] T316496: WCQS does not report proper lag information

2022-08-29 Thread dcausse
dcausse claimed this task. dcausse moved this task from Prioritized to Needs review on the Discovery-Search (Current work) board. TASK DETAIL https://phabricator.wikimedia.org/T316496 WORKBOARD https://phabricator.wikimedia.org/project/board/1227/ EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] T316496: WCQS does not report proper lag information

2022-08-29 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T316496 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: Multichill, Aklapper, HenkvD, Bugreporter, RP88, dcausse, Astuthiodit_1, AWesterinen

[Wikidata-bugs] [Maniphest] T316236: Reload WCQS from dumps

2022-08-29 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T316236 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: HenkvD, Aklapper, dcausse, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, MPhamWMF

[Wikidata-bugs] [Maniphest] T316496: WCQS does not report proper lag information

2022-08-29 Thread dcausse
dcausse removed dcausse as the assignee of this task. TASK DETAIL https://phabricator.wikimedia.org/T316496 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: Multichill, Aklapper, HenkvD, Bugreporter, RP88, dcausse, Astuthiodit_1

[Wikidata-bugs] [Maniphest] T289836: Upgrade the WDQS streaming updater to latest flink (1.15)

2022-08-29 Thread dcausse
dcausse renamed this task from "Upgrade to latest flink (1.14)" to "Upgrade the WDQS streaming updater to latest flink (1.15)". dcausse added a subscriber: Event-Platform Value Stream. TASK DETAIL https://phabricator.wikimedia.org/T289836 EMAIL PREFERENCES https://phabr

[Wikidata-bugs] [Maniphest] T316496: WCQS does not report proper lag information

2022-08-29 Thread dcausse
dcausse created this task. dcausse added projects: Commons, Wikidata-Query-Service. TASK DESCRIPTION Fixing T314703 <https://phabricator.wikimedia.org/T314703> introduced a regression in the way the lag is reported to both users (from the UI) and in the prometheus

[Wikidata-bugs] [Maniphest] T314703: Structured data for deleted files on Commons still visible in SPARQL engine after deletion

2022-08-29 Thread dcausse
dcausse added a comment. In T314703#8189526 <https://phabricator.wikimedia.org/T314703#8189526>, @Bugreporter wrote: > Note suppress delete should need special handling. Compare T105427: Need a way for WDQS updater to become aware of suppressed delete

[Wikidata-bugs] [Maniphest] T314703: Structured data for deleted files on Commons still visible in SPARQL engine after deletion

2022-08-25 Thread dcausse
dcausse added a subtask: T316236: Reload WCQS from dumps. TASK DETAIL https://phabricator.wikimedia.org/T314703 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: RP88, Bugreporter, HenkvD, Aklapper, Multichill, Hellket777, LisafBia6531

[Wikidata-bugs] [Maniphest] T316236: Reload WCQS from dumps

2022-08-25 Thread dcausse
dcausse added a parent task: T314703: Structured data for deleted files on Commons still visible in SPARQL engine after deletion. TASK DETAIL https://phabricator.wikimedia.org/T316236 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc

[Wikidata-bugs] [Maniphest] T316236: Reload WCQS from dumps

2022-08-25 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION Folluwup on T314703 <https://phabricator.wikimedia.org/T314703> where the consumer side of the updater was misconfigured and caused all d

[Wikidata-bugs] [Maniphest] T315124: Add OpenDataSweden to the SPARQL whitelist

2022-08-25 Thread dcausse
dcausse claimed this task. dcausse moved this task from Ready for Development to Needs review on the Discovery-Search (Current work) board. TASK DETAIL https://phabricator.wikimedia.org/T315124 WORKBOARD https://phabricator.wikimedia.org/project/board/1227/ EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] T314703: Structured data for deleted files on Commons still visible in SPARQL engine after deletion

2022-08-25 Thread dcausse
dcausse claimed this task. dcausse moved this task from Ready for Development to In Progress on the Discovery-Search (Current work) board. TASK DETAIL https://phabricator.wikimedia.org/T314703 WORKBOARD https://phabricator.wikimedia.org/project/board/1227/ EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] T315532: Investigate number of existing geoshape usages

2022-08-23 Thread dcausse
dcausse added a project: Wikidata-Query-Service. TASK DETAIL https://phabricator.wikimedia.org/T315532 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: lilients_WMDE, dcausse Cc: dcausse, Gehel, awight, ECohen_WMDE, Lena_WMDE, Aklapper, thiemowmde

[Wikidata-bugs] [Maniphest] T316031: Clean up the rdf-streaming-updater-codfw container from thanos-swift.

2022-08-23 Thread dcausse
dcausse merged a task: T316003: Cleanup the rdf-streaming-updater-codfw swift container in thanos . TASK DETAIL https://phabricator.wikimedia.org/T316031 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: bking, dcausse Cc: Aklapper, fgiunchedi

[Wikidata-bugs] [Maniphest] T316003: Cleanup the rdf-streaming-updater-codfw swift container in thanos

2022-08-23 Thread dcausse
dcausse closed this task as a duplicate of T316031: Clean up the rdf-streaming-updater-codfw container from thanos-swift.. TASK DETAIL https://phabricator.wikimedia.org/T316003 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: fgiunchedi

[Wikidata-bugs] [Maniphest] T314835: wdqs space usage on thanos-swift

2022-08-23 Thread dcausse
dcausse added a comment. The 3 tasks above should be the followups of this incident. The root cause of the incident is I think a mix of the poor `swift` client used by the flink H/A component and possibly the instability of thanos-fe2001 that exacerbated the poor behaviors of this swift

[Wikidata-bugs] [Maniphest] T316028: Run the rdf-streaming-updater from k8s@codfw

2022-08-23 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION This is a followup on the incident T314835 <https://phabricator.wikimedia.org/T314835>. To mitigate the issue the flink job was starte

[Wikidata-bugs] [Maniphest] T316005: Add monitoring and alerting on the usage of the rdf-streaming-updater swift containers in thanos

2022-08-23 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION As a maintainer of the W[DC]QS Streaming Updater I want to be monitor and be alerted when the space usage of these flink jobs reach a certain

[Wikidata-bugs] [Maniphest] T316003: Cleanup the rdf-streaming-updater-codfw swift container in thanos

2022-08-23 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION As a followup of T314835 <https://phabricator.wikimedia.org/T314835> we should cleanup the `rdf-streaming-updater-codfw` container. The

[Wikidata-bugs] [Maniphest] T296014: relevancy score for Items in RDF output

2022-08-22 Thread dcausse
dcausse added a project: Wikidata-Query-Service. TASK DETAIL https://phabricator.wikimedia.org/T296014 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: hoo, dcausse Cc: Lucas_Werkmeister_WMDE, Manuel, Aklapper, Lydia_Pintscher, Hellket777

[Wikidata-bugs] [Maniphest] T314835: wdqs space usage on thanos-swift

2022-08-09 Thread dcausse
dcausse added a comment. Unfortunately I could not finish the cleanup of the `flink_ha_storage` folder to properly resume operations from k8s. I resumed the job from yarn using the same swift container `rdf-streaming-updater-codfw` (I had tried to resume the jobs from a fresh container

[Wikidata-bugs] [Maniphest] T314835: wdqs space usage on thanos-swift

2022-08-09 Thread dcausse
dcausse added a comment. Current status: - all flink jobs are stopped in codfw - wdqs traffic is eqiad - wikidata maxlag is only checking eqiad - the rdf-streaming-updater namespace in k8s@codfw has been wiped out in preparation of the deployment of https://gerrit.wikimedia.org/r

[Wikidata-bugs] [Maniphest] T314835: wdqs space usage on thanos-swift

2022-08-09 Thread dcausse
dcausse added a comment. It seems (still not 100% sure yet but seeing a lot of failures related to this) that the repeated failures are caused by the bad swift client we are still using for the flink ha storage, we stopped using this client for the job states (T302494 <ht

[Wikidata-bugs] [Maniphest] T307869: Request for new search profile for Wikidata that boosts Items for languages

2022-07-26 Thread dcausse
dcausse added a comment. @Lucas_Werkmeister_WMDE yes I think it's doable, attached a quick patch to demonstrate how TASK DETAIL https://phabricator.wikimedia.org/T307869 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Lucas_Werkmeister_WMDE

[Wikidata-bugs] [Maniphest] T307869: Request for new search profile for Wikidata that boosts Items for languages

2022-07-18 Thread dcausse
dcausse edited projects, added Discovery-Search; removed Discovery-Search (Current work). TASK DETAIL https://phabricator.wikimedia.org/T307869 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Lucas_Werkmeister_WMDE, dcausse Cc: Lea_WMDE

[Wikidata-bugs] [Maniphest] T304976: Investigate how to make WDQS label service fall back to mul labels

2022-07-15 Thread dcausse
dcausse added a comment. @Manuel sorry for the late response. I'm not very familiar with the query service UI features but if the `autocompletion suggestions` you mention are the ones you obtain when hitting `Ctlr-space` in the SPARQL editor I believe it would make sense indeed

[Wikidata-bugs] [Maniphest] T312107: Inline search on Wikidata is unable to find all mul labels and aliases

2022-07-15 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T312107 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: dcausse, Manuel, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE

[Wikidata-bugs] [Maniphest] T312107: Inline search on Wikidata is unable to find all mul labels and aliases

2022-07-15 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T312107 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: dcausse, Manuel, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE

[Wikidata-bugs] [Maniphest] T312107: Inline search on Wikidata is unable to find all mul labels and aliases

2022-07-15 Thread dcausse
dcausse added a comment. I think the reason why it's not finding this particular item is that the CirrusSearch index needs to be recreated to take into account the new `mul` language field. The problem is less visible during fulltext search as the term `John Doe II` is present in the main

[Wikidata-bugs] [Maniphest] T307869: Request for new search profile for Wikidata that boosts Items for languages

2022-06-29 Thread dcausse
dcausse added a comment. No new profiles should be created for other wikibase installation as most of the wikidata specific options are managed in wmf specific config, not Wikibase nor CirrusSearch so the new Lexeme creation page should behave exactly as before. All the fixes we had

[Wikidata-bugs] [Maniphest] T307869: Request for new search profile for Wikidata that boosts Items for languages

2022-06-28 Thread dcausse
dcausse added a comment. There is yet another problem (see patch above that should fix it). I'm sorry that deploying this profile is such a pain, it demonstrates a clear problem in the way we (the search team) deploy such features/profiles and I filed T311528 <ht

[Wikidata-bugs] [Maniphest] T307869: Request for new search profile for Wikidata that boosts Items for languages

2022-06-27 Thread dcausse
dcausse added a comment. Sorry about that, there was yet another issue in the WikibaseCirrusSearch Hook that caused the config to be ignored and cause the language selector profile context to simply use exactly the same settings as the classic entity completion search. There was also

[Wikidata-bugs] [Maniphest] T307869: Request for new search profile for Wikidata that boosts Items for languages

2022-06-23 Thread dcausse
dcausse added a comment. The above patch should fix the issue, I forgot that profile repositories must have have unique names, sorry about that! TASK DETAIL https://phabricator.wikimedia.org/T307869 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] T307869: Request for new search profile for Wikidata that boosts Items for languages

2022-06-01 Thread dcausse
dcausse added a comment. The patches above add few placeholder to allow tuning a custom profile meant to be use by the language selector on Special:NewLexeme: - https://gerrit.wikimedia.org/r/c/mediawiki/extensions/WikibaseLexemeCirrusSearch/+/801791/ adds a new `profile context` named

[Wikidata-bugs] [Maniphest] T268864: WikibaseCirrusSearch uses Elastica's Match class

2022-05-18 Thread dcausse
dcausse changed the status of subtask T271777: Bump rufin/elastica (and related libraries) to versions that support PHP 8.0 from Stalled to Open. TASK DETAIL https://phabricator.wikimedia.org/T268864 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] T268865: WikibaseLexemeCirrusSearch uses Elastica's Match class

2022-05-18 Thread dcausse
dcausse changed the status of subtask T271777: Bump rufin/elastica (and related libraries) to versions that support PHP 8.0 from Stalled to Open. TASK DETAIL https://phabricator.wikimedia.org/T268865 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] T307635: Query service results are missing some variables on some servers

2022-05-09 Thread dcausse
dcausse added a comment. This is extremely weird and I suspect a serious blazegraph bug that causes this. I could not reproduce the problem at the moment running the python script provided but it might certainly happen again in the future. I'm not sure how to proceed here but perhaps

[Wikidata-bugs] [Maniphest] T306054: Upgrade deployment-wdqs01 host to Buster

2022-04-25 Thread dcausse
dcausse added a project: Discovery-Search (Current work). TASK DETAIL https://phabricator.wikimedia.org/T306054 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: dcausse, Lucas_Werkmeister_WMDE, Mathew.onipe, Aklapper, Majavah, Peachey88

[Wikidata-bugs] [Maniphest] T305983: query.wikidata.org/bigdata/ldf - Language string should include language tag

2022-04-25 Thread dcausse
dcausse moved this task from Incoming to For Later on the Wikidata-Query-Service board. dcausse triaged this task as "Medium" priority. TASK DETAIL https://phabricator.wikimedia.org/T305983 WORKBOARD https://phabricator.wikimedia.org/project/board/891/ EMAIL PREFERENC

[Wikidata-bugs] [Maniphest] T306054: Upgrade deployment-wdqs01 host to Buster

2022-04-14 Thread dcausse
dcausse added a comment. I can confirm, this host is not used. TASK DETAIL https://phabricator.wikimedia.org/T306054 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: dcausse, Lucas_Werkmeister_WMDE, Mathew.onipe, Aklapper, Majavah

[Wikidata-bugs] [Maniphest] T305818: Perform a data transfer to wdqs2004 & wdqs1004 to reclaim burnt allocators

2022-04-11 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION wdqs2004 & wdqs1004 lost their free allocators too quickly (known issue that pops up //randomly//). We should do a data transfer from a sane so

[Wikidata-bugs] [Maniphest] T302189: Regularly purge orphaned sitelink, value and reference nodes

2022-04-05 Thread dcausse
dcausse added a comment. Reason is that this data //may// be referenced by other items and thus cannot be deleted blindly without asking blazegraph: //"is this data used by another item?"// which would be too costly to ask for every edit. Another approach is to reload blaze

[Wikidata-bugs] [Maniphest] T293862: Investigate using jvmquake to limit the time a JVM is unusable due to GC overhead

2022-04-01 Thread dcausse
dcausse added a comment. Actually wdqs2007, wdqs2004 and wdqs2003 also triggered jvmquake, GC activity increased and wdqs2007 & wdqs2003 were unresponsive for a couple minutes. For wdqs2004 there are no visible blips in the various graph. I guess we should relax the settings a bit

[Wikidata-bugs] [Maniphest] T293862: Investigate using jvmquake to limit the time a JVM is unusable due to GC overhead

2022-04-01 Thread dcausse
dcausse added a comment. With the settings we properly detected wdqs1006 going down for 30minutes at `2022-04-22T12:30:00` (this 2minutes after the first blip in the graph). Unfortunately there was a false positive wdqs1012 at `2022-04-22T10:00:00` as this machine was unavailable from 2

[Wikidata-bugs] [Maniphest] T304365: Add property predicates to WCQS

2022-04-01 Thread dcausse
dcausse moved this task from Incoming to Scaling on the Wikidata-Query-Service board. dcausse added a comment. I agree that federation is adding a lot of //boiler plate// and inspecting the shape of the IRIs is very fragile. But merging multiple graphs into the same store for ease of use

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-03-31 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T301147 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: elukey, akosiaris, Gehel, RKemper, bking, toan, Addshore, JMeybohm, Michael, Aklapper, dcausse

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-03-31 Thread dcausse
dcausse added a comment. Thanks for the quick answer! (response inline) In T301147#7821582 <https://phabricator.wikimedia.org/T301147#7821582>, @JMeybohm wrote: >> - If the above is not possible could we mitigate this problem by over-allocating resources (increas

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-03-31 Thread dcausse
dcausse moved this task from Ready for Development to Needs review on the Discovery-Search (Current work) board. dcausse added a comment. Tentatively moving this ticket to //needs review// as I'm not sure sure we can do much more from the search team perspective. I think the last point

[Wikidata-bugs] [Maniphest] T305068: Alert when flink does not have the number of expected task managers

2022-03-31 Thread dcausse
dcausse claimed this task. dcausse moved this task from Incoming to Needs review on the Discovery-Search (Current work) board. TASK DETAIL https://phabricator.wikimedia.org/T305068 WORKBOARD https://phabricator.wikimedia.org/project/board/1227/ EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] T209859: Wikidata autocomplete (wbsearchentities) results with score <= 0

2022-03-30 Thread dcausse
dcausse removed EJoseph as the assignee of this task. dcausse added a subscriber: EJoseph. TASK DETAIL https://phabricator.wikimedia.org/T209859 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: EJoseph, Liuxinyu970226, dcausse, Smalyshev

[Wikidata-bugs] [Maniphest] T209859: Wikidata autocomplete (wbsearchentities) results with score <= 0

2022-03-30 Thread dcausse
dcausse moved this task from Wikibase Search to needs triage on the Discovery-Search board. dcausse assigned this task to EJoseph. TASK DETAIL https://phabricator.wikimedia.org/T209859 WORKBOARD https://phabricator.wikimedia.org/project/board/1849/ EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] T238751: Only generate maxlag from pooled query service servers.

2022-03-30 Thread dcausse
dcausse added a project: Discovery-Search. TASK DETAIL https://phabricator.wikimedia.org/T238751 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Joe, dcausse Cc: karapayneWMDE, Lucas_Werkmeister_WMDE, Ladsgroup, Gehel, Jheald, Joe, Addshore

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-03-30 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T301147 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: elukey, akosiaris, Gehel, RKemper, bking, toan, Addshore, JMeybohm, Michael, Aklapper, dcausse

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-03-30 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T301147 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: elukey, akosiaris, Gehel, RKemper, bking, toan, Addshore, JMeybohm, Michael, Aklapper, dcausse

[Wikidata-bugs] [Maniphest] T305068: Alert when flink does not have the number of expected task managers

2022-03-30 Thread dcausse
dcausse created this task. dcausse added projects: Wikidata-Query-Service, Wikidata, Discovery-Search (Current work). TASK DESCRIPTION As a maintainer of a flink session cluster I want to be alerted when the number of taskmanagers is not what the deployment expects so that I can react

[Wikidata-bugs] [Maniphest] T294076: Blazegraph and MariaDB contain different sitelinks at Wikidata

2022-03-29 Thread dcausse
dcausse moved this task from Waiting to Needs Reporting on the Discovery-Search (Current work) board. dcausse added a comment. The reconciliation process is running and should auto-correct missed updates couple hours after they're performed. I also fixed the inconsistencies listed here

[Wikidata-bugs] [Maniphest] T302494: The WDQS Streaming Updater should use S3 to access thanos-swift instead of the native swift protocol

2022-03-29 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302494 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: RKemper, dcausse Cc: bking, Aklapper, dcausse, Fernandobacasegua34, Astuthiodit_1, 786, Suran38, Biggs657

[Wikidata-bugs] [Maniphest] T302494: The WDQS Streaming Updater should use S3 to access thanos-swift instead of the native swift protocol

2022-03-29 Thread dcausse
dcausse moved this task from In Progress to Needs Reporting on the Discovery-Search (Current work) board. dcausse added a comment. Moved remaining work in T304914 <https://phabricator.wikimedia.org/T304914>. TASK DETAIL https://phabricator.wikimedia.org/T302494 WORKBOARD

[Wikidata-bugs] [Maniphest] T302494: The WDQS Streaming Updater should use S3 to access thanos-swift instead of the native swift protocol

2022-03-29 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302494 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: RKemper, dcausse Cc: bking, Aklapper, dcausse, Fernandobacasegua34, Astuthiodit_1, 786, Suran38, Biggs657

[Wikidata-bugs] [Maniphest] T304914: Remove the presto client for swift from the flink image

2022-03-29 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION As a maintainer of a flink session cluster I want to stop using the presto client for swift present in the flink image so that I can migrate

[Wikidata-bugs] [Maniphest] T242453: Deadlock in blazegraph blocking all queries and updates

2022-03-28 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T242453 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: RKemper, bking, RLazarus, Legoktm, Gehel, William_Avery, CDanis, Addshore, dcausse, Aklapper

[Wikidata-bugs] [Maniphest] T242453: Deadlock in blazegraph blocking all queries and updates

2022-03-28 Thread dcausse
dcausse reopened this task as "Open". dcausse added a comment. re-opening, seems to happen more frequently TASK DETAIL https://phabricator.wikimedia.org/T242453 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: RLazarus, Lego

[Wikidata-bugs] [Maniphest] T303256: WDQS servers should use skolem for wikibaseSomeValueMode

2022-03-15 Thread dcausse
dcausse moved this task from Needs review to Needs Reporting on the Discovery-Search (Current work) board. dcausse added a comment. `wikibase:isSomeValue` is functioning properly again. TASK DETAIL https://phabricator.wikimedia.org/T303256 WORKBOARD https://phabricator.wikimedia.org

[Wikidata-bugs] [Maniphest] T302396: Investigate EOFException when performing the first checkpoint after restoring from a savepoint

2022-03-15 Thread dcausse
dcausse added a comment. S3 <https://phabricator.wikimedia.org/S3> is confirmed to have fixed this issue, all jobs are now running 0.3.104 of the streaming-updater and are using the s3 client to persist their durable state. TASK DETAIL https://phabricator.wikimedia.org/T302396

[Wikidata-bugs] [Maniphest] T302494: The WDQS Streaming Updater should use S3 to access thanos-swift instead of the native swift protocol

2022-03-15 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302494 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: RKemper, dcausse Cc: bking, Aklapper, dcausse, Fernandobacasegua34, Astuthiodit_1, 786, Suran38, Biggs657

[Wikidata-bugs] [Maniphest] T302494: The WDQS Streaming Updater should use S3 to access thanos-swift instead of the native swift protocol

2022-03-15 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302494 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: RKemper, dcausse Cc: bking, Aklapper, dcausse, Fernandobacasegua34, Astuthiodit_1, 786, Suran38, Biggs657

[Wikidata-bugs] [Maniphest] T302830: query service: (Alert) Reduced availability for job jmx_wdqs_updater in eqiad

2022-03-14 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302830 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: RKemper, dcausse Cc: Aklapper, RKemper, Astuthiodit_1, karapayneWMDE, Invadibot, MPhamWMF, maantietaja

[Wikidata-bugs] [Maniphest] T303256: WDQS servers should use skolem for wikibaseSomeValueMode

2022-03-14 Thread dcausse
dcausse added a comment. might be accidentally fixed by merging https://gerrit.wikimedia.org/r/c/operations/puppet/+/742670 (it's still unclear why it's broken in the first place) TASK DETAIL https://phabricator.wikimedia.org/T303256 EMAIL PREFERENCES https://phabricator.wikimedia.org

[Wikidata-bugs] [Maniphest] T302494: The WDQS Streaming Updater should use S3 to access thanos-swift instead of the native swift protocol

2022-03-14 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302494 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: RKemper, dcausse Cc: Aklapper, dcausse, Fernandobacasegua34, Astuthiodit_1, 786, Suran38, Biggs657

[Wikidata-bugs] [Maniphest] T293063: Write and adapt Runbooks and cookbooks related to the WDQS Streaming Updater and kubernetes

2022-03-14 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T293063 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: JMeybohm, Jelto, Aklapper, jijiki, dcausse, Astuthiodit_1, Arnoldokoth, karapayneWMDE, Invadibot

[Wikidata-bugs] [Maniphest] T293063: Write and adapt Runbooks and cookbooks related to the WDQS Streaming Updater and kubernetes

2022-03-14 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T293063 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: JMeybohm, Jelto, Aklapper, jijiki, dcausse, Astuthiodit_1, Arnoldokoth, karapayneWMDE, Invadibot

[Wikidata-bugs] [Maniphest] T303256: WDQS servers should use skolem for wikibaseSomeValueMode

2022-03-08 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T303256 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: Aklapper, dcausse, MPhamWMF, CBogen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst

[Wikidata-bugs] [Maniphest] T303256: WDQS servers should use skolem for wikibaseSomeValueMode

2022-03-08 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T303256 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: Aklapper, dcausse, MPhamWMF, CBogen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst

[Wikidata-bugs] [Maniphest] T303256: WDQS servers should use skolem for wikibaseSomeValueMode

2022-03-08 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION The function `wikibase:isSomeValue` should filter skolem and not blank nodes on wdqs servers. It seems that this setting was recently acctidentaly

[Wikidata-bugs] [Maniphest] T293862: Investigate using jvmquake to limit the time a JVM is unusable due to GC overhead

2022-03-07 Thread dcausse
dcausse added a comment. Pushed https://gitlab.wikimedia.org/repos/search-platform/jvmquake/-/merge_requests/1 (up for review) to have a debian package that we could install on production machines. TASK DETAIL https://phabricator.wikimedia.org/T293862 EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] T302494: The WDQS Streaming Updater should use S3 to access thanos-swift instead of the native swift protocol

2022-03-07 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302494 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: RKemper, dcausse Cc: Aklapper, dcausse, Fernandobacasegua34, 786, Suran38, Biggs657, karapayneWMDE, Invadibot

[Wikidata-bugs] [Maniphest] T279541: Add a reconciliation strategy to the wdqs streaming updater

2022-02-25 Thread dcausse
dcausse added a comment. Deployment of this feature has been stopped due to T302340 <https://phabricator.wikimedia.org/T302340>. TASK DETAIL https://phabricator.wikimedia.org/T279541 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcau

[Wikidata-bugs] [Maniphest] T294076: Blazegraph and MariaDB contain different sitelinks at Wikidata

2022-02-25 Thread dcausse
dcausse merged a task: T302458: Q108896181 keeps showing up as having zero statements. dcausse added subscribers: Sjoerddebruin, Lucas_Werkmeister_WMDE. TASK DETAIL https://phabricator.wikimedia.org/T294076 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] T302458: Q108896181 keeps showing up as having zero statements

2022-02-25 Thread dcausse
dcausse closed this task as a duplicate of T294076: Blazegraph and MariaDB contain different sitelinks at Wikidata. TASK DETAIL https://phabricator.wikimedia.org/T302458 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: dcausse

[Wikidata-bugs] [Maniphest] T302458: Q108896181 keeps showing up as having zero statements

2022-02-25 Thread dcausse
dcausse added a comment. Thanks for the report, these inconsistencies are due to missed updates which will be automatically fixed once T279541 <https://phabricator.wikimedia.org/T279541> is deployed. I'm marking this ticket as a duplicate of T294076 <https://phabricator.wiki

[Wikidata-bugs] [Maniphest] T302494: The WDQS Streaming Updater should use S3 to access thanos-swift instead of the native swift protocol

2022-02-24 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION Followup of T302396 <https://phabricator.wikimedia.org/T302396>. The thanos-swift cluster is S3 <https://phabricator.wikimed

[Wikidata-bugs] [Maniphest] T302396: Investigate EOFException when performing the first checkpoint after restoring from a savepoint

2022-02-23 Thread dcausse
dcausse added a comment. Root cause seems swift related: Saw this in taskmanager logs: `Received IOException while reading 'swift://rdf-streaming-updater-codfw.thanos-swift/wikidata/savepoints/savepoint-0d1c37-86ed4cb29023/bc3ce8ed-70b2-4e91-a81b-07f585dd0f1f', attempting to reopen

[Wikidata-bugs] [Maniphest] T302396: Investigate EOFException when performing the first checkpoint after restoring from a savepoint

2022-02-23 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302396 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: Aklapper, dcausse, karapayneWMDE, Invadibot, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana

[Wikidata-bugs] [Maniphest] T302396: Investigate EOFException when performing the first checkpoint after restoring from a savepoint

2022-02-23 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302396 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: Aklapper, dcausse, karapayneWMDE, Invadibot, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana

[Wikidata-bugs] [Maniphest] T302396: Investigate EOFException when performing the first checkpoint after restoring from a savepoint

2022-02-23 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION While deploying a new version of the streaming-updater (0.3.103) flink failed with: java.io.EOFException at java.base

[Wikidata-bugs] [Maniphest] T302189: Query service retains orphaned sitelinks

2022-02-21 Thread dcausse
dcausse closed this task as "Declined". dcausse added a comment. Sitelink orphans are not clean up at update time for performance reasons, same is done for orphaned `values` and `references`. The database is cleaned up during a full reload which we try to plan once a year.

[Wikidata-bugs] [Maniphest] T301695: Special:EntityData on test-commons.wikimedia.org produces wrong sdcdata prefix in its RDF output

2022-02-14 Thread dcausse
dcausse created this task. dcausse added a project: SDC General. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION When I extract the RDF content using Special:EntityData of an image with a MediaInfo slot on test-commons-wikimedia.org I want it to be configured simarly

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-02-10 Thread dcausse
dcausse added a comment. In T301147#7692414 <https://phabricator.wikimedia.org/T301147#7692414>, @JMeybohm wrote: > In T301147#7689837 <https://phabricator.wikimedia.org/T301147#7689837>, @dcausse wrote: > >> @JMeybohm we're still investigating why the appl

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-02-10 Thread dcausse
dcausse updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T301147 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse Cc: toan, Addshore, JMeybohm, Michael, Aklapper, dcausse, Invadibot, MPhamWMF, maantietaja, CBogen

[Wikidata-bugs] [Maniphest] T293862: Investigate using jvmquake to limit the time a JVM is unusable due to GC overhead

2022-02-09 Thread dcausse
dcausse claimed this task. dcausse moved this task from Ready for Development to In Progress on the Discovery-Search (Current work) board. TASK DETAIL https://phabricator.wikimedia.org/T293862 WORKBOARD https://phabricator.wikimedia.org/project/board/1227/ EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-02-08 Thread dcausse
dcausse added a comment. k8s seems to have tried to kill the container for the whole period according messages like: Container flink-session-cluster-main-taskmanager failed liveness probe, will be restarted <https://logstash.wikimedia.org/app/discover#/doc/logstash-*/logstash-sys

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-02-07 Thread dcausse
dcausse added a subscriber: JMeybohm. dcausse added a comment. @JMeybohm we're still investigating why the application did not properly recover while kubernetes1014 went down but if you have ideas on the two questions in the ticket description this would be very helpful, thanks! TASK DETAIL

[Wikidata-bugs] [Maniphest] T301147: The WDQS streaming updater went unstable for several hours (2022-02-06T23:00:00 - 2022-02-07T06:20:00)

2022-02-07 Thread dcausse
dcausse created this task. dcausse added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION For 7 hours (`2022-02-06T23:00:00` to `2022-02-07T06:20:00`) the streaming updater in `eqiad` stopped working properly preventing edits to flow

[Wikidata-bugs] [Maniphest] T289770: Add hints in response headers for 404 responses in Special:EntityData

2022-02-04 Thread dcausse
dcausse closed this task as "Declined". dcausse added a comment. Thanks for the investigation on this! We don't plan to pursue this route given the added complexity and it's not clear if the benefit is worth the effort, esp. over tuning retries on 404 and the work on reconcili

[Wikidata-bugs] [Maniphest] T299290: Unexpected behavior in federated queries with LinguaLibre in WDQS

2022-02-01 Thread dcausse
dcausse removed projects: Discovery-Search (Current work), Wikidata, Wikidata-Query-Service. dcausse added a comment. Tried to debug this a bit and I believe the problem is on the lingualibre side. I suspect a weird bug happening because of the query length. Query that passes: https

<    1   2   3   4   5   6   7   8   9   10   >