[Wikidata-bugs] [Maniphest] [Updated] T197161: Gather information on users of wb_terms replicas on WMF cloud infrastructure

2018-07-02 Thread CommunityTechBot
CommunityTechBot removed projects: TCB-Team, Mail, New-Editor-Experiences, Language-2018-Apr-June, KartoEditor, JADE, Hashtags, Gamepress, Tamil-Sites, Connected-Open-Heritage-Batch-uploads (RAÄ-KMB_1_2017-02), CheckUser.
TASK DETAILhttps://phabricator.wikimedia.org/T197161EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: CommunityTechBotCc: Lucas_Werkmeister_WMDE, Edgars2007, MusikAnimal, Daniel_Mietchen, Nikki, Magnus, Mahir256, Nikerabbit, GoranSMilovanovic, Lea_Lacroix_WMDE, WMDE-leszek, Lahi, Gq86, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331, AndyTan, Zylc, 1978Gage2001, herron, Chicocvenancio, alanajjar, Tbscho, Lea_WMDE, Mattias_Ostmar-WMSE, JJMC89, Jseddon, Ryuch, Mkdw, RuyP, JEumerus, Trizek-WMF, KasiaWMDE, 0x010C, srodlund, Luke081515, grin, Bsadowski1, mys_721tx, Snowolf, Huji, Gryllida, jayvdb, Tobi_WMDE_SW, revi, scfc, He7d3r, Romaine, Jay8g, Glaisher, Krenair, chasemp___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Updated] T197161: Gather information on users of wb_terms replicas on WMF cloud infrastructure

2018-06-28 Thread WMDE-leszek
WMDE-leszek added a subscriber: Lucas_Werkmeister_WMDE.WMDE-leszek added a comment.
@Lucas_Werkmeister_WMDE is amazing and created a script that finds tools that are somehow using wb_terms. The results are at P7299. The script, for reference, is P7298. Thanks Lucas!
We'll use this input as well. But of course more detailed and personal input like from folks above is always helpful!TASK DETAILhttps://phabricator.wikimedia.org/T197161EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: WMDE-leszekCc: Lucas_Werkmeister_WMDE, Edgars2007, MusikAnimal, Daniel_Mietchen, Nikki, Magnus, Mahir256, Nikerabbit, GoranSMilovanovic, Lea_Lacroix_WMDE, WMDE-leszek, Aklapper, Lahi, Gq86, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Updated] T197161: Gather information on users of wb_terms replicas on WMF cloud infrastructure

2018-06-20 Thread Nikki
Nikki added a comment.
I don't have any tools but I have used the wb_terms table in Quarry a number of times. I am usually trying to select all terms for a particular language, all terms which match a particular regex or to count how many terms there are. Things like finding labels containing disambiguation information, finding descriptions written like sentences, finding terms containing HTML entities, finding labels which have namespace prefixes when they shouldn't or vice versa, finding misspelt words, listing the most common descriptions for a language...

I use the columns term_full_entity_id (or term_entity_id in older queries), term_entity_type, term_language, term_type and term_text.

It is usually not possible to use SPARQL because the queries are too slow and the timeout for queries in the query service is much lower than for Quarry. In particular, querying for all terms in a particular language is very slow which I already created a ticket for - T167361 .TASK DETAILhttps://phabricator.wikimedia.org/T197161EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: NikkiCc: Nikki, Magnus, Mahir256, Nikerabbit, GoranSMilovanovic, Lea_Lacroix_WMDE, WMDE-leszek, Aklapper, Lahi, Gq86, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs