[Wikidata-bugs] [Maniphest] T361950: Ensure that WDQS query throttling does not interfere with federation
TJones set the point value for this task to "3". TASK DETAIL https://phabricator.wikimedia.org/T361950 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Daniel_Mietchen, Aklapper, dcausse, Danny_Benjafield_WMDE, S8321414, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, KimKelting, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T361950: Ensure that WDQS query throttling does not interfere with federation
TJones updated the task description. TJones removed the point value for this task. TASK DETAIL https://phabricator.wikimedia.org/T361950 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Daniel_Mietchen, Aklapper, dcausse, Danny_Benjafield_WMDE, S8321414, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, KimKelting, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T352538: [EPIC] Evaluate the impact of the graph split
TJones renamed this task from "Evaluate the impact of the graph split" to "[EPIC] Evaluate the impact of the graph split". TJones added a project: Epic. TASK DETAIL https://phabricator.wikimedia.org/T352538 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, Gehel, me, Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, BeautifulBold, Suran38, karapayneWMDE, Invadibot, maantietaja, Peteosx1x, NavinRizwi, ItamarWMDE, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Dinoguy1000, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T349246: Bad ranking of Wikidata item search results on Special:Search when non-default namespaces are included
TJones added a comment. Somewhat unfortunately, this is the expected behavior. There's a specialized query for Wikidata items that doesn't work with other namespaces. When you include other namespaces, we have to fall back to a less good query that works consistently across namespaces and allows us to merge results from multiple namespaces. That query also allows us to search all namespaces with a single request. In the //Finno-Ugric// case, there aren't any results from other namespaces, but you still get the ranking from the less good query. There are approaches to merging result lists that were scored with different scoring methods, but we haven't seriously investigated doing that for on-wiki search. There is also the expense of running multiple queries before you can merge their results, which can make such queries much more expensive. Reframing the situation (and this is approximately what actually happened from the programmers' point of view), if you limit yourself to default namespaces, we can run a specialized query that uses much better ranking for Wikidata items. TASK DETAIL https://phabricator.wikimedia.org/T349246 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: TJones, Aklapper, Nikki, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T346456: Improve concurrency limits configuration of the wdqs updater
TJones moved this task from needs triage to Current work on the Discovery-Search board. TJones edited projects, added Discovery-Search (Current work); removed Discovery-Search. TASK DETAIL https://phabricator.wikimedia.org/T346456 WORKBOARD https://phabricator.wikimedia.org/project/board/1849/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, bking, Clement_Goubert, dcausse, Danny_Benjafield_WMDE, Kappakayala, Astuthiodit_1, AWesterinen, Arnoldokoth, karapayneWMDE, Invadibot, maantietaja, wkandek, JMeybohm, ItamarWMDE, Akuckartz, Nandana, Namenlos314, jijiki, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T302494: The WDQS Streaming Updater should use S3 to access thanos-swift instead of the native swift protocol
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T302494 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, dcausse, Fernandobacasegua34, 786, Suran38, Biggs657, karapayneWMDE, Invadibot, Lalamarie69, MPhamWMF, maantietaja, Juan90264, Alter-paule, Beast1978, CBogen, Un1tY, Akuckartz, Hook696, Kent7301, joker88john, CucyNoiD, Nandana, Namenlos314, Gaboe420, Giuliamocci, Cpaulf30, Lahi, Gq86, Af420, Bsandipan, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, Neuronton, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T293862: Investigate using jvmquake to limit the time a JVM is unusable due to GC overhead
TJones removed the point value for this task. TASK DETAIL https://phabricator.wikimedia.org/T293862 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, dcausse, Invadibot, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T293862: Investigate using jvmquake to limit the time a JVM is unusable due to GC overhead
TJones set the point value for this task to "5". TASK DETAIL https://phabricator.wikimedia.org/T293862 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, dcausse, Invadibot, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T294076: Blazegraph and MariaDB contain different sitelinks at Wikidata
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T294076 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: RShigapov, Aklapper, Invadibot, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T288230: Promote MediaInfo RDF format to stable
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T288230 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, Zbyszko, Invadibot, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287231: Consider moving WDQS "munging" of RDF into Wikibase RDF output code
TJones renamed this task from "Consider moving WDQS "munging" of RDF into WIkibase RDF output code" to "Consider moving WDQS "munging" of RDF into Wikibase RDF output code". TASK DETAIL https://phabricator.wikimedia.org/T287231 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: RhinosF1, Aklapper, Addshore, Invadibot, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T281359: Onboard teams with Grafana alerts to AlertManager
TJones renamed this task from "Onboard teams with Grafana alerts to AM" to "Onboard teams with Grafana alerts to AlertManager". TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T281359 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: fgiunchedi, Aklapper, Invadibot, MPhamWMF, maantietaja, lmata, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, herron, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Chicocvenancio, QZanden, EBjune, merbst, LawExplorer, Volans, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T281454: Onboard teams with Prometheus-based alerts to AlertManager
TJones renamed this task from "Onboard teams with Prometheus-based alerts to AM" to "Onboard teams with Prometheus-based alerts to AlertManager". TASK DETAIL https://phabricator.wikimedia.org/T281454 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: fgiunchedi, Aklapper, Invadibot, MPhamWMF, maantietaja, lmata, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, herron, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Chicocvenancio, QZanden, EBjune, merbst, LawExplorer, Volans, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T238796: Entity and non-entity edits should use different maxlag
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T238796 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Addshore, Aklapper, Bugreporter, Invadibot, MPhamWMF, maantietaja, FRomeo_WMF, CBogen, Nintendofan885, Akuckartz, Nandana, JKSTNK, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Lydia_Pintscher, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T275251: Rest Search API is not wikidata aware (only accepts queries beginning with Q)
TJones renamed this task from "Rest Search API is not wikidata aware (only accepts queries beginning with Q" to "Rest Search API is not wikidata aware (only accepts queries beginning with Q)". TASK DETAIL https://phabricator.wikimedia.org/T275251 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Yair_rand, MPhamWMF, ovasileva, Addshore, Lydia_Pintscher, Aklapper, Jdlrobson, Selby, caldera, maantietaja, Akuckartz, Demian, darthmon_wmde, WDoranWMF, holger.knust, EvanProdromou, DannyS712, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, pmiazga, LawExplorer, JJMC89, Iniquity, _jensen, rosalieper, Agabi10, Scott_WUaS, Pchelolo, Volker_E, Niedzielski, Izno, abian, Wikidata-bugs, aude, GWicke, Dinoguy1000, Mbch331, Jay8g ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T276750: Add means to upgrade the flink code even when incompatible serialization changes are involved
TJones renamed this task from "Add a mean to upgrade the flink code even when incompatible serialization changes are involved" to "Add means to upgrade the flink code even when incompatible serialization changes are involved". TASK DETAIL https://phabricator.wikimedia.org/T276750 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse, TJones Cc: Aklapper, dcausse, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, abian, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T275782: "Execute Query" button on WDQS not visible for long text query
TJones renamed this task from ""Execute Query" button on WDQS not vissible for long text query" to ""Execute Query" button on WDQS not visible for long text query". TASK DETAIL https://phabricator.wikimedia.org/T275782 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Bouzinac, amy_rc, Lydia_Pintscher, Aklapper, Mohammed_Sadat_WMDE, MPhamWMF, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, abian, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T270614: Automatically depool wdqs servers that are "lagged"
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T270614 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, dcausse, MPhamWMF, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T265290: Rediscover, review, and update the federation input process for WDQS
TJones renamed this task from "Review the federation input process for WDQS" to "Rediscover, review, and update the federation input process for WDQS". TASK DETAIL https://phabricator.wikimedia.org/T265290 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Zache, Bugreporter, Lea_Lacroix_WMDE, Aklapper, Gehel, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T264404: Determine a way of separating truthy queries
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T264404 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Gehel, Aklapper, CBogen, Akuckartz, darthmon_wmde, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T263088: Allow to download WDQS and WCQS results as Excel spreadsheet
TJones triaged this task as "Low" priority. TASK DETAIL https://phabricator.wikimedia.org/T263088 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Akuckartz, Jarekt, Aklapper, CBogen, darthmon_wmde, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T263088: Allow to download WDQS and WCQS results as Excel spreadsheet
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T263088 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Akuckartz, Jarekt, Aklapper, CBogen, darthmon_wmde, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T257058: Review / Improve Search Platform team documentation
TJones added a comment. Help:CirrusSearch <https://www.mediawiki.org/wiki/Help:CirrusSearch> is very long, but also marked up for translation, so I'm trying to edit very lightly, since any edit requires translation in dozens of languages. Parts of the page are very out of date, however. It is slow going. TASK DETAIL https://phabricator.wikimedia.org/T257058 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: CBogen, EBernhardson, dcausse, TJones, Gehel, Aklapper, Zbyszko, NavinRizwi, Akuckartz, apaskulin, Pavithraes, darthmon_wmde, DannyS712, Nandana, Namenlos314, Cpaulf30, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Ivana_Isadora, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Dinoguy1000, Manybubbles, Mbch331, Jay8g ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T257058: Review / Improve Search Platform team documentation
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T257058 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: CBogen, EBernhardson, dcausse, TJones, Gehel, Aklapper, Zbyszko, NavinRizwi, Akuckartz, apaskulin, Pavithraes, darthmon_wmde, DannyS712, Nandana, Namenlos314, Cpaulf30, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Ivana_Isadora, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Dinoguy1000, Manybubbles, Mbch331, Jay8g ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T257058: Review / Improve Search Platform team documentation
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T257058 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: CBogen, EBernhardson, dcausse, TJones, Gehel, Aklapper, Zbyszko, NavinRizwi, Akuckartz, apaskulin, Pavithraes, darthmon_wmde, DannyS712, Nandana, Namenlos314, Cpaulf30, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Ivana_Isadora, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Dinoguy1000, Manybubbles, Mbch331, Jay8g ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T257058: Review / Improve Search Platform team documentation
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T257058 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: CBogen, EBernhardson, dcausse, TJones, Gehel, Aklapper, Zbyszko, NavinRizwi, Akuckartz, apaskulin, Pavithraes, darthmon_wmde, DannyS712, Nandana, Namenlos314, Cpaulf30, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Ivana_Isadora, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Dinoguy1000, Manybubbles, Mbch331, Jay8g ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T257058: Review / Improve Search Platform team documentation
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T257058 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: CBogen, EBernhardson, dcausse, TJones, Gehel, Aklapper, Zbyszko, NavinRizwi, Akuckartz, apaskulin, Pavithraes, darthmon_wmde, DannyS712, Nandana, Namenlos314, Cpaulf30, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Ivana_Isadora, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Dinoguy1000, Manybubbles, Mbch331, Jay8g ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T257058: Review / Improve Search Platform team documentation
TJones triaged this task as "Medium" priority. TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T257058 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: CBogen, EBernhardson, dcausse, TJones, Gehel, Aklapper, Zbyszko, NavinRizwi, Akuckartz, apaskulin, Pavithraes, darthmon_wmde, DannyS712, Nandana, Namenlos314, Cpaulf30, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Ivana_Isadora, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Dinoguy1000, Manybubbles, Mbch331, Jay8g ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T257058: Review / Improve Search Platform team documentation
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T257058 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: CBogen, EBernhardson, dcausse, TJones, Gehel, Aklapper, Zbyszko, NavinRizwi, Akuckartz, apaskulin, Pavithraes, darthmon_wmde, DannyS712, Nandana, Namenlos314, Cpaulf30, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Ivana_Isadora, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Dinoguy1000, Manybubbles, Mbch331, Jay8g ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T257058: Review / Improve Search Platform team documentation
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T257058 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: CBogen, EBernhardson, dcausse, TJones, Gehel, Aklapper, Zbyszko, NavinRizwi, Akuckartz, apaskulin, Pavithraes, darthmon_wmde, DannyS712, Nandana, Namenlos314, Cpaulf30, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Ivana_Isadora, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Dinoguy1000, Manybubbles, Mbch331, Jay8g ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T141080: Wikidata items with two coordinates do not show up in geosearch
TJones triaged this task as "Medium" priority. TASK DETAIL https://phabricator.wikimedia.org/T141080 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: TheDJ, JeanFred, Braveheart, edwardbetts, Sjoerddebruin, Aklapper, simon04, CBogen, Akuckartz, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, LawExplorer, TerraCodes, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, MaxSem, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T232565: case-sensitive equivalent of haswbstatement
TJones triaged this task as "Low" priority. TASK DETAIL https://phabricator.wikimedia.org/T232565 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Mvolz, Aklapper, Bugreporter, CBogen, Akuckartz, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T248449: [WDQS Streaming Updater] Add error handling for Streaming Updater
TJones updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T248449 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: dcausse, Aklapper, Zbyszko, CBogen, Akuckartz, darthmon_wmde, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T249196: Test the impact of the wdqs updater performance by disabling values cleanup
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T249196 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse, TJones Cc: Aklapper, dcausse, CBogen, darthmon_wmde, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T248101: WDQS query logs lack http.client_ip
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T248101 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse, TJones Cc: Aklapper, dcausse, CBogen, darthmon_wmde, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T246524: Current time displayed as modification date
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T246524 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EBernhardson, TJones Cc: EBernhardson, Lydia_Pintscher, Aklapper, Bugreporter, CBogen, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, Jayprakash12345, QZanden, EBjune, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wong128hk, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Unblock] T221632: Storage capacity upgrade for WDQS
TJones closed subtask T246343: Service implementation on wdqs200[7-8].codfw.wmnet as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T221632 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Gehel, TJones Cc: wiki_willy, mark, RobH, faidon, Smalyshev, Aklapper, Gehel, CBogen, darthmon_wmde, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T246343: Service implementation on wdqs200[7-8].codfw.wmnet
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T246343 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Gehel, TJones Cc: elukey, Aklapper, Gehel, CBogen, darthmon_wmde, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Unblock] T221916: Create RDF export for structured data stored for files
TJones closed subtask T222321: Make /entity/ alias work for Commons as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T221916 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, TJones Cc: DD063520, WMDE-leszek, Poyekhali, Steinsplitter, Aklapper, Lydia_Pintscher, Tgr, Ramsey-WMF, Jarekt, Addshore, Tpt, Salgo60, Lucas_Werkmeister_WMDE, Smalyshev, CBogen, darthmon_wmde, Nandana, JKSTNK, Lahi, PDrouin-WMF, Gq86, E1presidente, Cparle, Anooprao, SandraF_WMF, GoranSMilovanovic, QZanden, EBjune, Tramullas, Acer, merbst, LawExplorer, Silverfish, _jensen, rosalieper, Taiwania_Justo, Scott_WUaS, Cirdan, Jonas, Xmlizer, Susannaanas, Ixocactus, Wong128hk, Jane023, jkroll, Wikidata-bugs, Jdouglas, Base, matthiasmullie, aude, Tobias1984, El_Grafo, Dinoguy1000, Manybubbles, Ricordisamoa, Wesalius, Fabrice_Florin, Raymond, Jdforrester-WMF, Mbch331, Keegan ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T222321: Make /entity/ alias work for Commons
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T222321 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Gehel, TJones Cc: EBernhardson, Salgo60, Cparle, Multichill, dcausse, Abbe98, Lucas_Werkmeister_WMDE, Tpt, Addshore, Ramsey-WMF, Lydia_Pintscher, Aklapper, Smalyshev, CBogen, Iflorez, darthmon_wmde, alaa_wmde, Nandana, JKSTNK, Lahi, PDrouin-WMF, Gq86, E1presidente, Anooprao, SandraF_WMF, GoranSMilovanovic, QZanden, EBjune, Tramullas, Acer, merbst, LawExplorer, Silverfish, Poyekhali, _jensen, rosalieper, Taiwania_Justo, Scott_WUaS, Jonas, Xmlizer, Susannaanas, Ixocactus, suriyaa, Wong128hk, Jane023, jkroll, Wikidata-bugs, Jdouglas, Base, matthiasmullie, aude, Tobias1984, El_Grafo, Dinoguy1000, Manybubbles, Ricordisamoa, Wesalius, Southparkfan, Fabrice_Florin, Raymond, Jdforrester-WMF, Steinsplitter, Mbch331, Rxy, Glaisher, Krenair, Keegan ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T239750: org.wikidata.query.rdf.tool.Updater - Importer error: ConcurrentModificationException: KafkaConsumer is not safe for multi-threaded access
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T239750 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse, TJones Cc: dcausse, Aklapper, CBogen, darthmon_wmde, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T196165: Commons image: when pasting the exact title, get the correct file first in the suggester
TJones added a comment. We talked about this in our team meeting today, and @dcausse opened T245642 <https://phabricator.wikimedia.org/T245642> after our discussion. We also uncovered the difference between these two queries: - `Barcelonnette - Villa du Parc du Mercantour -984` (12 results) - `Barcelonnette - Villa du Parc du Mercantour -984.jpg` (13 results, including the desired one) We have //another// index of titles and the second query above (with `.jpg`) is an exact match (after parsing) of the title of the file, so it gets added into the mix. It doesn't score well enough to make it to the top of the list, but at least it is there. The current boosting of the `near_match` is 2, which was set a long time ago. Many changes have been made since then. Increasing the boost to 10 should improve the ranking of most title matches. It may not cover every possible case, but it should raise many of these to the #1 spot, and it should raise many others into the top 10 so that the P18 <https://phabricator.wikimedia.org/P18> patch above can find them and elevate them the rest of the way. (That patch will still be very helpful in cases like the //Eglise Notre Dame de l'Assomption// searches above, since all of those files will appear identical to the `near_match` scoring.) I'll also copy over some examples from here as test cases for T245642 <https://phabricator.wikimedia.org/T245642>. TASK DETAIL https://phabricator.wikimedia.org/T196165 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Silvan_WMDE, TJones Cc: hoo, EBernhardson, TJones, dcausse, Ladsgroup, Silvan_WMDE, Addshore, Bencemac, Aklapper, Ayack, Liuxinyu970226, Smalyshev, Lydia_Pintscher, Lea_Lacroix_WMDE, Iflorez, darthmon_wmde, alaa_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T196165: Commons image: when pasting the exact title, get the correct file first in the suggester
TJones added a comment. I think we both have incorrect models of what's happening! **//Neat!//** I thought the apparent negation was removing the target file and something else on the Javascript side was putting it back in the wrong spot. You seem to have thought the Search API was putting the exact match specifically in the last spot. (But maybe not.) Turns out neither is correct. I'm still not 100% sure what's going on with the apparent negation. Without the //.jpg// at the end, the negation does in fact prevent the target from showing up. With it, it does show up, but not necessarily at the end. In fact, more often than not, the target is the first result! I //think// what is happening is that the exact match //is// being injected into the list on the backend, but not at any particular spot on the list. It gets a score and the result is whatever it is. In the cases of the `Barcelonnette - Villa du Parc du Mercantour...` searches, they happen to end up 10th out of 10. Others are first: - Benouville Churchyard -9.JPG <https://commons.wikimedia.org/w/api.php?action=query&list=search&srnamespace=6&srlimit=100&format=json&srsort=relevance&srsearch=Benouville%20Churchyard%20-9.JPG> - Rue des grands carmes 5 -9.jpg <https://commons.wikimedia.org/w/api.php?action=query&list=search&srnamespace=6&srlimit=100&format=json&srsort=relevance&srsearch=Rue%20des%20grands%20carmes%205%20-9.jpg> - Gmunden Kammerhofgasse 3 Arkadenhof -9173.jpg - Dug-out Zonnebeke -12.jpg Others are neither first nor last: - 2nd: Bourges - avenue du 95e-de-Ligne - Portail Saint-Ursin -991.jpg - 5th: Lannes (Lot-et-Garonne) - Église Sainte-Marie - Vitraux -9.JPG - 6th: Steyr Michaelerkirche Bürgerspital -9659.jpg <https://commons.wikimedia.org/w/api.php?action=query&list=search&srnamespace=6&srlimit=100&format=json&srsort=relevance&srsearch=Steyr%20Michaelerkirche%20B%C3%BCrgerspital%20-9659.jpg> - 34th: Eglise Notre Dame de l'Assomption.JPG <https://commons.wikimedia.org/w/api.php?action=query&list=search&srnamespace=6&srlimit=100&format=json&srsort=relevance&srsearch=Eglise%20Notre-Dame-de-l%27Assomption.JPG> So, the question now is whether we should try to modify search to push these results higher, overriding the current scoring method in some way, or whether the P18 search box should try to find any exact match, and either move it or insert it at the top of the list. The easiest solution, from my point of view, would be to use the completion suggester API instead of the regular search API for search-as-you-type, since it was designed for that. It does the exact right thing for all of these cases—but it only works for title/name–matching. Would it be possible to add a toggle to the P18 search box UI to allow the user to specify whether they are searching for a title match or doing a general search? (Just brainstorming.. but that might be the best of both worlds.) Modifying the search results might take a fairly long time to get on our schedule (likely), might slow down search (not sure), and might be subject to failure at a later date or causing weird side effects for some other search (depending on implementation). I'll put this on the agenda for our team meeting next Monday or Wednesday, and get an update back to you afterward. TASK DETAIL https://phabricator.wikimedia.org/T196165 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Silvan_WMDE, TJones Cc: EBernhardson, TJones, dcausse, Ladsgroup, Silvan_WMDE, Addshore, Bencemac, Aklapper, Ayack, Liuxinyu970226, Smalyshev, Lydia_Pintscher, Lea_Lacroix_WMDE, Iflorez, darthmon_wmde, alaa_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T241291: Simplify WDQS Packaging
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T241291 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Mstyles, TJones Cc: Jdforrester-WMF, WMDE-leszek, Mathew.onipe, Tarrow, Lucas_Werkmeister_WMDE, Ladsgroup, Gehel, akosiaris, Addshore, Aklapper, Mstyles, Un1tY, Hook696, Daryl-TTMG, RomaAmorRoma, 0010318400, E.S.A-Sheild, darthmon_wmde, Meekrab2012, joker88john, ET4Eva, CucyNoiD, Nandana, NebulousIris, Gaboe420, Versusxo, Majesticalreaper22, Giuliamocci, Adrian1985, Cpaulf30, Lahi, Gq86, Af420, Darkminds3113, Bsandipan, Lordiis, GoranSMilovanovic, Adik2382, Th3d3v1ls, Ramalepe, Liugev6, QZanden, EBjune, merbst, LawExplorer, WSH1906, Avner, Lewizho99, Maathavan, _jensen, rosalieper, Scott_WUaS, Jonas, FloNight, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T196165: Commons image: when pasting the exact title, get the correct file first in the suggester
TJones added a comment. > Brilliant answer, thanks for the insights and also for your lovely blog post. Glad to help. I'm always happy to try to figure out what's going on with unexpected search results. > I agree that some kind of extra algorithm must be putting the exact file match to the bottom of the list - moving it to the top would be perfect. If search platform team can do that, please provide a quick feedback as to how and when it could happen. I don't think it's happening on the search side. Looking at the Javascript behind the P18 search in my browser, I see calls to Special:ItemByTitle <https://commons.wikimedia.org/wiki/Special:ItemByTitle> (search the code for //ItemByTitle,// the colon can be url-encoded). I'm not up to speed on modern Javascript and the code is very complex, but I think that might be the call that's fetching the exact title match and appending it to the list. Prepending it might do the trick, but someone much more proficient in Javascript and with better dev tools should take a look. TASK DETAIL https://phabricator.wikimedia.org/T196165 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Silvan_WMDE, TJones Cc: EBernhardson, TJones, dcausse, Ladsgroup, Silvan_WMDE, Addshore, Bencemac, Aklapper, Ayack, Liuxinyu970226, Smalyshev, Lydia_Pintscher, Lea_Lacroix_WMDE, Iflorez, darthmon_wmde, alaa_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Subscribers] T196165: Commons image: when pasting the exact title, get the correct file first in the suggester
TJones added subscribers: dcausse, TJones. TJones added a comment. I'm not 100% sure how the P18 search is configured, but I think I see generally what's going on. The search is using basically the same search as you get on the Commons Special:Search <https://commons.wikimedia.org/w/index.php?search=&title=Special:Search&go=Go> page, possibly with restriction to the `File:` namespace, and maybe with an additional hack or two. As such, it is interpreting the input string as search syntax. I searched for `dog` and then negated one word in the first result in each new result set until I got to this: `dog -racing -chien -reflection -stick -unid -natural -drone -oak`. Obviously that's unlikely to match any file names. Thus, any partial file names like this one—`Barcelonnette - Villa du Parc du Mercantour -984`—are not going match very well because the `-984` is interpreted as //not 984,// thus excluding the desired file. I think there is an additional hack added so that the full exact file name `Barcelonnette - Villa du Parc du Mercantour -984.jpg` matches and is tacked onto the end. In T196165#5643614 <https://phabricator.wikimedia.org/T196165#5643614>, @Lea_Lacroix_WMDE wrote: > This issue was mentioned again here <https://www.wikidata.org/wiki/Wikidata:Contact_the_development_team#Image_not_shown_in_suggester_when_typing_exact_name>, with the case that the correct image doesn't show up at all in the suggester's list. Could we look at it again? The discussion has since been moved to an archive page here <https://www.wikidata.org/wiki/Wikidata:Contact_the_development_team/Archive/2019/11#Image_not_shown_in_suggester_when_typing_exact_name>. The search in question is `Eglise Notre-Dame-de-l'Assomption.JPG`. Here I think the problem is that there are too many files with almost the exact same file name (and many more with some variation of "eglise notre dame de l'assomption" as part of the file name): - File:Eglise Notre Dame de l'Assomption.JPG - File:Église Notre-Dame-de-l'Assomption.jpg - File:Église Notre Dame de l'Assomption.jpg - File:Église Notre-Dame-de-l'assomption.JPG - File:Église Notre dame de l'assomption.jpg - File:Eglise Notre-Dame de l'Assomption.jpg After parsing for search, these are all identical. If you require every word in the title ( intitle:Eglise intitle:Notre intitle:Dame intitle:de intitle:l'Assomption intitle:jpg <https://commons.wikimedia.org/w/index.php?search=intitle%3AEglise+intitle%3ANotre+intitle%3ADame+intitle%3Ade+intitle%3Al%27Assomption+intitle%3Ajpg&title=Special%3ASearch&profile=advanced&fulltext=1&ns6=1> ) you get over 3500 results. That's more than you'd get if you search for "bobby <https://commons.wikimedia.org/w/index.php?search=intitle%3Abobby&title=Special:Search&profile=advanced&fulltext=1&ns6=1>" in the title (less than 2700)—and I would not be surprised if a search for "bobby" failed to return one specific desired result as the top hit—especially if we had files named //Bobby.jpg, bobby.JPG, BOBBY.JPG, BoBbY.JpG,// etc. So, the question is what is the P18 search intended to do? Is it supposed to be a general search that can go poorly in unusual circumstances (unintended negation—which I wrote a blog post <https://blog.wikimedia.org/2017/11/06/searching-techniques/> about a couple of years ago—or unexpectedly ambiguous searches) like the main search on the Special:Search page? Or is it supposed to be a file name/title–matching search like we have in the upper corner on Commons? Or is it trying to be both? If it is a general search, then it is working more-or-less as intended, and these odd corner cases—particularly negation in the title/file name—are going to perform poorly. If it is a file name/title–matching search, then it is using the wrong API, and should use the completion suggester API. @dcausse will be back next week (Feb 17) and he'd probably be the best one to ask about doing that the best way possible—maybe including prefixing searches with "File:" behind the scenes, though that may not be needed. If it's supposed to be both, then the obvious options to me are to either live with the general search as is, or do something much more complicated like interleaving the general search and completion suggester results together. (My hypothesis is that something at least a little like that is already happening since the partly negated `Barcelonnette - Villa du Parc du Mercantour -984.jpg` search gets an exact match at the bottom of the list—maybe moving it to the top of the list of general search results would be sufficient.) TASK DETAIL https://phabricator.wikimedia.org/T196165 EMAIL PREFERENCES https://phabrica
[Wikidata-bugs] [Maniphest] [Created] T240350: Locally override the name of crh from "Crimean Turkish" to "Crimean Tatar"
TJones created this task. TJones added a project: Wikidata. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION The English name for Qırımtatarca in Wikidata is given as "Crimean Turkish", but "Crimean Tatar" is the preferred name in English. The English Wikipedia page on the language has "Crimean Tatar language" as its title (noting "also called Crimean Turkish"). Ethnologue <http://www.ethnologue.com/16/show_language/crh/> lists "Crimean Tatar" as the main name, as does the Library of Congress Subject Headings <http://id.loc.gov/authorities/subjects/sh85034019.html>. CLDR incorrectly uses "Crimean Turkish" and a ticket has been opened upstream <https://unicode-org.atlassian.net/browse/CLDR-10991>; however, they move very slowly, and a local override until CLDR is corrected would be much appreciated. See more discussion here: https://www.wikidata.org/wiki/Wikidata:Project_chat#The_Crimean_Tatar_language See also T189511 <https://phabricator.wikimedia.org/T189511> for a similar request for Mediawiki. TASK DETAIL https://phabricator.wikimedia.org/T240350 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, TJones, darthmon_wmde, DannyS712, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T101013: Log Wikidata Query Service queries to the event gate infrastructure
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T101013 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: dcausse, TJones Cc: Igorkim78, JAllemandou, Ottomata, Smalyshev, Deskana, Aklapper, 4748kitoko, Hook696, Daryl-TTMG, RomaAmorRoma, 0010318400, E.S.A-Sheild, darthmon_wmde, holger.knust, Meekrab2012, joker88john, ET4Eva, DannyS712, CucyNoiD, Nandana, NebulousIris, Akovalyov, Gaboe420, Versusxo, Majesticalreaper22, Giuliamocci, Adrian1985, Cpaulf30, Lahi, Gq86, Af420, Darkminds3113, Bsandipan, Lordiis, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Adik2382, Th3d3v1ls, Ramalepe, Liugev6, QZanden, EBjune, merbst, LawExplorer, WSH1906, Avner, Lewizho99, Maathavan, Gehel, _jensen, rosalieper, Scott_WUaS, Jonas, FloNight, Xmlizer, mobrovac, terrrydactyl, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, GWicke, Manybubbles, Mbch331, jeremyb ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Unblock] T234968: Measure performance impact of code optimization and/or blazegraph settings on real traffic data
TJones closed subtask T101013: Log Wikidata Query Service queries to the event gate infrastructure as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T234968 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: JAllemandou, Mathew.onipe, dcausse, Igorkim78, Aklapper, darthmon_wmde, DannyS712, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Unblock] T141602: [Objective Fiscal 19-20/Q2] (9) Provide a Proof of Concept SPARQL endpoint in support of SDoC project (stretch)
TJones closed subtask T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T141602 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Gehel, Nicolas_Raoul, Mmarx, Multichill, Nemo_bis, Husky, jleedev, Silverfish, Ainali, VIGNERON, JeanFred, Abbe98, ChristianFerrer, Jheald, Lucas_Werkmeister_WMDE, Salgo60, MB-one, Tpt, Addshore, Jarekt, Ramsey-WMF, Tgr, Bugreporter, Lydia_Pintscher, Aklapper, Steinsplitter, Poyekhali, Smalyshev, darthmon_wmde, DannyS712, Nandana, JKSTNK, Lahi, PDrouin-WMF, Gq86, E1presidente, Cparle, Anooprao, SandraF_WMF, GoranSMilovanovic, QZanden, EBjune, Tramullas, Acer, merbst, LawExplorer, _jensen, rosalieper, Taiwania_Justo, Scott_WUaS, Jonas, Xmlizer, Susannaanas, Ixocactus, Wong128hk, Jane023, jkroll, Wikidata-bugs, Jdouglas, Base, matthiasmullie, aude, Tobias1984, El_Grafo, Dinoguy1000, Manybubbles, Ricordisamoa, Wesalius, Fabrice_Florin, Raymond, Mbch331, Keegan ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T238408: Metrics from the wdqs updater are no longer collected
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T238408 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Mathew.onipe, TJones Cc: Ladsgroup, Mathew.onipe, dcausse, Aklapper, darthmon_wmde, DannyS712, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T232297 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Mathew.onipe, TJones Cc: Liuxinyu970226, Gehel, Mathew.onipe, Igorkim78, Aklapper, darthmon_wmde, Legado_Shulgin, DannyS712, Nandana, JKSTNK, Davinaclare77, Qtn1293, Techguru.pc, Lahi, PDrouin-WMF, Gq86, E1presidente, Ramsey-WMF, Cparle, Anooprao, SandraF_WMF, GoranSMilovanovic, Th3d3v1ls, Hfbn0, QZanden, EBjune, Tramullas, Acer, LawExplorer, Salgo60, Zppix, Silverfish, _jensen, rosalieper, Scott_WUaS, Susannaanas, Wong128hk, Jane023, Wikidata-bugs, Base, matthiasmullie, aude, Ricordisamoa, Wesalius, Lydia_Pintscher, Fabrice_Florin, Raymond, faidon, Jdforrester-WMF, Steinsplitter, Mbch331, Rxy, Jay8g, fgiunchedi ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T238232: blazegraph journal on wdqs1005 is oversized
TJones closed this task as "Resolved". TJones claimed this task. TASK DETAIL https://phabricator.wikimedia.org/T238232 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Mathew.onipe, Igorkim78, Gehel, Aklapper, darthmon_wmde, DannyS712, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T213401: Create a cookbook to copy data between WDQS servers
TJones edited projects, added Discovery-Search; removed Discovery-Search (Current work). TASK DETAILhttps://phabricator.wikimedia.org/T213401EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: Mathew.onipe, TJonesCc: Volans, Mathew.onipe, Gehel, Aklapper, Legado_Shulgin, crusnov, Nandana, thifranc, AndyTan, Davinaclare77, Jugando, Qtn1293, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Th3d3v1ls, Hfbn0, QZanden, EBjune, merbst, LawExplorer, Zppix, _jensen, Jonas, Xmlizer, Wong128hk, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, faidon, Mbch331, Jay8g, fgiunchedi___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T208215: Metrics from wdqs updater JMX should be prefixed
TJones edited projects, added Discovery-Search; removed Discovery-Search (Current work). TASK DETAILhttps://phabricator.wikimedia.org/T208215EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: Mathew.onipe, TJonesCc: gerritbot, Smalyshev, Gehel, Mathew.onipe, fgiunchedi, Aklapper, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T211033: Analyze wbsearchentities AB test from nov/doc
TJones added a comment. Thanks for the report. It is odd that the number of characters didn't go down—as discussed elsewhere—but the change in clicks@1 vs clicks@2 is a nice clear step in a good direction.TASK DETAILhttps://phabricator.wikimedia.org/T211033EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EBernhardson, TJonesCc: TJones, Smalyshev, debt, gabriel-wmde, Lazhar, EBjune, Aklapper, Liuxinyu970226, EBernhardson, ET4Eva, Nandana, Lahi, Gq86, Darkminds3113, GoranSMilovanovic, QZanden, LawExplorer, Avner, Gehel, _jensen, D3r1ck01, FloNight, Wikidata-bugs, aude, jayvdb, Mbch331, jeremyb___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T202299: Create a system to create language-agnostic wiki links
TJones added a comment. While Wikidata items would be unambiguous, I think referencing a particular wiki as source is better, because it provides a better fallback than English if no article in the user's target language is available. Of course, using Wikidata as the intermediate stage to go from one language to another makes perfect sense. Oh, and as long as Chris gets to ask for a pony, I want to ask for a notice on the target page that shows the original link, like a redirect notice, such as "(Shared from German Wikipedia Gift)" or "(Shared from English Wikipedia Gift)"—or whatever wording makes sense.TASK DETAILhttps://phabricator.wikimedia.org/T202299EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: TJonesCc: TJones, Whatamidoing-WMF, Aklapper, CKoerner_WMF, Trizek-WMF, Johan, Phukettaxigroup, Lahi, Gq86, GoranSMilovanovic, Jayprakash12345, QZanden, LawExplorer, Srdjan_m, MuhammadShuaib, LNDDYL, Psychoslave, Wikidata-bugs, aude, Gryllida, Shizhao, Arrbee, Mbch331, Jay8g___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T202299: Create a system to create language-agnostic wiki links
TJones added a comment. It might make sense to specify an initial language (ie., en.share.wikipedia.org/wiki/Goat or en.wikipedia.org/wiki/Special:MyLanguage/Goat or whatever), otherwise the intent can be ambiguous. e.g., should share.wikipedia.org/Gift go to English "Gift" or German "Gift" ("poison")? Perhaps something similar to what the wikipedia.org portal does to determine language? That's the Accept-Language header, I believe. For users who are not logged in, that would be a great fallback for determining the user's language. Speaking of fallbacks, specifying an initial language also makes it the fallback, and mitigates the problem of "(ugh) English as a last resort". If I send the link from German Wikipedia to someone who speaks Italian but no Italian version is available, then the original German link is a better last resort than an unexpected English link. (Also, there may not be an English version for, say, a minor German celebrity or historical figure or small German town, so falling back to English won't always work.) So my suggested revision to the initial rough logic would be: Use some URL element to indicate that this is a "share-in-your-language" link, coming from some specific source language (e.g., any of share.en.wikipedia.org, en.share.wikipedia.org, share.wikipedia.org/en, en.wikipedia.org/wiki/Special:MyLanguage, etc.). Is this user logged in to Wikipedia? If yes, what is their language preference? Show the article if available in that language, use fallback languages if possible, and use the original language as a last resort. Is the user logged out? Then take a guess via browser settings, location, or some other known identifier (or combination via confidence scoring). Show the article in that language if available, use fallback languages if possible, and use the original language as a last resort. A random thought: it would probably be best to redirect to a specific article page, and not to a second share/MyLanguage link on the best-guess target wiki. Otherwise you could set up a redirect loop. For example, if on English Wikipedia I have my language pref set to Spanish, but on Spanish Wikipedia I'm not logged in and my browser language pref is English, then a share-in-your-language link on English would send me to Spanish, but a link on Spanish would send me back to English, rinse and repeat. It's a weird case, but I can see myself having such settings as a result of doing testing.TASK DETAILhttps://phabricator.wikimedia.org/T202299EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: TJonesCc: TJones, Whatamidoing-WMF, Aklapper, CKoerner_WMF, Trizek-WMF, Johan, Phukettaxigroup, Lahi, Gq86, GoranSMilovanovic, Jayprakash12345, QZanden, LawExplorer, Srdjan_m, MuhammadShuaib, LNDDYL, Psychoslave, Wikidata-bugs, aude, Gryllida, Shizhao, Arrbee, Mbch331, Jay8g___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Retitled] T195447: Lexical Data user scenario: Exporting Lexical Data to create morphology and stemming for search engines
TJones renamed this task from "8dcaaa" to "Lexical Data user scenario: Exporting Lexical Data to create morphology and stemming for search engines".TJones raised the priority of this task from "High" to "Needs Triage".TJones added a subscriber: Aklapper.TJones removed projects: TCB-Team, Mail, New-Editor-Experiences, Language-2018-Apr-June, KartoEditor, JADE, Hashtags, Gamepress, Tamil-Sites, Connected-Open-Heritage-Batch-uploads (RAÄ-KMB_1_2017-02), CheckUser.TJones updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION26570726f6475636520796f757220627567207573696e67206120726563656e742076657273696f6e206f662074686520736f6674776172652c20746f2068652077696b6920636f6e74656e74206c616e67756167652e0a0a5468616e6b20796f752e0a546167730a436865636b557365720ad70a436f6e6e65637465642d4f70656e2d48657269746167652d42617463682d75706c6f61647320285241c42d4b4d425f315f323031372d3032290ad70a54616d696c2d53697465730ad70a47616d6570726573730ad70a48617368746167730ad70a4a4144450ad70a4b6172746f456469746f720ad70a4c616e67756167652d323031382d4170722d4a756e650ad70a4e65772d456469746f722d457870657269656e6365730ad70a4d61696c0ad70a5443422d5465616d0ad70a53756273637269626572730a4465736372697074696f6e20507265766965770a436f6e74656e77a6f6e652073657474696e6720696e20796f75722070726f66696c652c20636c69636b20746f207265636f6e63696c652eThe English language has very simple morphology, and this makes it relatively easy to build search engines that can find different forms of a word with no effort from the end user. Many other languages have a complex morphology with declinations, conjugations, clitics, agglutination, etc. Some search engines can plug in morphology and stemming support for particular languages. Support for each language must be developed and maintained independently. When Wikidata's Lexical Data is able to create all declined forms of a word, the output can be reused in both ways to build stemming engines: To find the base form (or forms) of a word from a declined form, and to find the declined forms from a base form. Wikibase should provide APIs that make such usage as easy as possible. Notes: - Like other subtasks of T186421, this is not a particular bug, but an idea for how Lexical Data can be useful in the long term. I am filing it in the hope that knowing the possible user scenarios will be useful to Wikibase developers when they are making decisions about developing the infrastructure, and to Wikidata community members when they are proposing properties, developing bots, and so on. - This is comparable to T186429 and T186420, but for search engines. - I'm subscribing @TJones and @Smalyshev, who know far more about stemming engines than I do.TASK DETAILhttps://phabricator.wikimedia.org/T195447EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: TJonesCc: Aklapper, Amire80, Smalyshev, TJones, Mringgaard, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Darkdadaah, Mbch331, AndyTan, Zylc, 1978Gage2001, herron, Chicocvenancio, alanajjar, Tbscho, Lea_WMDE, Mattias_Ostmar-WMSE, JJMC89, Jseddon, Ryuch, Mkdw, RuyP, JEumerus, Trizek-WMF, KasiaWMDE, 0x010C, srodlund, Luke081515, grin, Bsadowski1, mys_721tx, Snowolf, Huji, Gryllida, jayvdb, Tobi_WMDE_SW, revi, scfc, He7d3r, Romaine, Jay8g, Glaisher, Krenair, chasemp___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T195447: Lexical Data user scenario: Exporting Lexical Data to create morphology and stemming for search engines
TJones added a comment. Thanks for this ticket, @Amire80! I've been thinking about building a tool to scrape lexical info from Wiktionary in order to feed it into a statistical model to build stemmers for languages that don't have them. Getting it from Wikibase would probably be a lot easier. I'm not sure whether an API would be able to keep up with the rate required for stemming while indexing wikis, but it would still be an awesome tool overall for other stemming applications that have lower throughput requirements. (An induced stemmer model, whether statistical or rule-based, would also be able to handle novel forms.)TASK DETAILhttps://phabricator.wikimedia.org/T195447EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: TJonesCc: Aklapper, Amire80, Smalyshev, TJones, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Darkdadaah, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T165167: games using Wikidata's data
TJones added a comment. Game suggestion (to show people how much Wikidata knows, rather than to generate edits or new data): Twenty Questions, which is a guessing game where one player thinks of a thing and the guesser tries to figure out what it is with up to 20 yes/no questions. It's been turned into a handheld game called 20Q which learned what questions to ask by having people play on a website and asking for new questions when it lost the game. It seems like the hierarchy of Wikidata properties would provide a similar way to find a specific thing based on careful guessing—and guesses wouldn't have to be binary, either.TASK DETAILhttps://phabricator.wikimedia.org/T165167EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: TJonesCc: TJones, Lucas_Werkmeister_WMDE, MichaelSchoenitzer_WMDE, Tobi_WMDE_SW, Crang115, D3r1ck01, abian, Aklapper, Lydia_Pintscher, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, Jayprakash12345, JakeTheDeveloper, QZanden, LawExplorer, Culex, Puik, Envlh, Wikidata-bugs, aude, Tobias1984, TheDJ, Mbch331, Jay8g___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T180169: Make list of languages where using stemmed analyzer for Wikibase is beneficial
TJones added a comment. @Smalyshev, I think this covers the info you need. Let me know if I can give more info or help with anything else. :) TL;DR: yep, text is useful compared to plain for of ar, bg, ca, ckb, cs, da, de, el, en, en-ca, en-gb, es, eu, fa, fi, fr, ga, gl, hi, hu, hy, id, it, ja, ko, lt, lv, nb, nl, nn, pt, pt-br, ro, ru, simple, sv, th, and tr. Also, if the standard plugins are installed, include pl, zh, he, and uk. You should possibly note that bo, dz, gan, ja, km, lo, my, th, wuu, zh, zh-classical/lzh, zh-yue/yue, bug, cdo, cr, hak, jv, and zh-min-nan probably do better with the icu_tokenizer rather than the standard tokenizer. For everything else, keep in mind that the difference between text and plain is that plain has word_break_helper enabled. Details: The default plain analyzer is the standard tokenizer, the ICU Normalizer (which does some folding but much less than full ICU Folding) and the "word break helper" (which breaks words on periods, underscores, and parens). So default below is the same as "standard + icu_normalizer + word_break_helper". All of the analyzers except CJK, Persian, and Thai have stemmers, which I assume do something useful. Persian and Thai have stop words (as do most of the others), which I also assume do something useful. CJK has the CJK bigram filter (whick gives overlapping bigrams as tokens) and—oddly—English stop words; that seems useful. Also, if this is in an environment where the usual plugins are installed, you also have custom analyzers for pl, zh, he, and uk, so I've included them below in their own little sub-table. There are also a list of languages that have the icu_tokenizer enabled rather than the standard tokenizer: bo, dz, gan, ja, km, lo, my, th, wuu, zh, zh-classical/lzh, zh-yue/yue, bug, cdo, cr, hak, jv, and zh-min-nan. That might be worth having as another config option for those languages. For all of the languages without a custom analyzer, (including the ones using the icu_tokenizer), there is always a difference betweeen text and plain: plain includes word_break_helper. Most of the language-specific analyzers do not word_break_helper. Default Elastic analyzers: CodeLgtextplain arArabicarabicdefault bgBulgarianbulgariandefault caCatalancatalandefault ckbSoranisoranidefault csCzechczechdefault daDanishdanishdefault deGermangermandefault elGreekgreekstandard + icu_normalizer + +icu_folding + word_break_helper enEnglishenglishstandard + icu_normalizer + +icu_folding + word_break_helper en-caCanadian Englishenglishstandard + icu_normalizer + +icu_folding + word_break_helper en-gbBritish Englishenglishstandard + icu_normalizer + +icu_folding + word_break_helper esSpanishspanishdefault euBasquebasquedefault faPersianpersiandefault fiFinnishfinnishdefault frFrenchfrenchstandard + icu_normalizer + +icu_folding + word_break_helper gaIrishirishdefault glGaliciangaliciandefault hiHindihindidefault huHungarianhungariandefault hyArmenianarmeniandefault idIndonesianindonesiandefault itItalianitalianstandard + icu_normalizer + ascii_folding + dedupe_asciifolding jaJapanesecjkicu_tokenizer + icu_normalizer + word_break_helper koKoreancjkdefault ltLithuanianlithuaniandefault lvLatvianlatviandefault nbNorwegian Bokmålnorwegiandefault nlDutchdutchdefault nnNorwegian Nynorsknorwegiandefault ptPortuguesebraziliandefault pt-brBrazilian Portugueseportuguesedefault roRomanianromaniandefault ruRussianrussianstandard + icu_normalizer + russian_char_filter + word_break_helper simpleSimple Englishenglishstandard + icu_normalizer + +icu_folding + word_break_helper svSwedishswedishstandard + icu_normalizer + +icu_folding + word_break_helper thThaithaidefault trTurkishturkishdefault Analyzers with usual plugins: CodeLgtextplain plPolishpolishdefault zhChinesechineseicu_tokenizer + smartcn_stop + icu_normalizer + word_break_helper heHebrewhebrewstandard + icu_normalizer + +icu_folding + word_break_helper ukUkrainianukrainiandefault ICU Tokenization languages: CodeLg boTibetan dzDzongkha ganGan jaJapanese kmKhmer loLao myBurmese thThai wuuWu zhChinese zh-classicalClassical Chinese zh-yueCantonese bugBuginese cdoMin Dong crCree hakHakka jvJavanese zh-min-nanMin Nan TASK DETAILhttps://phabricator.wikimedia.org/T180169EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: TJonesCc: TJones, Aklapper, EBernhardson, Lydia_Pintscher, hoo, aude, Smalyshev, dcausse, Lahi, GoranSMilovanovic, QZanden, EBjune, Avner, debt, Gehel, Jdrewniak, FloNight, Wikidata-bugs, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T170779: Wikidata search suggestions do not display on screen if character whose decomposition contains nukta is present in search query
TJones added a comment. @Smalyshev, thanks for tracking this one down! That was some weird behavior, but things getting normalized and not matching makes sense.TASK DETAILhttps://phabricator.wikimedia.org/T170779EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: TJonesCc: gerritbot, hoo, Smalyshev, debt, Liuxinyu970226, TJones, daniel, thiemowmde, Aftabuzzaman, Mahir256, Aklapper, Lahi, Lordiis, GoranSMilovanovic, Adik2382, Th3d3v1ls, Ramalepe, Liugev6, QZanden, EBjune, Lewizho99, Maathavan, Jdrewniak, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T170779: Wikidata search suggestions do not display on screen if character whose decomposition contains nukta is present in search query
TJones added a comment. Note that you don’t need to change your interface to Bengali to see these effects, and the fact that it is the Bengali keyword for “category” doesn’t seem to matter either. You can search for single characters and get the described behavior. (Be sure to clear the search box between examples—otherwise you the old results, as @Mahir256 noted above.) For Bengali and Devanagari characters, the precomposed versions hang, and the decomposed versions et suggestions: Bengaliয়U+09DFprecomposedhangs Bengaliয়U+09AF U+09BCdecomposedworks Devanagariग़U+095Aprecomposedhangs Devanagariग़U+0917 U+093Cdecomposedworks Gurmukhiਗ਼U+0A5Aprecomposedhangs Gurmukhiਗ਼U+0A17 U+0A3Cdecomposedworks Oddly, the opposite behavior happens for Latin, Cyrillic, and Greek characters—the precomposed versions work and the decomposed versions hang: LatinñU+00F1precomposedworks LatinñU+006E U+303decomposedhangs LatinéU+00E9precomposedworks LatinéU+0065 U+0301decomposedhangs LatinởU+1EDFprecomposedworks LatinởU+01A1 U+0309decomposedhangs CyrillicЃU+0403precomposedworks CyrillicЃU+0413 U+0301decomposedhangs CyrillicЀU+0400precomposedworks CyrillicЀU+0415 U+0300decomposedhangs CyrillicЍU+040Dprecomposedworks CyrillicЍU+0418 U+0300decomposedhangs GreekἆU+1F06precomposedworks GreekἆU+1F00 U+0342decomposedhangs However, when there is no precomposed alternative, the decomposed version works fine (depending on your fonts, the mixed script versions may or may not look right): Latinq́U+0071 U+0301decomposedworks Latinq̀U+0071 U+0300decomposedworks Latinq̃U+0071 U+0303decomposedworks Latinq̉U+0071 U+0309decomposedworks Latinq͂U+0071 U+0342decomposedworks Latin + Bengaliq়U+0071 U+09BCdecomposedworks Latin + Devanagariq़U+0071 U+093Cdecomposedworks Latin + Gurmukhiq਼U+0071 U+0A3Cdecomposedworks So, I’m really not sure what’s going on here, but it looks like it is more than just Indic languages that have the problem, and there seems to be an “expected” form which works, and an “unexpected” form that doesn’t—and the (pre|de)composition difference can break in either direction for a given script.TASK DETAILhttps://phabricator.wikimedia.org/T170779EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: TJonesCc: debt, Liuxinyu970226, TJones, PokestarFan, daniel, thiemowmde, Aftabuzzaman, Mahir256, Aklapper, Lahi, GoranSMilovanovic, QZanden, EBjune, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Subscribers] T104783: Retry fetching update data if Wikidata returns 503
TJones removed a subscriber: Manybubbles. TJones set Security to None. TASK DETAIL https://phabricator.wikimedia.org/T104783 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, Smalyshev, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Subscribers] T104783: Retry fetching update data if Wikidata returns 503
TJones removed a subscriber: TJones. TASK DETAIL https://phabricator.wikimedia.org/T104783 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Manybubbles, Aklapper, Smalyshev, jkroll, Wikidata-bugs, Jdouglas, aude, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Subscribers] T104783: Retry fetching update data if Wikidata returns 503
TJones added subscribers: Manybubbles, TJones. TJones added a comment. @Manybubbles wrote a mini-proxy to help test/intentionally break this. I had some trouble getting it to work, but that's probably me, not the proxy. The patch (as yet unmerged) is here: https://gerrit.wikimedia.org/r/#/c/226020/ . TASK DETAIL https://phabricator.wikimedia.org/T104783 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: TJones, Manybubbles, Aklapper, Smalyshev, jkroll, Wikidata-bugs, Jdouglas, aude, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Closed] T104021: Add "updated up to" information to WDQS GUI.
TJones closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T104021 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: gerritbot, Aklapper, Smalyshev, Wikidata-bugs, aude, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Project Column] T104021: Add "updated up to" information to WDQS GUI.
TJones moved this task to Done on the Discovery-Wikidata-Query-Service-Sprint workboard. TASK DETAIL https://phabricator.wikimedia.org/T104021 WORKBOARD https://phabricator.wikimedia.org/project/board/1239/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: gerritbot, Aklapper, Smalyshev, Wikidata-bugs, aude, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T104021: Add "updated up to" information to WDQS GUI.
TJones removed a project: Wikidata-Query-Service. TASK DETAIL https://phabricator.wikimedia.org/T104021 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, Smalyshev, Wikidata-bugs, aude, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Project Column] T104021: Add "updated up to" information to WDQS GUI.
TJones moved this task to Done on the Discovery-Wikidata-Query-Service-Sprint workboard. TASK DETAIL https://phabricator.wikimedia.org/T104021 WORKBOARD https://phabricator.wikimedia.org/project/board/1239/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, Smalyshev, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Project Column] T104021: Add "updated up to" information to WDQS GUI.
TJones moved this task to Done on the Wikidata-Query-Service workboard. TASK DETAIL https://phabricator.wikimedia.org/T104021 WORKBOARD https://phabricator.wikimedia.org/project/board/891/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, Smalyshev, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Project Column] T104021: Add "updated up to" information to WDQS GUI.
TJones moved this task to In Dev/Progress on the Wikidata-Query-Service workboard. TASK DETAIL https://phabricator.wikimedia.org/T104021 WORKBOARD https://phabricator.wikimedia.org/project/board/891/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, Smalyshev, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Claimed] T104021: Add "updated up to" information to WDQS GUI.
TJones claimed this task. TJones set Security to None. TASK DETAIL https://phabricator.wikimedia.org/T104021 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: TJones Cc: Aklapper, Smalyshev, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs