[Wikidata-bugs] [Maniphest] T356773: [tracking] Community feedback for the WDQS Split the Graph project
EgonWillighagen added a comment. I tried to get the federation working, but got time outs too. The problem is that the current setup makes splits at a statement level. That is, given statements with some property (e.g. P2860 <https://phabricator.wikimedia.org/P2860>), some results are in one QS instance and some are in the other. That means a lot of federation-union combinations to get all results. I posted an example query that is affected (the first I tried) in this issue report: https://github.com/WDscholia/scholia/issues/2423 TASK DETAIL https://phabricator.wikimedia.org/T356773 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Sannita, EgonWillighagen Cc: EgonWillighagen, ArthurPSmith, Sj, dcausse, valerio.bozzolan, tfmorris, Gehel, Aklapper, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, KimKelting, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries
EgonWillighagen added a comment. > Note that early experiments can be done by federating wdqs with itself, e.g. https://w.wiki/7vE9. Thanks for the example. Before I can experiment, I need to know which item types end up in which SPARQL endpoint. The example query suggest the author information will also go into the split. I am looking forward to the first experimental splitted endpoint to be available. TASK DETAIL https://phabricator.wikimedia.org/T349911 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Fnielsen, Daniel_Mietchen, EgonWillighagen, dcausse, Aklapper, Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T325871: Wikibase QuickStatements incorrectly assumes HTTP for unit item IRIs
EgonWillighagen created this task. EgonWillighagen added a project: Wikidata. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION **Problem:** In Wikibase instances on https://www.wikibase.cloud/ QuickStatements where unit information is given fail to execute because the Uxxx in the QuickStatements gets expended to the wrong entity IRI, which uses the HTTP as on Wikidata instead of HTTPS in Wikibase installations. **List of steps to reproduce** (step by step, including full links if applicable): - create QuickStatements with a literal with unit information, something like CREATE LASTP1 Q2 LASTDen "chemical compound" LASTP12 "CN(CC1=CN=CC=C1)C(=O)C2=NOC(=C2)COC3=CC4=C(4)C=C3" S14 Q5 LASTP3 "C₂₂H₂₃N₃O₃"S14 Q5 LASTP2 377.4372U3 S14 Q5 LASTP9 "InChI=1S/C22H23N3O3/c1-25(14-16-5-4-10-23-13-16)22(26)21-12-20(28-24-21)15-27-19-9-8-17-6-2-3-7-18(17)11-19/h4-5,8-13H,2-3,6-7,14-15H2,1H3" S14 Q5 LASTP10 "MPDNXORLVWKNOG-UHFFFAOYSA-N" LASTP13 "24793226" S14 Q4 (Obviously, the exact P/Q-ids will differ per Wikibase) **What happens?**: Executing these QuickStatements on a Wikibase will expand the U3 <https://phabricator.wikimedia.org/U3> to http://compoundcloud.wikibase.cloud/entity/U3 F35889582: image.png <https://phabricator.wikimedia.org/F35889582> **What should have happened instead?**: The expanded IRI should use HTTPS instead of HTTP. It should be: https://compoundcloud.wikibase.cloud/entity/U3 TASK DETAIL https://phabricator.wikimedia.org/T325871 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T314999: WDQS does not autocomplete when using modifiers
EgonWillighagen updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T314999 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Aklapper, EgonWillighagen, AWesterinen, MPhamWMF, CBogen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T314999: WDQS does not autocomplete when using modifiers
EgonWillighagen updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T314999 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Aklapper, EgonWillighagen, AWesterinen, MPhamWMF, CBogen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T314999: WDQS does not autocomplete when using modifiers
EgonWillighagen updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T314999 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Aklapper, EgonWillighagen, AWesterinen, MPhamWMF, CBogen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T314999: WDQS does not autocomplete when using modifiers
EgonWillighagen created this task. EgonWillighagen added a project: Wikidata-Query-Service. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION **Steps to replicate the issue** (include links if applicable): In the Wikidata Query Service (https://query.wikidata.org/){F35423485}, If you want to negate a predicate (e.g. "!wdt:P31 <https://phabricator.wikimedia.org/P31>" for "not instance of") or reverse a predicate (e.g. "^wdt:P921 <https://phabricator.wikimedia.org/P921>" for "is main subject of") then autocomplete does not work. 1. take a SELECT {} template 2. start a query 3. type "^wdt:instan" and try to autocomplete **What happens?**: What happens is that it reports it does not know the "^wdt" namespace. **What should have happened instead?**: It should be aware of the ! and ^ modifiers and remove that in the namespace lookup. **Software version** (skip for WMF-hosted wikis like Wikipedia): **Other information** (browser name/version, screenshots, etc.): TASK DETAIL https://phabricator.wikimedia.org/T314999 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Aklapper, EgonWillighagen, AWesterinen, MPhamWMF, CBogen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen added a comment. This ticket can be closed. TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Bugreporter, ArthurPSmith, Manuel, TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Matlin, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Lydia_Pintscher, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen added a comment. Thanks for the ping! That page was indeed the lead I had at the time and reason to file this issue, because I could not work out (in the time I had) how to update that. But the solution turned out to be a lot easier for Wikidata: https://www.wikidata.org/w/index.php?title=MediaWiki%3AGadget-AuthorityControl.js=revision=1694196586=1409657932 This was done by Nikky only last weekend (https://chem-bla-ics.blogspot.com/2022/08/wikidata-now-escapes-smiles-and-cxsmiles.html) and since I had to focus on student report grading, I forgot to update this ticket. TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Bugreporter, ArthurPSmith, Manuel, TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Matlin, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Lydia_Pintscher, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen added a comment. In T307662#7906276 <https://phabricator.wikimedia.org/T307662#7906276>, @EgonWillighagen wrote: > I will write up some examples later today using the "bug" template, to highlight some issues. One done: https://phabricator.wikimedia.org/T307662#7911276 TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen added a comment. **List of steps to reproduce** (step by step, including full links if applicable): - got to https://www.wikidata.org/wiki/Q26075#P233 - click the link (formatter URL) for the canonical SMILES C#N - notice the SVG shows CH4 instead of C#N **What happens?**: CDKDepict which the canonical SMILES links to shows methane F35109602: image.png <https://phabricator.wikimedia.org/F35109602> instead of hydrogen cyanide: F35109600: image.png <https://phabricator.wikimedia.org/F35109600> **What should have happened instead?**: The canonical SMILES should be URL encoded before added as $1 in the formatter URL, giving https://www.simolecule.com/cdkdepict/depict/bow/svg?smi=C%23N TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen added a comment. In T307662#7906210 <https://phabricator.wikimedia.org/T307662#7906210>, @TheDJ wrote: > This is a url encoding problem then. Do you have a link where this is actually occurring ? I will write up some examples later today using the "bug" template, to highlight some issues. TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen added a comment. In T307662#7906232 <https://phabricator.wikimedia.org/T307662#7906232>, @TheDJ wrote: > Math-Chemistry-Support is a project specifically about defining these symbols using our Math/LateX wikicode extension. Ah, got it. Yeah, theoretically possible, as there is a LaTeX package for drawing chemical structures, but I'm not aware of a really good, open source tool to convert SMILES (-variants) into TeX. Yes, agreed it doesn't fit there. TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen added a comment. In T307662#7906222 <https://phabricator.wikimedia.org/T307662#7906222>, @TheDJ wrote: > This is essentially: T160281 <https://phabricator.wikimedia.org/T160281> yes, same issue, but maybe not the same solution. TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties
EgonWillighagen added a comment. @TheDJ, that Math-Chemistry-Support is not (also) about chemistry? TASK DETAIL https://phabricator.wikimedia.org/T307662 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata
EgonWillighagen added a comment. 1,939,738 authors -> https://w.wiki/3o2i trying to get all unique properties of these times out. Samples 50k authors for properties with an author as subject, https://w.wiki/3o3C, results: - 96% is linked to a profession (P106 <https://phabricator.wikimedia.org/P106>) - 94% is linked to country of citizenship (P27 <https://phabricator.wikimedia.org/P27>) - 90% is linked to a place of birth (P19 <https://phabricator.wikimedia.org/P19>) - 36% is linked to an employer (P108 <https://phabricator.wikimedia.org/P108>) - 17% is linked to a notable work (P800 <https://phabricator.wikimedia.org/P800>) - 9% is linked to their doctoral advisor (P184 <https://phabricator.wikimedia.org/P184>) - 8% is linked to the political party they are member of (P102) These specific properties can be used to calculate the overall statistics. The inverse properties (where the author is the object) seems a bit more trickier and I'm running into time outs there. I hope this helps. TASK DETAIL https://phabricator.wikimedia.org/T281854 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: AKhatun_WMF, EgonWillighagen Cc: AKhatun_WMF, Esc3300, SCIdude, Sj, Harej, Andrawaag, Lydia_Pintscher, Mohammed_Sadat_WMDE, nichtich, EgonWillighagen, Fnielsen, Darwinius, Daniel_Mietchen, Lokal_Profil, GoEThe, Alicia_Fagerving_WMSE, PKM, LWyatt, Multichill, Aklapper, MPhamWMF, Invadibot, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata
EgonWillighagen added a comment. @AKhatun_WMF, when you write "authors connected to other subgraphs", do you mean subgraphs within Wikidata (so, excluding external identifiers), or also graphs from other resources part of, for example, the Linked Open Data Cloud? TASK DETAIL https://phabricator.wikimedia.org/T281854 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: AKhatun_WMF, EgonWillighagen Cc: AKhatun_WMF, Esc3300, SCIdude, Sj, Harej, Andrawaag, Lydia_Pintscher, Mohammed_Sadat_WMDE, nichtich, EgonWillighagen, Fnielsen, Darwinius, Daniel_Mietchen, Lokal_Profil, GoEThe, Alicia_Fagerving_WMSE, PKM, LWyatt, Multichill, Aklapper, MPhamWMF, Invadibot, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata
EgonWillighagen added a comment. In T281854#7185253 <https://phabricator.wikimedia.org/T281854#7185253>, @Multichill wrote: > No it's not, please have a look at the task description. This is about getting metrics. Can you elaborate on the "this plan" in that description? What do you know more that others do not? TASK DETAIL https://phabricator.wikimedia.org/T281854 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Sj, Harej, Andrawaag, Lydia_Pintscher, Mohammed_Sadat_WMDE, nichtich, EgonWillighagen, Fnielsen, Darwinius, Daniel_Mietchen, Lokal_Profil, GoEThe, Alicia_Fagerving_WMSE, PKM, LWyatt, Multichill, Aklapper, MPhamWMF, Invadibot, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata
EgonWillighagen added a comment. Regarding the question of the "growth of scientific literature", there is a good bit of literature on this, and sometimes conflated with the topic of "growth of science". I started collecting some knowledge about this: https://scholia.toolforge.org/topic/Q107292942 TASK DETAIL https://phabricator.wikimedia.org/T281854 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Andrawaag, Harej, Lydia_Pintscher, Mohammed_Sadat_WMDE, nichtich, EgonWillighagen, Fnielsen, Darwinius, Daniel_Mietchen, Lokal_Profil, GoEThe, Alicia_Fagerving_WMSE, PKM, LWyatt, Multichill, Aklapper, MPhamWMF, Invadibot, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata
EgonWillighagen added a comment. I am with @Harej here. Focusing on the largest data set is not the right approach. As I have indicated in similar discussions elsewhere, there will be a next large subset and this one will also be large. From the field chemistry, 60M items is nothing. The number of species every observed is millions. There are many things that easily go into the millions. At this moment, we have a small subset of chemicals in Wikidata (~1.2 million), because of the growing pains this is artificially low (real chemical databases have >102 M records of chemicals experimentally studied). I regularly run into missing content (even just looking at the English Wikipedia), and am very selective in what i add at this moment. As soon as you remove one big blob, all that will happen is that the void will be very quickly filled by another big blob. Now, if a single database is not possible, then the overall design must just change, and everything should become a separate namespace and make sure the federation works extremely well: the reason why Wikidata works so awesome, is that I can move from one topic to underlying data sources because everything is integrated. Please take that into consideration. In fact, it the sake is just to split out a blog and see what happens, then plz focus on something more volatile then the knowledge about reality, and remove for example things that changes every year. For example, remove all humans, all of them, and organizations. There will be a new human tomorrow. When it comes to facts, who care who did or studied it, but just focus on what happened or what was discovered. TASK DETAIL https://phabricator.wikimedia.org/T281854 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Harej, Lydia_Pintscher, Mohammed_Sadat_WMDE, nichtich, EgonWillighagen, Fnielsen, Darwinius, Daniel_Mietchen, Lokal_Profil, GoEThe, Alicia_Fagerving_WMSE, PKM, LWyatt, Multichill, Aklapper, MPhamWMF, Invadibot, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] [Commented On] T249041: Updated URL for the WikiPathways SPARQL endpoint
EgonWillighagen added a comment. I created a pull request: https://github.com/wikimedia/wikidata-query-deploy/pull/1 TASK DETAIL https://phabricator.wikimedia.org/T249041 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: EgonWillighagen Cc: Multichill, Aklapper, EgonWillighagen, CptViraj, darthmon_wmde, Dibya, 94rain, DannyS712, Nandana, Tks4Fish, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Jayprakash12345, QZanden, EBjune, Zoranzoki21, merbst, LawExplorer, DatGuy, Devwaker, Niklitov, _jensen, Urbanecm, rosalieper, JEumerus, Scott_WUaS, Jonas, Ananthsubray, Xmlizer, Superzerocool, Tulsi_Bhagat, Wong128hk, Luke081515, SimmeD, jkroll, Wikidata-bugs, Jdouglas, Snowolf, aude, Tobias1984, Dcljr, Manybubbles, Jdforrester-WMF, Matanya, Mbch331, Rxy, Jay8g, Krenair ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T175380: Queries with wikibase:statements or wikibase:sitelinks are slow
EgonWillighagen added a comment. Yes, in the end we want data for all chemicals, but this is a good tradeoff. I'll implement! Thanks! TASK DETAIL https://phabricator.wikimedia.org/T175380 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, EgonWillighagen Cc: EgonWillighagen, Aklapper, Smalyshev, Lucas_Werkmeister_WMDE, darthmon_wmde, ET4Eva, Nandana, Lahi, Gq86, Darkminds3113, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, Avner, Gehel, _jensen, rosalieper, Scott_WUaS, Jonas, FloNight, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T175380: Queries with wikibase:statements or wikibase:sitelinks are slow
EgonWillighagen added a comment. I'm running into this problem too. Queries are slow or even time out for chemicals. The hints to do not seem to improve the query time significantly: SELECT ?wikis ?compound WHERE { ?compound wdt:P31 wd:Q11173 ; wikibase:sitelinks ?wikis . hint:Prior hint:rangeSafe true . FILTER(NOT EXISTS {?compound wdt:P2119 []}) } ORDER BY DESC(?wikis) LIMIT 100 TASK DETAIL https://phabricator.wikimedia.org/T175380 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, EgonWillighagen Cc: EgonWillighagen, Aklapper, Smalyshev, Lucas_Werkmeister_WMDE, darthmon_wmde, ET4Eva, Nandana, Lahi, Gq86, Darkminds3113, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, Avner, Gehel, _jensen, rosalieper, Scott_WUaS, Jonas, FloNight, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T193728: Solve legal uncertainty of Wikidata
EgonWillighagen added a comment. In T193728#4212862, @Rspeer wrote: how to change Wikidata's copyright status. In which you assume it will chance license(/waiver)... If you seek certainty, plenty of people have indicated their view on the situation here, but this discussion is not ever going to give you certainty: only court can.TASK DETAILhttps://phabricator.wikimedia.org/T193728EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: lisong, Lofhi, Nemo_bis, TomT0m, jrbs, EgonWillighagen, sarojdhakal, Agabi10, NMaia, Simon_Villeneuve, Jarekt, Rspeer, OhKayeSierra, Aschmidt, AndrewSu, Mateusz_Konieczny, Maxlath, Huji, Glrx, Realworldobject, Ltrlg, Papapep, Tgr, Ayack, Gnom1, MichaelMaggs, MisterSynergy, Pasleim, Cirdan, 0x010C, Sylvain_WMFr, Denny, Ivanhercaz, Pintoch, Lydia_Pintscher, Lea_Lacroix_WMDE, Aklapper, Psychoslave, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, ZhouZ, Mpaulson, Wikidata-bugs, aude, jayvdb, Slaporte, Mbch331, Jay8g___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T194735: WDQS embedding gives strange results.
EgonWillighagen added a comment. @Fnielsen, we could add the _javascript_ to run the queries in a way that it only runs when the is visible... e.g. with something like this: https://github.com/shaunbowe/jquery.visibilityChangedTASK DETAILhttps://phabricator.wikimedia.org/T194735EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: EgonWillighagen, Jonas, Gehel, Fnielsen, Aklapper, Lahi, Gq86, Darkminds3113, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, Avner, FloNight, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T193728: Solve legal uncertainty of Wikidata
EgonWillighagen added a comment. Hi all, IANAL but have been professionally dealing with copyright for quite some time now (scholar, author, database creator, advisor, etc, etc). First, automated (bots, quickstatements) added of content that is not public domain (the formal type, e.g. in USA, not the loose common sense of the word) or CCZero itself is not OK. That said, I am not aware of anyone violating that. I understood that was discussed on a private mailing list, but makes this ticket's discussion a bit academic (sadly). Second, to me, when I signed up for a Wikidata account, the license/waiver (CCZero is an agreement that waives all rights given by any law in any jurisdiction; does someone know if the legality of it has been challenged in court? not that I am aware of, at this moment) was pretty clear to me. Just like with Wikipedia, the account owner takes responsibility for not uploading copyright infringing material. To me, that does not make CCZero unsuitable for Wikidata at all, but does mean that some users seems to violate the Wikidata user agreement. Third, someone above suggest to change the CCZero license/waiver. I strongly disagree: it would violate my creative right, if not legally (which is probably OK), but at least morally. Like me, I expect that many others have put significant effort in entering CCZero data, and I really prefer others to acknowledge my and others wishes to have "my" data under CCZero. Fourth, tracking where data comes from is really important. But please consider it's complicated. If something says, "stated in" "English Wikipedia", it does not mean it was automatically entered; that depends from account to account. When I cannot find a better source, I often use "English Wikipedia" as reference source, and will state that in Wikidata (I sometimes forget). So, for me this issues is not about CCZero for Wikidata, it is about some people doing stuff they were not supposed to do. Have these people been contacted? Has the violating content be identified? What was their reply? Has an attempt been made to remove that content? EgonTASK DETAILhttps://phabricator.wikimedia.org/T193728EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: EgonWillighagen, sarojdhakal, Agabi10, NMaia, Simon_Villeneuve, Jarekt, Rspeer, OhKayeSierra, Aschmidt, AndrewSu, Mateusz_Konieczny, Maxlath, Huji, Glrx, Realworldobject, Ltrlg, Papapep, Tgr, Ayack, Gnom1, MichaelMaggs, MisterSynergy, Pasleim, Cirdan, 0x010C, Sylvain_WMFr, Denny, Ivanhercaz, Pintoch, Lydia_Pintscher, Lea_Lacroix_WMDE, Aklapper, Psychoslave, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, ZhouZ, Mpaulson, Wikidata-bugs, aude, jayvdb, Slaporte, Mbch331, Jay8g___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T193728: Solve legal uncertainty of Wikidata
EgonWillighagen added a comment. In T193728#4189219, @Psychoslave wrote: Let's recall that whether this transfer is done by automation or crowdsourcing doesn't matter, it's the quantity of transferred data Of all things I read about copyright law (IANAL but very interested), this is not what I have been told... in NL there is the provision that allows to replicate a database if the content is aggregated independently (the famous case is the Dutch phonebook which was manually copied in India; sorry cannot find an online description quickly). My point, from what I understood, is does matter how content was transferred, which makes me consider Mix'n'Match legally safe. EgonTASK DETAILhttps://phabricator.wikimedia.org/T193728EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: EgonWillighagen, sarojdhakal, Agabi10, NMaia, Simon_Villeneuve, Jarekt, Rspeer, OhKayeSierra, Aschmidt, AndrewSu, Mateusz_Konieczny, Maxlath, Huji, Glrx, Realworldobject, Ltrlg, Papapep, Tgr, Ayack, Gnom1, MichaelMaggs, MisterSynergy, Pasleim, Cirdan, 0x010C, Sylvain_WMFr, Denny, Ivanhercaz, Pintoch, Lydia_Pintscher, Lea_Lacroix_WMDE, Aklapper, Psychoslave, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, ZhouZ, Mpaulson, Wikidata-bugs, aude, jayvdb, Slaporte, Mbch331, Jay8g___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T154660: increase length limit for external identifier, string and URL datatype
EgonWillighagen added a comment. I am not sure how much we should worry about the exact percentages for PubChem; to me, more important is are the percentages of the chemistry we have in Wikidata. These are likely correlated, and since PubChem is a lot bigger puts things in perspective. InChIs are identifiers, but not as we are common too, and I understand the point about indexing and ID length. Quick question... is the semantic meaning of an 'external identifier' that it must be indexed, all of them?TASK DETAILhttps://phabricator.wikimedia.org/T154660EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: daniel, thiemowmde, EgonWillighagen, Sebotic, Scott_WUaS, Sadads, Pasleim, Aklapper, Lydia_Pintscher, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T154660: increase length limit for external identifier, string and URL datatype
EgonWillighagen added a comment. The InChI is not the only use case for chemistry, btw. SMILES also runs into the char limit right now for a number of compounds.TASK DETAILhttps://phabricator.wikimedia.org/T154660EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: EgonWillighagen, Sebotic, Scott_WUaS, Sadads, Pasleim, Aklapper, Lydia_Pintscher, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs