[Wikidata-bugs] [Maniphest] T356773: [tracking] Community feedback for the WDQS Split the Graph project

2024-02-10 Thread EgonWillighagen
EgonWillighagen added a comment.


  I tried to get the federation working, but got time outs too. The problem is 
that the current setup makes splits at a statement level. That is, given 
statements with some property (e.g. P2860 
<https://phabricator.wikimedia.org/P2860>), some results are in one QS instance 
and some are in the other. That means a lot of federation-union combinations to 
get all results. I posted an example query that is affected (the first I tried) 
in this issue report: https://github.com/WDscholia/scholia/issues/2423

TASK DETAIL
  https://phabricator.wikimedia.org/T356773

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Sannita, EgonWillighagen
Cc: EgonWillighagen, ArthurPSmith, Sj, dcausse, valerio.bozzolan, tfmorris, 
Gehel, Aklapper, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, 
Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, EBjune, KimKelting, LawExplorer, _jensen, 
rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T349911: Explore the feasibility of using SPARQL federation for scholia queries

2023-10-28 Thread EgonWillighagen
EgonWillighagen added a comment.


  > Note that early experiments can be done by federating wdqs with itself, 
e.g. https://w.wiki/7vE9.
  
  Thanks for the example. Before I can experiment, I need to know which item 
types end up in which SPARQL endpoint. The example query suggest the author 
information will also go into the split. I am looking forward to the first 
experimental splitted endpoint to be available.

TASK DETAIL
  https://phabricator.wikimedia.org/T349911

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Fnielsen, Daniel_Mietchen, EgonWillighagen, dcausse, Aklapper, 
Danny_Benjafield_WMDE, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T325871: Wikibase QuickStatements incorrectly assumes HTTP for unit item IRIs

2022-12-22 Thread EgonWillighagen
EgonWillighagen created this task.
EgonWillighagen added a project: Wikidata.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  **Problem:**
  In Wikibase instances on https://www.wikibase.cloud/ QuickStatements where 
unit information is given fail to execute because the Uxxx in the 
QuickStatements gets expended to the wrong entity IRI, which uses the HTTP as 
on Wikidata instead of HTTPS in Wikibase installations.
  
  **List of steps to reproduce** (step by step, including full links if 
applicable):
  
  - create QuickStatements with a literal with unit information, something like
  
CREATE

LASTP1  Q2
LASTDen "chemical compound"
LASTP12 "CN(CC1=CN=CC=C1)C(=O)C2=NOC(=C2)COC3=CC4=C(4)C=C3" 
S14 Q5
LASTP3  "C₂₂H₂₃N₃O₃"S14 Q5
LASTP2  377.4372U3  S14 Q5
LASTP9  
"InChI=1S/C22H23N3O3/c1-25(14-16-5-4-10-23-13-16)22(26)21-12-20(28-24-21)15-27-19-9-8-17-6-2-3-7-18(17)11-19/h4-5,8-13H,2-3,6-7,14-15H2,1H3"
S14 Q5
LASTP10 "MPDNXORLVWKNOG-UHFFFAOYSA-N"
LASTP13 "24793226"  S14 Q4
  
  (Obviously, the exact P/Q-ids will differ per Wikibase)
  
  **What happens?**:
  Executing these QuickStatements on a Wikibase will expand the U3 
<https://phabricator.wikimedia.org/U3> to 
http://compoundcloud.wikibase.cloud/entity/U3
  
  F35889582: image.png <https://phabricator.wikimedia.org/F35889582>
  
  **What should have happened instead?**:
  The expanded IRI should use HTTPS instead of HTTP. It should be: 
https://compoundcloud.wikibase.cloud/entity/U3

TASK DETAIL
  https://phabricator.wikimedia.org/T325871

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T314999: WDQS does not autocomplete when using modifiers

2022-08-11 Thread EgonWillighagen
EgonWillighagen updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T314999

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Aklapper, EgonWillighagen, AWesterinen, MPhamWMF, CBogen, Namenlos314, 
Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T314999: WDQS does not autocomplete when using modifiers

2022-08-11 Thread EgonWillighagen
EgonWillighagen updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T314999

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Aklapper, EgonWillighagen, AWesterinen, MPhamWMF, CBogen, Namenlos314, 
Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T314999: WDQS does not autocomplete when using modifiers

2022-08-11 Thread EgonWillighagen
EgonWillighagen updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T314999

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Aklapper, EgonWillighagen, AWesterinen, MPhamWMF, CBogen, Namenlos314, 
Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T314999: WDQS does not autocomplete when using modifiers

2022-08-11 Thread EgonWillighagen
EgonWillighagen created this task.
EgonWillighagen added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  **Steps to replicate the issue** (include links if applicable):
  
  In the Wikidata Query Service (https://query.wikidata.org/){F35423485}, If 
you want to negate a predicate (e.g. "!wdt:P31 
<https://phabricator.wikimedia.org/P31>" for "not instance of") or reverse a 
predicate (e.g. "^wdt:P921 <https://phabricator.wikimedia.org/P921>" for "is 
main subject of") then autocomplete does not work.
  
  1. take a SELECT {} template
  2. start a query
  3. type "^wdt:instan" and try to autocomplete
  
  **What happens?**:
  
  What happens is that it reports it does not know the "^wdt" namespace.
  
  **What should have happened instead?**:
  
  It should be aware of the ! and ^ modifiers and remove that in the namespace 
lookup.
  
  **Software version** (skip for WMF-hosted wikis like Wikipedia):
  
  **Other information** (browser name/version, screenshots, etc.):

TASK DETAIL
  https://phabricator.wikimedia.org/T314999

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Aklapper, EgonWillighagen, AWesterinen, MPhamWMF, CBogen, Namenlos314, 
Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-08-07 Thread EgonWillighagen
EgonWillighagen added a comment.


  This ticket can be closed.

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Bugreporter, ArthurPSmith, Manuel, TheDJ, Aklapper, EgonWillighagen, 
Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Matlin, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, 
rosalieper, Scott_WUaS, Wikidata-bugs, aude, Lydia_Pintscher, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-08-07 Thread EgonWillighagen
EgonWillighagen added a comment.


  Thanks for the ping! That page was indeed the lead I had at the time and 
reason to file this issue, because I could not work out (in the time I had) how 
to update that.
  
  But the solution turned out to be a lot easier for Wikidata: 
https://www.wikidata.org/w/index.php?title=MediaWiki%3AGadget-AuthorityControl.js=revision=1694196586=1409657932
  
  This was done by Nikky only last weekend 
(https://chem-bla-ics.blogspot.com/2022/08/wikidata-now-escapes-smiles-and-cxsmiles.html)
 and since I had to focus on student report grading, I forgot to update this 
ticket.

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Bugreporter, ArthurPSmith, Manuel, TheDJ, Aklapper, EgonWillighagen, 
Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Matlin, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, 
rosalieper, Scott_WUaS, Wikidata-bugs, aude, Lydia_Pintscher, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-05-07 Thread EgonWillighagen
EgonWillighagen updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-05-07 Thread EgonWillighagen
EgonWillighagen updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-05-07 Thread EgonWillighagen
EgonWillighagen added a comment.


  In T307662#7906276 <https://phabricator.wikimedia.org/T307662#7906276>, 
@EgonWillighagen wrote:
  
  > I will write up some examples later today using the "bug" template, to 
highlight some issues.
  
  One done: https://phabricator.wikimedia.org/T307662#7911276

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-05-07 Thread EgonWillighagen
EgonWillighagen added a comment.


  **List of steps to reproduce** (step by step, including full links if 
applicable):
  
  - got to https://www.wikidata.org/wiki/Q26075#P233
  - click the link (formatter URL) for the canonical SMILES C#N
  - notice the SVG shows CH4 instead of C#N
  
  **What happens?**:
  CDKDepict which the canonical SMILES links to shows methane
  
  F35109602: image.png <https://phabricator.wikimedia.org/F35109602>
  
  instead of hydrogen cyanide:
  
  F35109600: image.png <https://phabricator.wikimedia.org/F35109600>
  
  **What should have happened instead?**:
  The canonical SMILES should be URL encoded before added as $1 in the 
formatter URL, giving 
https://www.simolecule.com/cdkdepict/depict/bow/svg?smi=C%23N

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-05-05 Thread EgonWillighagen
EgonWillighagen added a comment.


  In T307662#7906210 <https://phabricator.wikimedia.org/T307662#7906210>, 
@TheDJ wrote:
  
  > This is a url encoding problem then. Do you have a link where this is 
actually occurring ?
  
  I will write up some examples later today using the "bug" template, to 
highlight some issues.

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-05-05 Thread EgonWillighagen
EgonWillighagen added a comment.


  In T307662#7906232 <https://phabricator.wikimedia.org/T307662#7906232>, 
@TheDJ wrote:
  
  > Math-Chemistry-Support is a project specifically about defining these 
symbols using our Math/LateX wikicode extension.
  
  Ah, got it. Yeah, theoretically possible, as there is a LaTeX package for 
drawing chemical structures, but I'm not aware of a really good, open source 
tool to convert SMILES (-variants) into TeX. Yes, agreed it doesn't fit there.

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-05-05 Thread EgonWillighagen
EgonWillighagen added a comment.


  In T307662#7906222 <https://phabricator.wikimedia.org/T307662#7906222>, 
@TheDJ wrote:
  
  > This is essentially: T160281 <https://phabricator.wikimedia.org/T160281>
  
  yes, same issue, but maybe not the same solution.

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307662: help needed with encoding statement value before pass it into formatter URLs for three SMILES related properties

2022-05-05 Thread EgonWillighagen
EgonWillighagen added a comment.


  @TheDJ, that Math-Chemistry-Support is not (also) about chemistry?

TASK DETAIL
  https://phabricator.wikimedia.org/T307662

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: TheDJ, Aklapper, EgonWillighagen, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata

2021-08-07 Thread EgonWillighagen
EgonWillighagen added a comment.


  1,939,738 authors -> https://w.wiki/3o2i
  
  trying to get all unique properties of these times out.
  
  Samples 50k authors for properties with an author as subject, 
https://w.wiki/3o3C, results:
  
  - 96% is linked to a profession (P106 
<https://phabricator.wikimedia.org/P106>)
  - 94% is linked to country of citizenship (P27 
<https://phabricator.wikimedia.org/P27>)
  - 90% is linked to a place of birth (P19 
<https://phabricator.wikimedia.org/P19>)
  - 36% is linked to an employer (P108 <https://phabricator.wikimedia.org/P108>)
  - 17% is linked to a notable work (P800 
<https://phabricator.wikimedia.org/P800>)
  - 9% is linked to their doctoral advisor (P184 
<https://phabricator.wikimedia.org/P184>)
  - 8% is linked to the political party they are member of (P102)
  
  These specific properties can be used to calculate the overall statistics. 
The inverse properties (where the author is the object) seems a bit more 
trickier and I'm running into time outs there. I hope this helps.

TASK DETAIL
  https://phabricator.wikimedia.org/T281854

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: AKhatun_WMF, EgonWillighagen
Cc: AKhatun_WMF, Esc3300, SCIdude, Sj, Harej, Andrawaag, Lydia_Pintscher, 
Mohammed_Sadat_WMDE, nichtich, EgonWillighagen, Fnielsen, Darwinius, 
Daniel_Mietchen, Lokal_Profil, GoEThe, Alicia_Fagerving_WMSE, PKM, LWyatt, 
Multichill, Aklapper, MPhamWMF, Invadibot, maantietaja, CBogen, Akuckartz, 
Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, 
QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, 
Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata

2021-08-06 Thread EgonWillighagen
EgonWillighagen added a comment.


  @AKhatun_WMF, when you write "authors connected to other subgraphs", do you 
mean subgraphs within Wikidata (so, excluding external identifiers), or also 
graphs from other resources part of, for example, the Linked Open Data Cloud?

TASK DETAIL
  https://phabricator.wikimedia.org/T281854

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: AKhatun_WMF, EgonWillighagen
Cc: AKhatun_WMF, Esc3300, SCIdude, Sj, Harej, Andrawaag, Lydia_Pintscher, 
Mohammed_Sadat_WMDE, nichtich, EgonWillighagen, Fnielsen, Darwinius, 
Daniel_Mietchen, Lokal_Profil, GoEThe, Alicia_Fagerving_WMSE, PKM, LWyatt, 
Multichill, Aklapper, MPhamWMF, Invadibot, maantietaja, CBogen, Akuckartz, 
Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, 
QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, 
Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata

2021-07-07 Thread EgonWillighagen
EgonWillighagen added a comment.


  In T281854#7185253 <https://phabricator.wikimedia.org/T281854#7185253>, 
@Multichill wrote:
  
  > No it's not, please have a look at the task description. This is about 
getting metrics.
  
  Can you elaborate on the "this plan" in that description? What do you know 
more that others do not?

TASK DETAIL
  https://phabricator.wikimedia.org/T281854

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Sj, Harej, Andrawaag, Lydia_Pintscher, Mohammed_Sadat_WMDE, nichtich, 
EgonWillighagen, Fnielsen, Darwinius, Daniel_Mietchen, Lokal_Profil, GoEThe, 
Alicia_Fagerving_WMSE, PKM, LWyatt, Multichill, Aklapper, MPhamWMF, Invadibot, 
maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata

2021-06-19 Thread EgonWillighagen
EgonWillighagen added a comment.


  Regarding the question of the "growth of scientific literature", there is a 
good bit of literature on this, and sometimes conflated with the topic of 
"growth of science". I started collecting some knowledge about this: 
https://scholia.toolforge.org/topic/Q107292942

TASK DETAIL
  https://phabricator.wikimedia.org/T281854

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Andrawaag, Harej, Lydia_Pintscher, Mohammed_Sadat_WMDE, nichtich, 
EgonWillighagen, Fnielsen, Darwinius, Daniel_Mietchen, Lokal_Profil, GoEThe, 
Alicia_Fagerving_WMSE, PKM, LWyatt, Multichill, Aklapper, MPhamWMF, Invadibot, 
maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T281854: Get baseline measurements/expectations for splitting scholarly articles from Wikidata

2021-06-19 Thread EgonWillighagen
EgonWillighagen added a comment.


  I am with @Harej here. Focusing on the largest data set is not the right 
approach. As I have indicated in similar discussions elsewhere, there will be a 
next large subset and this one will also be large. From the field chemistry, 
60M items is nothing. The number of species every observed is millions. There 
are many things that easily go into the millions. At this moment, we have a 
small subset of chemicals in Wikidata (~1.2 million), because of the growing 
pains this is artificially low (real chemical databases have >102 M records of 
chemicals experimentally studied). I regularly run into missing content (even 
just looking at the English Wikipedia), and am very selective in what i add at 
this moment.
  
  As soon as you remove one big blob, all that will happen is that the void 
will be very quickly filled by another big blob. Now, if a single database is 
not possible, then the overall design must just change, and everything should 
become a separate namespace and make sure the federation works extremely well: 
the reason why Wikidata works so awesome, is that I can move from one topic to 
underlying data sources because everything is integrated. Please take that into 
consideration.
  
  In fact, it the sake is just to split out a blog and see what happens, then 
plz focus on something more volatile then the knowledge about reality, and 
remove for example things that changes every year. For example, remove all 
humans, all of them, and organizations. There will be a new human tomorrow. 
When it comes to facts, who care who did or studied it, but just focus on what 
happened or what was discovered.

TASK DETAIL
  https://phabricator.wikimedia.org/T281854

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Harej, Lydia_Pintscher, Mohammed_Sadat_WMDE, nichtich, EgonWillighagen, 
Fnielsen, Darwinius, Daniel_Mietchen, Lokal_Profil, GoEThe, 
Alicia_Fagerving_WMSE, PKM, LWyatt, Multichill, Aklapper, MPhamWMF, Invadibot, 
maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] [Commented On] T249041: Updated URL for the WikiPathways SPARQL endpoint

2020-03-31 Thread EgonWillighagen
EgonWillighagen added a comment.


  I created a pull request: 
https://github.com/wikimedia/wikidata-query-deploy/pull/1

TASK DETAIL
  https://phabricator.wikimedia.org/T249041

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EgonWillighagen
Cc: Multichill, Aklapper, EgonWillighagen, CptViraj, darthmon_wmde, Dibya, 
94rain, DannyS712, Nandana, Tks4Fish, Lahi, Gq86, Lucas_Werkmeister_WMDE, 
GoranSMilovanovic, Jayprakash12345, QZanden, EBjune, Zoranzoki21, merbst, 
LawExplorer, DatGuy, Devwaker, Niklitov, _jensen, Urbanecm, rosalieper, 
JEumerus, Scott_WUaS, Jonas, Ananthsubray, Xmlizer, Superzerocool, 
Tulsi_Bhagat, Wong128hk, Luke081515, SimmeD, jkroll, Wikidata-bugs, Jdouglas, 
Snowolf, aude, Tobias1984, Dcljr, Manybubbles, Jdforrester-WMF, Matanya, 
Mbch331, Rxy, Jay8g, Krenair
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T175380: Queries with wikibase:statements or wikibase:sitelinks are slow

2020-02-20 Thread EgonWillighagen
EgonWillighagen added a comment.


  Yes, in the end we want data for all chemicals, but this is a good tradeoff. 
I'll implement! Thanks!

TASK DETAIL
  https://phabricator.wikimedia.org/T175380

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Smalyshev, EgonWillighagen
Cc: EgonWillighagen, Aklapper, Smalyshev, Lucas_Werkmeister_WMDE, 
darthmon_wmde, ET4Eva, Nandana, Lahi, Gq86, Darkminds3113, GoranSMilovanovic, 
QZanden, EBjune, merbst, LawExplorer, Avner, Gehel, _jensen, rosalieper, 
Scott_WUaS, Jonas, FloNight, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, 
Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T175380: Queries with wikibase:statements or wikibase:sitelinks are slow

2020-02-20 Thread EgonWillighagen
EgonWillighagen added a comment.


  I'm running into this problem too. Queries are slow or even time out for 
chemicals. The hints to do not seem to improve the query time significantly:
  
SELECT ?wikis ?compound WHERE {
  ?compound wdt:P31 wd:Q11173 ;
wikibase:sitelinks ?wikis . hint:Prior hint:rangeSafe true .
  FILTER(NOT EXISTS {?compound wdt:P2119 []})
} ORDER BY DESC(?wikis)
LIMIT 100

TASK DETAIL
  https://phabricator.wikimedia.org/T175380

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Smalyshev, EgonWillighagen
Cc: EgonWillighagen, Aklapper, Smalyshev, Lucas_Werkmeister_WMDE, 
darthmon_wmde, ET4Eva, Nandana, Lahi, Gq86, Darkminds3113, GoranSMilovanovic, 
QZanden, EBjune, merbst, LawExplorer, Avner, Gehel, _jensen, rosalieper, 
Scott_WUaS, Jonas, FloNight, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, 
Tobias1984, Manybubbles, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T193728: Solve legal uncertainty of Wikidata

2018-05-18 Thread EgonWillighagen
EgonWillighagen added a comment.

In T193728#4212862, @Rspeer wrote:
how to change Wikidata's copyright status.


In which you assume it will chance license(/waiver)... If you seek certainty, plenty of people have indicated their view on the situation here, but this discussion is not ever going to give you certainty: only court can.TASK DETAILhttps://phabricator.wikimedia.org/T193728EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: lisong, Lofhi, Nemo_bis, TomT0m, jrbs, EgonWillighagen, sarojdhakal, Agabi10, NMaia, Simon_Villeneuve, Jarekt, Rspeer, OhKayeSierra, Aschmidt, AndrewSu, Mateusz_Konieczny, Maxlath, Huji, Glrx, Realworldobject, Ltrlg, Papapep, Tgr, Ayack, Gnom1, MichaelMaggs, MisterSynergy, Pasleim, Cirdan, 0x010C, Sylvain_WMFr, Denny, Ivanhercaz, Pintoch, Lydia_Pintscher, Lea_Lacroix_WMDE, Aklapper, Psychoslave, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, ZhouZ, Mpaulson, Wikidata-bugs, aude, jayvdb, Slaporte, Mbch331, Jay8g___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T194735: WDQS embedding gives strange results.

2018-05-17 Thread EgonWillighagen
EgonWillighagen added a comment.
@Fnielsen, we could add the _javascript_ to run the queries in a way that it only runs when the  is visible... e.g. with something like this: https://github.com/shaunbowe/jquery.visibilityChangedTASK DETAILhttps://phabricator.wikimedia.org/T194735EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: EgonWillighagen, Jonas, Gehel, Fnielsen, Aklapper, Lahi, Gq86, Darkminds3113, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, Avner, FloNight, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T193728: Solve legal uncertainty of Wikidata

2018-05-14 Thread EgonWillighagen
EgonWillighagen added a comment.
Hi all, IANAL but have been professionally dealing with copyright for quite some time now (scholar, author, database creator, advisor, etc, etc).

First, automated (bots, quickstatements) added of content that is not public domain (the formal type, e.g. in USA, not the loose common sense of the word) or CCZero itself is not OK. That said, I am not aware of anyone violating that. I understood that was discussed on a private mailing list, but makes this ticket's discussion a bit academic (sadly).

Second, to me, when I signed up for a Wikidata account, the license/waiver (CCZero is an agreement that waives all rights given by any law in any jurisdiction; does someone know if the legality of it has been challenged in court? not that I am aware of, at this moment) was pretty clear to me. Just like with Wikipedia, the account owner takes responsibility for not uploading copyright infringing material. To me, that does not make CCZero unsuitable for Wikidata at all, but does mean that some users seems to violate the Wikidata user agreement.

Third, someone above suggest to change the CCZero license/waiver. I strongly disagree: it would violate my creative right, if not legally (which is probably OK), but at least morally. Like me, I expect that many others have put significant effort in entering CCZero data, and I really prefer others to acknowledge my and others wishes to have "my" data under CCZero.

Fourth, tracking where data comes from is really important. But please consider it's complicated. If something says, "stated in" "English Wikipedia", it does not mean it was automatically entered; that depends from account to account. When I cannot find a better source, I often use "English Wikipedia" as reference source, and will state that in Wikidata (I sometimes forget).

So, for me this issues is not about CCZero for Wikidata, it is about some people doing stuff they were not supposed to do. Have these people been contacted? Has the violating content be identified? What was their reply? Has an attempt been made to remove that content?

EgonTASK DETAILhttps://phabricator.wikimedia.org/T193728EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: EgonWillighagen, sarojdhakal, Agabi10, NMaia, Simon_Villeneuve, Jarekt, Rspeer, OhKayeSierra, Aschmidt, AndrewSu, Mateusz_Konieczny, Maxlath, Huji, Glrx, Realworldobject, Ltrlg, Papapep, Tgr, Ayack, Gnom1, MichaelMaggs, MisterSynergy, Pasleim, Cirdan, 0x010C, Sylvain_WMFr, Denny, Ivanhercaz, Pintoch, Lydia_Pintscher, Lea_Lacroix_WMDE, Aklapper, Psychoslave, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, ZhouZ, Mpaulson, Wikidata-bugs, aude, jayvdb, Slaporte, Mbch331, Jay8g___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T193728: Solve legal uncertainty of Wikidata

2018-05-14 Thread EgonWillighagen
EgonWillighagen added a comment.

In T193728#4189219, @Psychoslave wrote:
Let's recall that whether this transfer is done by automation or crowdsourcing doesn't matter, it's the quantity of transferred data


Of all things I read about copyright law (IANAL but very interested), this is not what I have been told... in NL there is the provision that allows to replicate a database if the content is aggregated independently (the famous case is the Dutch phonebook which was manually copied in India; sorry cannot find an online description quickly).

My point, from what I understood, is does matter how content was transferred, which makes me consider Mix'n'Match legally safe.

EgonTASK DETAILhttps://phabricator.wikimedia.org/T193728EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: EgonWillighagen, sarojdhakal, Agabi10, NMaia, Simon_Villeneuve, Jarekt, Rspeer, OhKayeSierra, Aschmidt, AndrewSu, Mateusz_Konieczny, Maxlath, Huji, Glrx, Realworldobject, Ltrlg, Papapep, Tgr, Ayack, Gnom1, MichaelMaggs, MisterSynergy, Pasleim, Cirdan, 0x010C, Sylvain_WMFr, Denny, Ivanhercaz, Pintoch, Lydia_Pintscher, Lea_Lacroix_WMDE, Aklapper, Psychoslave, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, ZhouZ, Mpaulson, Wikidata-bugs, aude, jayvdb, Slaporte, Mbch331, Jay8g___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T154660: increase length limit for external identifier, string and URL datatype

2017-01-12 Thread EgonWillighagen
EgonWillighagen added a comment.
I am not sure how much we should worry about the exact percentages for PubChem; to me, more important is are the percentages of the chemistry we have in Wikidata. These are likely correlated, and since PubChem is a lot bigger puts things in perspective.  InChIs are identifiers, but not as we are common too, and I understand the point about indexing and ID length.

Quick question... is the semantic meaning of an 'external identifier' that it must be indexed, all of them?TASK DETAILhttps://phabricator.wikimedia.org/T154660EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: daniel, thiemowmde, EgonWillighagen, Sebotic, Scott_WUaS, Sadads, Pasleim, Aklapper, Lydia_Pintscher, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T154660: increase length limit for external identifier, string and URL datatype

2017-01-08 Thread EgonWillighagen
EgonWillighagen added a comment.
The InChI is not the only use case for chemistry, btw. SMILES also runs into the char limit right now for a number of compounds.TASK DETAILhttps://phabricator.wikimedia.org/T154660EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: EgonWillighagenCc: EgonWillighagen, Sebotic, Scott_WUaS, Sadads, Pasleim, Aklapper, Lydia_Pintscher, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs