[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2021-04-07 Thread Gehel
Gehel closed this task as "Resolved".
Gehel added a comment.


  Looks like this is resolved. If you find more issues, feel free to re-open.

TASK DETAIL
  https://phabricator.wikimedia.org/T269302

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Zbyszko, Gehel
Cc: Gehel, CBogen, Lydia_Pintscher, Vojtech.dostal, Dipsacus_fullonum, 
Aklapper, Invadibot, GFontenelle_WMF, MPhamWMF, maantietaja, FRomeo_WMF, 
Muchiri124, Nintendofan885, Akuckartz, Nandana, JKSTNK, Namenlos314, Lahi, 
Gq86, E1presidente, Ramsey-WMF, Cparle, Anooprao, SandraF_WMF, 
Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, Tramullas, Acer, 
merbst, LawExplorer, Salgo60, Silverfish, Poyekhali, _jensen, rosalieper, 
Taiwania_Justo, Scott_WUaS, Jonas, Xmlizer, Susannaanas, Ixocactus, Wong128hk, 
Jane023, jkroll, Wikidata-bugs, Jdouglas, Base, matthiasmullie, aude, 
Tobias1984, El_Grafo, Dinoguy1000, Manybubbles, Ricordisamoa, Wesalius, 
Raymond, Steinsplitter, Mbch331, Keegan
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2021-03-03 Thread Zbyszko
Zbyszko added a comment.


  The first M-entity mentioned in the ticket was missing because of a bug in 
the weekly reloads, which has now been fixed. Entries added before the reload 
should be available.
  
  As for the second part of the ticket: SDC stores contentUrl URI-encoded 
(although the MD5 is apparently calculated from the decoded filename). I 
modified the query to account for that:
  
SELECT (COUNT(DISTINCT ?image) AS ?images) (COUNT(DISTINCT ?file) AS ?files)
WITH
{
  SELECT ?image ?contentUrl
  WHERE
  {
    # The endpoint IRI was lost in this archive; the Wikidata endpoint is assumed.
    SERVICE <https://query.wikidata.org/sparql>
    {
      ?item wdt:P31 wd:Q5153359 .
      ?item wdt:P18 ?image .
    }
    # Decoded filename: used only to compute the MD5 hash.
    BIND (REPLACE(wikibase:decodeUri(SUBSTR(STR(?image), 52)), " ", "_") AS ?filename)
    # Still URI-encoded despite the variable name: the form stored in schema:contentUrl.
    BIND (REPLACE(SUBSTR(STR(?image), 52), "%20", "_") AS ?filenameUnencoded)
    BIND (MD5(?filename) AS ?MD5)
    BIND (URI(CONCAT("https://upload.wikimedia.org/wikipedia/commons/",
                     SUBSTR(?MD5, 1, 1), "/", SUBSTR(?MD5, 1, 2), "/",
                     ?filenameUnencoded)) AS ?contentUrl)
  }
} AS %get_some_images_from_Wikidata
WHERE
{
  INCLUDE %get_some_images_from_Wikidata
  OPTIONAL { ?file schema:contentUrl ?contentUrl . }
}
  
  This yields (at the time of writing):
  
  | images | files |
  | ------ | ----- |
  | 6326   | 6318  |
  
  That leaves 8 images unaccounted for. Of those, 7 have no structured data 
defined, and the last one has newly added structured data, so it possibly 
wasn't present in the latest dump we reload from.
  
  Please let me know if this resolves the issue.
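
  For readers who want to check a single file by hand, the URL construction 
in the query above can be sketched in Python. This is only an illustration, 
not code from the ticket: the function name content_url and the two constants 
are hypothetical, the Special:FilePath prefix is assumed to be what 
STR(?image) starts with (it is 51 characters long, which matches the SUBSTR 
offset of 52), and urllib's unquote is assumed to behave like 
wikibase:decodeUri for ordinary filenames.

```python
import hashlib
from urllib.parse import unquote

# Assumed prefix of wdt:P18 values; its length (51) matches SUBSTR(..., 52).
FILEPATH_PREFIX = "http://commons.wikimedia.org/wiki/Special:FilePath/"
UPLOAD_BASE = "https://upload.wikimedia.org/wikipedia/commons/"


def content_url(image_uri: str) -> str:
    """Mirror the BIND steps of the query for one P18 value."""
    encoded = image_uri[len(FILEPATH_PREFIX):]
    # Filename as it appears in schema:contentUrl: still URI-encoded,
    # with encoded spaces turned into underscores.
    encoded_name = encoded.replace("%20", "_")
    # Filename used for hashing: fully decoded, spaces as underscores.
    decoded_name = unquote(encoded).replace(" ", "_")
    digest = hashlib.md5(decoded_name.encode("utf-8")).hexdigest()
    # Directory layout: first hex digit, then first two hex digits.
    return f"{UPLOAD_BASE}{digest[0]}/{digest[:2]}/{encoded_name}"
```

  Calling content_url(FILEPATH_PREFIX + "Caf%C3%A9%20au%20lait.jpg") hashes 
the decoded name "Café_au_lait.jpg" but keeps "Caf%C3%A9_au_lait.jpg" in the 
returned URL, which is the mismatch the modified query works around.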



[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2021-03-03 Thread Zbyszko
Zbyszko claimed this task.



[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2020-12-07 Thread CBogen
CBogen added a comment.


  Timebox the investigation to half a day, then re-evaluate.



[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2020-12-07 Thread RKemper
RKemper updated the task description.



[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2020-12-07 Thread CBogen
CBogen moved this task from All WDQS-related tasks to Current work on the 
Wikidata-Query-Service board.
CBogen added a project: Discovery-Search (Current work).

WORKBOARD
  https://phabricator.wikimedia.org/project/board/891/



[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2020-12-03 Thread CBogen
CBogen added a project: Wikidata-Query-Service.



[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2020-12-03 Thread Omar_Ghrida
Omar_Ghrida added a project: Commons.



[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2020-12-03 Thread Vojtech.dostal
Vojtech.dostal added a subscriber: Lydia_Pintscher.
Vojtech.dostal added a comment.


  Adding Lydia on her request.



[Wikidata-bugs] [Maniphest] T269302: Many files on Commons cannot be found in WCQS

2020-12-02 Thread Reedy
Reedy added a project: SDC General.
Restricted Application added a project: Wikidata.
