GoranSMilovanovic added a comment.

  - At first sight, reuse data were sqooped for only 687 projects by the WDCM_Sqoop_Clients.R <https://github.com/wikimedia/analytics-wmde-WD-WikidataAnalytics/blob/master/_engines/_wdcmModules/WDCM_Sqoop_Clients.R> run, and
  - that number, as far as I remember, should be higher;
  - Could it be that something in the organization of our core MediaWiki databases <https://wikitech.wikimedia.org/wiki/MariaDB#Core_MediaWiki_databases> has changed?
  - Inspecting now, here's the first suspect:
  
    In fread(paste0("shardTables_", i, ".tsv"), sep = "\t") :
      File 'shardTables_4.tsv' has size 0. Returning a NULL data.table.
  
  from the WDCM_Sqoop_Clients.R <https://github.com/wikimedia/analytics-wmde-WD-WikidataAnalytics/blob/master/_engines/_wdcmModules/WDCM_Sqoop_Clients.R> log; **`s4` is Commons**, isn't it?
  But even more than that seems to be missing. I will parse the sqoop module log.
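
  A minimal sketch of how the empty shard files could be listed in one pass instead of reading the warnings one by one. It assumes that the `shardTables_<n>.tsv` files named in the warning above all sit in one directory; the function name `find_empty_shard_tables` is hypothetical and is not part of WDCM_Sqoop_Clients.R.

```python
from pathlib import Path
import re


def find_empty_shard_tables(directory="."):
    """Return the shard numbers whose shardTables_<n>.tsv file has size 0.

    Assumption: the file naming follows the pattern seen in the fread
    warning, i.e. shardTables_<n>.tsv, one file per shard.
    """
    pattern = re.compile(r"^shardTables_(\d+)\.tsv$")
    empty = []
    for path in Path(directory).iterdir():
        match = pattern.match(path.name)
        # Only regular files named like shardTables_<n>.tsv are considered.
        if match and path.is_file() and path.stat().st_size == 0:
            empty.append(int(match.group(1)))
    return sorted(empty)


if __name__ == "__main__":
    # Report every shard whose table listing came back empty.
    for shard in find_empty_shard_tables():
        print(f"shardTables_{shard}.tsv is empty (shard s{shard})")
```

  Any shard reported here would be a candidate for the direct check in SQL/MariaDB before the sqoop module is run again.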
  
  Notes:
  
  - the latest sqoop run ended on `2021-06-07 06:19:55`;
  - the next one is scheduled from stat1004's crontab to start on `2021-06-14 00:00:00`.
  
  The most probable next step, following an analysis directly in SQL/MariaDB:
  
  - run a manual update of the WDCM sqoop module; monitor.

TASK DETAIL
  https://phabricator.wikimedia.org/T284850

To: GoranSMilovanovic
Cc: RhinosF1, GoranSMilovanovic, Tobi_WMDE_SW, Lydia_Pintscher, Aklapper, 
MisterSynergy, Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, QZanden, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331