GoranSMilovanovic added a comment.
- On the first sight, there were only 687 projects whose reuse data were sqooped by the WDCM_Sqoop_Clients.R <https://github.com/wikimedia/analytics-wmde-WD-WikidataAnalytics/blob/master/_engines/_wdcmModules/WDCM_Sqoop_Clients.R> run, and - that number, as far as I remember, should be higher; - Could it be that something in the organization of our core Mediawiki databases <https://wikitech.wikimedia.org/wiki/MariaDB#Core_MediaWiki_databases> has changed? - Inspecting now, here's the first suspect: In fread(paste0("shardTables_", i, ".tsv"), sep = "\t") : File 'shardTables_4.tsv' has size 0. Returning a NULL data.table. from the WDCM_Sqoop_Clients.R <https://github.com/wikimedia/analytics-wmde-WD-WikidataAnalytics/blob/master/_engines/_wdcmModules/WDCM_Sqoop_Clients.R> log; **`s4` is Commons** - ? But then it seems that even more is missing. Will parse the sqoop module log. Notes: - the latest sqoop run ended on `2021-06-07 06:19:55`; - the next one is scheduled from stat1004's crontab to start on `2021-06-14 00:00:00`. The most probable next step following the analysis in SQL/MariaDB directly: - run a manual update of the WDCM sqoop module; monitor. TASK DETAIL https://phabricator.wikimedia.org/T284850 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic Cc: RhinosF1, GoranSMilovanovic, Tobi_WMDE_SW, Lydia_Pintscher, Aklapper, MisterSynergy, Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- [email protected] To unsubscribe send an email to [email protected]
