AndrewTavis_WMDE created this task. AndrewTavis_WMDE added projects: Wikidata, Wikidata Analytics. Restricted Application added a subscriber: Aklapper.
TASK DESCRIPTION The active_user_changes <https://phabricator.wikimedia.org/diffusion/ADES/browse/master/src/wikidata/site_stats/sql/active_user_changes.sql> SQL script doesn't have an upper bound in the `WHERE` clause. The following subsets by time: `rc_timestamp >= DATE_FORMAT(DATE_SUB(NOW(), INTERVAL 30 day), '%Y%m%d%H%i%s')`, with the problem being that the value of the produced metric will be determined by when the script is ran in morning depending on how busy the cluster is rather than running in the morning and collecting the data within a distinct 30 day period. A new `WHERE` clause should likely be between the start of the day 30 days ago and midnight of the last evening. This issue isn't critical, but it would be nice to check where we can do some improvements for data consistency :) If there's a group of other queries that should be checked with this I'd be happy to look into it all a bit more! TASK DETAIL https://phabricator.wikimedia.org/T338464 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: AndrewTavis_WMDE Cc: Aklapper, AndrewTavis_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- [email protected] To unsubscribe send an email to [email protected]
