GoranSMilovanovic added a comment.
- given how often is `stat1007` used by analysts, - it barely has the resources for the computations that we need here (the languages x languages contingency table; takes at least ~25Gb to compute); - a fail-safe, batch processing procedure to compute large contingency matrices in R will be developed; - it will rely on `base` and/or `data.table` R functions, but it will be - less demanding in terms of memory resources. TASK DETAIL https://phabricator.wikimedia.org/T223118 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic Cc: Aklapper, Lydia_Pintscher, RazShuty, GoranSMilovanovic, darthmon_wmde, DannyS712, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen, rosalieper, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
