GoranSMilovanovic added a comment.
- failed; even {data.table} which is notoriously efficient in dealing with
large data sets fails to produce the contingency (run on `stat1007`):
### --- Contingency w. {data.table}
t1 <- Sys.time()
cT <- dcast(dataSet, property ~ item,
fill = 0,
drop = F,
value.var = "value")
Sys.time() - t1
results in
Error in CJ(1:1190, 1:22295882) :
Cross product of elements provided to CJ() would result in 26532099580
rows which exceeds .Machine$integer.max == 2147483647
Fallback option is active now: sampling.
TASK DETAIL
https://phabricator.wikimedia.org/T214897
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: GoranSMilovanovic
Cc: RazShuty, Addshore, JAllemandou, Aklapper, GoranSMilovanovic,
Lydia_Pintscher, alaa_wmde, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen,
rosalieper, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs