Matthias Boehm created SYSTEMML-2278:
----------------------------------------
Summary: Performance issues of Spark table(X,Y) with a large number of distinct values
Key: SYSTEMML-2278
URL: https://issues.apache.org/jira/browse/SYSTEMML-2278
Project: SystemML
Issue Type: Task
Reporter: Matthias Boehm
{code}
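N = 1e9;
# X holds N distinct values, one per row
X = seq(1,N);
Y = 1+seq(1,N)/1000;
# empty loop, presumably to cut the DAG before table()
while(FALSE){}
Z = table(X,Y);
# empty loop, presumably to cut the DAG after table()
while(FALSE){}
print(sum(Z));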
{code}
