Hi,

When running SparkSQL branch 1.2.1 on an EC2 standalone cluster, the following query does not work:

    create table debug as
    select v1.* from t1 as v1
    left join t2 as v2 on v1.sku = v2.sku
    where v2.sku is null

Both t1 and t2 have 200 partitions. t1 has 10k rows, and t2 has 4k rows.

The query blocks at:

    14/12/17 15:56:54 INFO TaskSetManager: Finished task 133.0 in stage 2.0 (TID 541) in 370 ms on ip-10-79-184-49.ec2.internal (122/200)

Via the WebUI, I can see that 24 tasks are running, as the cluster has 24 cores. The other tasks have succeeded. It seems that these 24 tasks are blocked and never finish.

However, SparkSQL 1.1.0 works fine. There might be a problem with "join" in 1.2.1.

Any idea?

Hao
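
To make the intent of the query concrete, here is a minimal sketch that runs the same statement with Python's built-in sqlite3 module instead of Spark. The sample rows and the `name` column are made up for illustration, since the real contents of t1 and t2 are not shown here. It only illustrates the result the query is expected to produce (rows of t1 whose sku has no match in t2); it does not reproduce the hang.

```python
import sqlite3

# Hypothetical sample data: the real t1 and t2 are not shown in this message.
conn = sqlite3.connect(":memory:")
conn.execute("create table t1 (sku text, name text)")
conn.execute("create table t2 (sku text)")
conn.executemany(
    "insert into t1 values (?, ?)",
    [("a", "apple"), ("b", "banana"), ("c", "cherry")],
)
conn.executemany("insert into t2 values (?)", [("a",), ("c",)])

# Same statement as in the report: a left join filtered on a null right-side
# key keeps only the t1 rows that have no matching sku in t2.
conn.execute(
    """
    create table debug as
    select v1.* from t1 as v1
    left join t2 as v2 on v1.sku = v2.sku
    where v2.sku is null
    """
)

rows = conn.execute("select sku, name from debug order by sku").fetchall()
print(rows)
```

With this sample data, only the t1 row with sku "b" ends up in debug, because "a" and "c" both have a match in t2.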