>From the log, I noticed the "substr" was added on July 15th, 1.0.1 release >should be earlier than that. Community is now working on releasing the 1.1.0, >and also some of the performance improvements were added. Probably you can try >that for your benchmark.
Cheng Hao -----Original Message----- From: Tom [mailto:thubregt...@gmail.com] Sent: Tuesday, August 05, 2014 5:53 AM To: u...@spark.incubator.apache.org Subject: Substring in Spark SQL Hi, I am trying to run the Big Data Benchmark <https://amplab.cs.berkeley.edu/benchmark/> , and I am stuck at Query 2 for Spark SQL using Spark 1.0.1: SELECT SUBSTR(sourceIP, 1, X), SUM(adRevenue) FROM uservisits GROUP BY SUBSTR(sourceIP, 1, X) When I look into the sourcecode, it seems that "substr" is supported by HiveQL, but not by Spark SQL, correct? Thanks! Tom -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Substring-in-Spark-SQL-tp11373.html Sent from the Apache Spark User List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org