RE: Substring in Spark SQL

Cheng, Hao Mon, 04 Aug 2014 17:38:24 -0700

>From the log, I noticed the "substr" was added on July 15th, 1.0.1 release 
>should be earlier than that. Community is now working on releasing the 1.1.0, 
>and also some of the performance improvements were added. Probably you can try 
>that for your benchmark.

Cheng Hao

-----Original Message-----
From: Tom [mailto:thubregt...@gmail.com] 
Sent: Tuesday, August 05, 2014 5:53 AM
To: u...@spark.incubator.apache.org
Subject: Substring in Spark SQL

Hi,

I am trying to run the  Big Data Benchmark 
<https://amplab.cs.berkeley.edu/benchmark/>  , and I am stuck at Query 2 for 
Spark SQL using Spark 1.0.1:
SELECT SUBSTR(sourceIP, 1, X), SUM(adRevenue) FROM uservisits GROUP BY 
SUBSTR(sourceIP, 1, X) When I look into the sourcecode, it seems that "substr" 
is supported by HiveQL, but not by Spark SQL, correct?

Thanks!

Tom

--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Substring-in-Spark-SQL-tp11373.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional 
commands, e-mail: user-h...@spark.apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

RE: Substring in Spark SQL

Reply via email to