Re: Low Performance of Shark over Spark.

2014-08-11 Thread vinay . kashyap
much difference was seen.   Thanks and regards Vinay Kashyap   From:"Yana Kadiyska" Sent:"vinay.kashyap" Date:Sat, August 9, 2014 6:56 am Subject:Re: Low Performance of Shark over Spark. Can you see where your t

Re: Low Performance of Shark over Spark.

2014-08-08 Thread vinay.kashyap
: http://apache-spark-user-list.1001560.n3.nabble.com/Low-Performance-of-Shark-over-Spark-tp11649p11776.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spar

Re: Low Performance of Shark over Spark.

2014-08-08 Thread Mayur Rustagi
ries are fixed and for specific set of data. > > > > Thanks and regards > > Vinay Kashyap > > > From:"Xiangrui Meng" > Sent:vinay.kash...@socialinfra.net > Cc:"user@spark.apache.org" > Date:Th

Re: Low Performance of Shark over Spark.

2014-08-07 Thread vinay . kashyap
Vinay Kashyap From:"Xiangrui Meng" Sent:vinay.kash...@socialinfra.net Cc:"user@spark.apache.org" Date:Thu, August 7, 2014 11:06 pm Subject:Re: Low Performance of Shark over Spark. > Did you cache the table? There are c

Re: Low Performance of Shark over Spark.

2014-08-07 Thread Xiangrui Meng
Did you cache the table? There are couple ways of caching a table in Shark: https://github.com/amplab/shark/wiki/Shark-User-Guide On Thu, Aug 7, 2014 at 6:51 AM, wrote: > Dear all, > > I am using Spark 0.9.2 in Standalone mode. Hive and HDFS in CDH 5.1.0. > > 6 worker nodes each with memory 96GB

Low Performance of Shark over Spark.

2014-08-07 Thread vinay . kashyap
Dear all, I am using Spark 0.9.2 in Standalone mode. Hive and HDFS in CDH 5.1.0. 6 worker nodes each with memory 96GB and 32 cores. I am using Shark Shell to execute queries on Spark. I have a raw_table ( of size 3TB with replication 3 ) which is partitioned by year, month and day. I am running