Re: Performance Benchmark Hbase vs Cassandra

2017-06-29 Thread Saikat Kanjilal
You should consider using YCSB and writing an adapter for Spark perf tests against these databases, if one doesn't already exist. See here: https://github.com/brianfrankcooper/YCSB On Jun 29, 2017, at 7:33 PM, Raj, Deepu

Re: Performance Benchmark Hbase vs Cassandra

2017-06-29 Thread Ted Yu
For Cassandra, I found: https://www.instaclustr.com/multi-data-center-sparkcassandra-benchmark-round-2/ My coworker (on vacation at the moment) was running a benchmark with HBase. When he comes back, the results can be published. Note: it is hard to find comparison results with the same setup (hardware,

Performance Benchmark Hbase vs Cassandra

2017-06-29 Thread Raj, Deepu
Hi Team, I want to run a performance benchmark for a specific use case with Spark --> HBase and Spark --> Cassandra. Can anyone provide input on: 1. Scenarios / Parameters to monitor? 2. Any automation tool to make this work? 3. Any previous Learnings/
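As one possible starting point for the "parameters to monitor" question, the sketch below times a repeated operation and reports throughput plus latency percentiles. The choice of metrics is an assumption (they are common in storage benchmarks, not something this thread prescribes), and `run_benchmark` and `fake_write` are hypothetical names: `fake_write` only stands in for whatever Spark-to-HBase or Spark-to-Cassandra call would actually be measured.

```python
import math
import time
from typing import Callable, Dict, List


def run_benchmark(operation: Callable[[int], None], iterations: int) -> Dict[str, float]:
    """Time `operation` `iterations` times; report throughput and latency percentiles."""
    if iterations <= 0:
        raise ValueError("iterations must be positive")

    latencies_ms: List[float] = []
    start = time.perf_counter()
    for i in range(iterations):
        t0 = time.perf_counter()
        operation(i)
        latencies_ms.append((time.perf_counter() - t0) * 1000.0)
    elapsed = time.perf_counter() - start

    ordered = sorted(latencies_ms)

    def percentile(p: float) -> float:
        # Nearest-rank percentile over the sorted latencies.
        rank = max(1, math.ceil(p / 100.0 * len(ordered)))
        return ordered[rank - 1]

    return {
        "iterations": float(iterations),
        "ops_per_sec": iterations / elapsed if elapsed > 0 else float("inf"),
        "p50_ms": percentile(50),
        "p95_ms": percentile(95),
        "max_ms": ordered[-1],
    }


if __name__ == "__main__":
    store = {}

    def fake_write(i: int) -> None:
        # Hypothetical stand-in for a real write to the database under test.
        store[i] = "row-%d" % i

    print(run_benchmark(fake_write, 10_000))
```

Running the same harness with identical data, iteration counts, and hardware against each database keeps the two sets of numbers comparable.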

RE: Spark Hbase Connector

2017-06-29 Thread Raj, Deepu
Thanks Weiqing / Ted From: Weiqing Yang [mailto:yangweiqing...@gmail.com] Sent: Friday, 30 June 2017 10:34 AM To: Ted Yu Cc: Raj, Deepu ; dev@spark.apache.org Subject: Re: Spark Hbase Connector https://github.com/hortonworks-spark/shc/releases

Re: Spark Hbase Connector

2017-06-29 Thread Weiqing Yang
https://github.com/hortonworks-spark/shc/releases (v1.x.x-2.1 for Spark 2.1) https://github.com/hortonworks-spark/shc/tree/branch-2.1 (for Spark 2.1) On Thu, Jun 29, 2017 at 4:36 PM, Ted Yu wrote: > Please take a look at HBASE-16179 (work in progress). > > On Thu, Jun 29,

Re: PlanLater not being optimized out of Query Plan

2017-06-29 Thread Russell Spitzer
Figured it out: the problem was in my Exec. I hadn't defined it as a case class (just a normal class) and had left in stubs for the Product trait methods. This led to some... unwanted behaviors. On Thu, Jun 29, 2017 at 4:26 PM Russell Spitzer wrote: > I've been writing
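The message above does not spell out why stubbed Product methods keep a placeholder in the plan. One plausible reading, offered here as an assumption, is that generic tree rewrites find a node's children by enumerating its product elements, so a stub that reports no elements hides the child from the rewrite. The toy below is a Python analogy only, not Spark code; `Node`, `elements`, `Placeholder`, `Planned`, `GoodUnary`, and `StubbedUnary` are all hypothetical names.

```python
from dataclasses import dataclass, fields


class Node:
    """Toy tree node; elements() plays the role of a Product-style iterator."""

    def elements(self):
        return [getattr(self, f.name) for f in fields(self)]

    def transform_up(self, rule):
        # Rewrite child nodes first, rebuild this node from them, then apply the rule.
        new_args = [
            e.transform_up(rule) if isinstance(e, Node) else e
            for e in self.elements()
        ]
        rebuilt = type(self)(*new_args) if new_args else self
        return rule(rebuilt)


@dataclass(frozen=True)
class Placeholder(Node):
    name: str


@dataclass(frozen=True)
class Planned(Node):
    name: str


@dataclass(frozen=True)
class GoodUnary(Node):
    child: Node


@dataclass(frozen=True)
class StubbedUnary(Node):
    child: Node

    def elements(self):
        # Stub: reports no elements, so the traversal never visits `child`.
        return []


def resolve(node):
    """Replace every placeholder the traversal reaches with a planned node."""
    return Planned(node.name) if isinstance(node, Placeholder) else node


if __name__ == "__main__":
    print(GoodUnary(Placeholder("scan")).transform_up(resolve))
    print(StubbedUnary(Placeholder("scan")).transform_up(resolve))
```

In this toy, the placeholder under `GoodUnary` is replaced, while the one under `StubbedUnary` survives because the traversal never sees it.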

Spark Hbase Connector

2017-06-29 Thread Raj, Deepu
Hi Team, Is there a stable Spark HBase connector for Spark 2.0? Thanks, Deepu Raj

PlanLater not being optimized out of Query Plan

2017-06-29 Thread Russell Spitzer
I've been writing some toy experimental strategies which end up adding UnaryExec nodes to the plan. For some reason, though, my "PlanLater" nodes are ignored and remain in the final physical plan. Is there anything in general that I might be missing? I'm doing my sample work on 2.1.X and