I am attempting to read an hbase table in pyspark with a range scan.
conf = {
"hbase.zookeeper.quorum": host,
"hbase.mapreduce.inputtable": table,
"hbase.mapreduce.scan" : scan
}
hbase_rdd = sc.newAPIHadoopRDD(
"org.apache.hadoop.hbase.mapreduce.TableInputFormat",
"o
I am running spark with a cloudera cluster, spark version
0.9.0-cdh5.0.0-beta-2
While nothing else is running on the cluster i am having frequent worker
failures with errors like
AssociationError [akka.tcp://sparkWorker@worker5:7078] ->
[akka.tcp://sparkExecutor@worker5:37487]: Error [Association