[ 
https://issues.apache.org/jira/browse/HBASE-14849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhan Zhang updated HBASE-14849:
-------------------------------
    Release Note: 
The user-configurable parameters for HBase datasources are defined in 
org.apache.hadoop.hbase.spark.datasources.HBaseSparkConf; refer to that class 
for details.

Users can set them either in SparkConf, where they take effect globally, or 
per table, where they override the values set in SparkConf. If a parameter is 
not set, its default value takes effect.

Currently three parameters are supported:
1. spark.hbase.blockcache.enable enables or disables the block cache. It is 
enabled by default, but note that leaving it enabled may slow down the system.
2. spark.hbase.cacheSize sets the cache size used when performing an HBase 
table scan. The default value is 1000.
3. spark.hbase.batchNum sets the batch number used when performing an HBase 
table scan. The default value is 1000.
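
The lookup order described in this note (per-table setting first, then 
SparkConf, then the default) can be illustrated with a minimal sketch. This 
is only a model built from plain dictionaries, not the connector's code: the 
function name resolve, the dictionary names, and the use of string values are 
assumptions made for illustration. Only the three keys and their defaults come 
from the release note.

```python
# Hypothetical model of the precedence described in the release note.
# It does not call Spark or HBase; plain dicts stand in for SparkConf
# and for the per-table options.

# Keys and defaults as stated in the release note (string form is an assumption).
DEFAULTS = {
    "spark.hbase.blockcache.enable": "true",
    "spark.hbase.cacheSize": "1000",
    "spark.hbase.batchNum": "1000",
}


def resolve(key, spark_conf=None, table_options=None):
    """Return the effective value for key.

    Order: per-table option, then the global SparkConf value, then the default.
    """
    for source in (table_options or {}, spark_conf or {}):
        if key in source:
            return source[key]
    return DEFAULTS[key]


# Global setting: disable the block cache for every table.
spark_conf = {
    "spark.hbase.blockcache.enable": "false",
    "spark.hbase.cacheSize": "2000",
}

# Per-table setting: this table overrides only the cache size.
table_options = {"spark.hbase.cacheSize": "500"}

effective = {k: resolve(k, spark_conf, table_options) for k in DEFAULTS}
```

In this sketch the table sees the block cache disabled (from the global 
setting), a cache size of 500 (the per-table override wins over 2000), and a 
batch number of 1000 (the default, since nothing sets it).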

> Add option to set block cache to false on SparkSQL executions
> -------------------------------------------------------------
>
>                 Key: HBASE-14849
>                 URL: https://issues.apache.org/jira/browse/HBASE-14849
>             Project: HBase
>          Issue Type: New Feature
>          Components: spark
>            Reporter: Ted Malaska
>            Assignee: Zhan Zhang
>         Attachments: HBASE-14849-1.patch, HBASE-14849-2.patch, 
> HBASE-14849.patch
>
>
> I was working at a client with a ported down version of the Spark module for 
> HBase and realized we didn't add an option to turn off block cache for the 
> scans.  
> At the client I just disabled all caching with Spark SQL; this is an easy but 
> very impactful fix.
> The fix for this patch will make this configurable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
