[jira] [Updated] (HBASE-28213) Evaluate using hbase-shaded-client-byo-hadoop for Spark connector

Istvan Toth (Jira) Wed, 22 Nov 2023 04:45:30 -0800


     [ 
https://issues.apache.org/jira/browse/HBASE-28213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Istvan Toth updated HBASE-28213:
--------------------------------
    Description: 
Since version 3.2, Spark uses the shaded hadoop-client-api and 
hadoop-client-runtime JARs.
We don't actually specify which HBase libraries the connector needs on the 
Spark client side, but the Cloudera docs, at least, specify the classes 
provided by "hbase mapredcp", which include the full unshaded Hadoop JAR set.

Investigate whether *hbase-shaded-client-byo-hadoop* together with 
*hadoop-client-api* and *hadoop-client-runtime* is enough for the connector, 
and if so, document how to set the Spark classpath.

Alternatively, if *hbase-shaded-client-byo-hadoop* is not enough, check 
whether *hbase-shaded-mapreduce* plus the above two shaded Hadoop client JARs 
provides everything needed.

  was:
Since version 3.2, Spark uses the shaded hadoop-client-api and 
hadoop-client-runtime JARs.
We don't actually specify which HBase libraries the connector needs on the 
Spark client side, but the Cloudera docs, at least, specify the classes 
provided by "hbase mapredcp", which include the full unshaded Hadoop JAR set.

Investigate whether *hbase-shaded-client-byo-hadoop* together with 
*hadoop-client-api* and *hadoop-client-runtime* is enough for the connector, 
and if so, document how to set the Spark classpath.


> Evaluate using hbase-shaded-client-byo-hadoop for Spark connector
> -----------------------------------------------------------------
>
>                 Key: HBASE-28213
>                 URL: https://issues.apache.org/jira/browse/HBASE-28213
>             Project: HBase
>          Issue Type: Improvement
>          Components: spark
>            Reporter: Istvan Toth
>            Priority: Major
>
> Since version 3.2, Spark uses the shaded hadoop-client-api and 
> hadoop-client-runtime JARs.
> We don't actually specify which HBase libraries the connector needs on the 
> Spark client side, but the Cloudera docs, at least, specify the classes 
> provided by "hbase mapredcp", which include the full unshaded Hadoop JAR set.
> Investigate whether *hbase-shaded-client-byo-hadoop* together with 
> *hadoop-client-api* and *hadoop-client-runtime* is enough for the connector, 
> and if so, document how to set the Spark classpath.
> Alternatively, if *hbase-shaded-client-byo-hadoop* is not enough, check 
> whether *hbase-shaded-mapreduce* plus the above two shaded Hadoop client JARs 
> provides everything needed.
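
As a rough illustration of what "setting the Spark classpath" could look like 
for the two candidates above, the following sketch builds spark-submit 
argument lists that ship the JARs with the standard --jars option. Every path 
and file name here is a hypothetical placeholder, and the sketch assumes the 
Spark distribution already supplies hadoop-client-api and 
hadoop-client-runtime. Whether either JAR set is actually sufficient for the 
connector is exactly what this issue asks to investigate.

```python
import shlex

# Hypothetical paths: adjust to the actual HBase and connector install layout.
CONNECTOR_JAR = "/opt/hbase-connectors/hbase-spark.jar"
SHADED_CLIENT_JAR = "/opt/hbase/lib/shaded-clients/hbase-shaded-client-byo-hadoop.jar"
SHADED_MAPREDUCE_JAR = "/opt/hbase/lib/shaded-clients/hbase-shaded-mapreduce.jar"


def spark_submit_command(hbase_jars, app):
    """Build a spark-submit argument list that ships the given JARs via --jars."""
    # --jars takes a comma-separated list of JARs that Spark adds to the
    # driver and executor classpaths.
    return ["spark-submit", "--jars", ",".join(hbase_jars), app]


# Candidate 1: connector plus hbase-shaded-client-byo-hadoop.
minimal = spark_submit_command([CONNECTOR_JAR, SHADED_CLIENT_JAR], "app.jar")

# Candidate 2 (fallback): connector plus hbase-shaded-mapreduce.
fallback = spark_submit_command([CONNECTOR_JAR, SHADED_MAPREDUCE_JAR], "app.jar")

print(shlex.join(minimal))
print(shlex.join(fallback))
```

Running a small read/write job with each command and watching for 
ClassNotFoundException or NoClassDefFoundError would be one way to check 
whether a candidate JAR set is complete.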



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
