[jira] [Updated] (SPARK-46523) Single Hive/Datasource table partitioning to multiple storage system- (e.g, S3 and HDFS)

zhixingheyi-tian (Jira) Wed, 27 Dec 2023 00:49:04 -0800


     [ 
https://issues.apache.org/jira/browse/SPARK-46523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


zhixingheyi-tian updated SPARK-46523:
-------------------------------------
    Description: 
Single Hive/Datasource table partitioning to multiple storage system- (e.g, S3 
and HDFS)

For Hive table:

 
{code:java}
CREATE  TABLE htable a string, b string)  PARTITIONED BY ( p string ) location 
"hdfs://{cluster}}/user/hadoop/htable/";

alter table htable  add partition(p='p1')  location 
's3a://{bucketname}/usr/hive/warehouse/htable/p=p1';{code}
 

When inserting into htable,  or insert overwrite htable.  New data of “p=p1” 
will insert table location storage. This does not meet the requirements.

Is there any best practise?  Or is there a plan to support this feature?

Thanks

> Single Hive/Datasource table partitioning to multiple storage system- (e.g, 
> S3 and HDFS)
> ----------------------------------------------------------------------------------------
>
>                 Key: SPARK-46523
>                 URL: https://issues.apache.org/jira/browse/SPARK-46523
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 3.3.2
>            Reporter: zhixingheyi-tian
>            Priority: Major
>
> Single Hive/Datasource table partitioning to multiple storage system- (e.g, 
> S3 and HDFS)
> For Hive table:
>  
> {code:java}
> CREATE  TABLE htable a string, b string)  PARTITIONED BY ( p string ) 
> location "hdfs://{cluster}}/user/hadoop/htable/";
> alter table htable  add partition(p='p1')  location 
> 's3a://{bucketname}/usr/hive/warehouse/htable/p=p1';{code}
>  
> When inserting into htable,  or insert overwrite htable.  New data of “p=p1” 
> will insert table location storage. This does not meet the requirements.
> Is there any best practise?  Or is there a plan to support this feature?
> Thanks



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Updated] (SPARK-46523) Single Hive/Datasource table partitioning to multiple storage system- (e.g, S3 and HDFS)

Reply via email to