Zoltán Borók-Nagy created IMPALA-11339:
------------------------------------------

             Summary: Implement LOAD DATA INPATH for Iceberg tables
                 Key: IMPALA-11339
                 URL: https://issues.apache.org/jira/browse/IMPALA-11339
             Project: IMPALA
          Issue Type: Bug
          Components: Frontend
            Reporter: Zoltán Borók-Nagy


Currently Impala doesn't support LOAD DATA statements for Iceberg tables.

Some user workflows still rely on this statement, so it would be good to 
support it in some form.

A possible solution would be to
 # create a temp table with the right schema on top of the files to be loaded
 # run {{INSERT INTO <iceberg table> SELECT * FROM <tmp table>}}
 # drop the tmp table and delete the files in the staging directory

This approach involves copying the data, but it is probably the safest solution.
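
The three steps above can be sketched as a small Python helper that only builds the statement sequence. This is an illustration under stated assumptions, not the actual frontend implementation: the function name {{plan_load_data}}, the temp table name {{tmp_load_data}}, and the choice of Parquet are all hypothetical, and the caller is assumed to supply the column list.

```python
def plan_load_data(src_dir, target_table, columns,
                   tmp_table="tmp_load_data", file_format="PARQUET"):
    """Return the SQL statements for the temp-table workaround, in order.

    All names here are hypothetical; `columns` is a list of
    (name, type) pairs describing the schema of the staged files.
    """
    col_defs = ", ".join(f"{name} {ctype}" for name, ctype in columns)
    return [
        # Step 1: temp external table over the staged files.
        f"CREATE EXTERNAL TABLE {tmp_table} ({col_defs}) "
        f"STORED AS {file_format} LOCATION '{src_dir}'",
        # Step 2: copy the rows into the Iceberg table.
        f"INSERT INTO {target_table} SELECT * FROM {tmp_table}",
        # Step 3: drop the temp table. Dropping an external table leaves
        # the files in place, so deleting the staged files is a separate
        # filesystem step that this sketch does not cover.
        f"DROP TABLE {tmp_table}",
    ]


if __name__ == "__main__":
    for stmt in plan_load_data("/staging/in", "ice_tbl",
                               [("id", "INT"), ("name", "STRING")]):
        print(stmt + ";")
```

Building the statements as plain strings keeps the sketch independent of any client library; a real implementation would also need to pick a temp table name that cannot collide with an existing table.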

Users might specify the partition columns in the [PARTITION (partcol1=val1, 
partcol2=val2 ...)] clause. In that case the data files don't necessarily 
contain the partition values, so we need to create the tmp table with the 
proper partitioning.
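
One way to handle the PARTITION clause, sketched in Python as a statement builder: the temp table declares the partition columns, and the staging directory is attached as a single partition carrying the user-supplied values, so the values come from the partition metadata rather than from the files. Everything named here ({{plan_partitioned_load}}, {{tmp_load_data}}, Parquet as the file format) is hypothetical, and the explicit column lists in the INSERT are an assumption made to avoid depending on column order.

```python
def plan_partitioned_load(src_dir, target_table, data_cols, part_cols,
                          part_values, tmp_table="tmp_load_data"):
    """Return SQL statements for a load with a PARTITION clause.

    Hypothetical sketch: data_cols and part_cols are lists of (name, type)
    pairs; part_values maps each partition column name to a SQL literal
    that the caller has already quoted (e.g. "2022" or "'hu'").
    """
    col_defs = ", ".join(f"{n} {t}" for n, t in data_cols)
    part_defs = ", ".join(f"{n} {t}" for n, t in part_cols)
    spec = ", ".join(f"{n}={part_values[n]}" for n, _ in part_cols)
    names = ", ".join(n for n, _ in list(data_cols) + list(part_cols))
    return [
        # Partition columns are declared separately because the data
        # files do not necessarily contain them.
        f"CREATE EXTERNAL TABLE {tmp_table} ({col_defs}) "
        f"PARTITIONED BY ({part_defs}) STORED AS PARQUET",
        # Attach the staging directory as one partition with the
        # values given in the PARTITION clause.
        f"ALTER TABLE {tmp_table} ADD PARTITION ({spec}) "
        f"LOCATION '{src_dir}'",
        f"INSERT INTO {target_table} ({names}) "
        f"SELECT {names} FROM {tmp_table}",
        f"DROP TABLE {tmp_table}",
    ]


if __name__ == "__main__":
    for stmt in plan_partitioned_load("/staging/in", "ice_tbl",
                                      [("id", "INT")], [("year", "INT")],
                                      {"year": "2022"}):
        print(stmt + ";")
```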



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
