Zoltán Borók-Nagy created IMPALA-11339:
------------------------------------------
Summary: Implement LOAD DATA INPATH for Iceberg tables
Key: IMPALA-11339
URL: https://issues.apache.org/jira/browse/IMPALA-11339
Project: IMPALA
Issue Type: Bug
Components: Frontend
Reporter: Zoltán Borók-Nagy
Currently Impala doesn't support LOAD DATA statements for Iceberg tables.
Some user workflows still use this statement, so it would be nice to implement
it in some way.
A possible solution would be to
# create a temp table on those sets of files with the right schema
# run a {{insert into iceberg table select * from tmp table}}
# drop the tmp table and delete the files in the staging directory
It does some copying, but probably this would be the safest solution.
Users might specify the partition columns in the [PARTITION (partcol1=val1,
partcol2=val2 ...)] clause. In this case the data files don't necessarily
contain the partition values, i.e. we need to create the tmp table with proper
partitioning.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]