aimenglin commented on issue #2685:
URL: https://github.com/apache/iceberg/issues/2685#issuecomment-858925332


   @marton-bod Thanks for your patience and time for helping me debug. 
   1. Log
   ```
   2021-06-09T23:25:37,850  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Compiling 
command(queryId=shenme_google_com_20210609232537_9d2ce449-4ebc-403b-8f74-8b08ba9f62d1):
 INSERT INTO TABLE hadoop_table_1
     VALUES (7, "John", "Basketball", "Senior")
   2021-06-09T23:25:37,876  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Concurrency mode is disabled, not creating a lock manager
   2021-06-09T23:25:37,876  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Starting Semantic Analysis
   2021-06-09T23:25:37,903  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Completed phase 1 of Semantic Analysis
   2021-06-09T23:25:37,903  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Get metadata for source tables
   2021-06-09T23:25:37,903  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Get metadata for subqueries
   2021-06-09T23:25:37,903  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Get metadata for destination tables
   2021-06-09T23:25:37,928  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Completed getting MetaData in Semantic Analysis
   2021-06-09T23:25:38,101  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Context: New scratch dir is 
hdfs://shenme-cluster-3-m/tmp/hive/shenme_google_com/521d08fa-9fda-4ca3-8bab-3d0b60ddc6de/hive_2021-06-09_23-25-37_866_2611149539519878546-1
   2021-06-09T23:25:38,619  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Get metadata for source tables
   2021-06-09T23:25:38,620  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Get metadata for subqueries
   2021-06-09T23:25:38,620  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Get metadata for source tables
   2021-06-09T23:25:38,631  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Get metadata for subqueries
   2021-06-09T23:25:38,631  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Get metadata for destination tables
   2021-06-09T23:25:40,604  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Launching Job 1 out of 1
   2021-06-09T23:25:40,604  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Starting task [Stage-2:MAPRED] in serial mode
   2021-06-09T23:25:40,686  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Context: New scratch dir is 
hdfs://shenme-cluster-3-m/tmp/hive/shenme_google_com/521d08fa-9fda-4ca3-8bab-3d0b60ddc6de/hive_2021-06-09_23-25-37_866_2611149539519878546-1
   2021-06-09T23:25:40,691  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
tez.TezSessionPoolManager: The current user: shenme_google_com, session user: 
shenme_google_com
   2021-06-09T23:25:40,691  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
tez.TezSessionPoolManager: Current queue name is null incoming queue name is 
null
   2021-06-09T23:25:40,691  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
exec.Task: Subscribed to counters: [] for queryId: 
shenme_google_com_20210609232537_9d2ce449-4ebc-403b-8f74-8b08ba9f62d1
   2021-06-09T23:25:38,631  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Get metadata for destination tables
   2021-06-09T23:25:38,696  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Context: New scratch dir is 
hdfs://shenme-cluster-3-m/tmp/hive/shenme_google_com/521d08fa-9fda-4ca3-8bab-3d0b60ddc6de/hive_2021-06-09_23-25-37_866_2611149539519878546-1
   2021-06-09T23:25:38,702  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
mr.Catalogs: Catalog is not configured
   2021-06-09T23:25:38,944  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hadoop.HadoopTables: Table location loaded: 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/test1
   2021-06-09T23:25:38,944  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hive.HiveIcebergSerDe: Using schema from existing table 
{"type":"struct","fields":[{"id":1,"name":"id","required":true,"type":"int"},{"id":2,"name":"name","required":false,"type":"string"},{"id":3,"name":"major","required":false,"type":"string"},{"id":4,"name":"grade","required":false,"type":"string"}]}
   2021-06-09T23:25:39,006  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
mr.Catalogs: Catalog is not configured
   2021-06-09T23:25:39,238  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hadoop.HadoopTables: Table location loaded: 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/test1
   2021-06-09T23:25:39,238  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hive.HiveIcebergSerDe: Using schema from existing table 
{"type":"struct","fields":[{"id":1,"name":"id","required":true,"type":"int"},{"id":2,"name":"name","required":false,"type":"string"},{"id":3,"name":"major","required":false,"type":"string"},{"id":4,"name":"grade","required":false,"type":"string"}]}
   2021-06-09T23:25:39,238  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
mr.Catalogs: Catalog is not configured
   2021-06-09T23:25:39,436  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hadoop.HadoopTables: Table location loaded: 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/test1
   2021-06-09T23:25:39,436  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hive.HiveIcebergSerDe: Using schema from existing table 
{"type":"struct","fields":[{"id":1,"name":"id","required":true,"type":"int"},{"id":2,"name":"name","required":false,"type":"string"},{"id":3,"name":"major","required":false,"type":"string"},{"id":4,"name":"grade","required":false,"type":"string"}]}
   2021-06-09T23:25:39,437  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
common.FileUtils: Creating directory if it doesn't exist: 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/test1/.hive-staging_hive_2021-06-09_23-25-37_866_2611149539519878546-1
   2021-06-09T23:25:39,518  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
mr.Catalogs: Catalog is not configured
   2021-06-09T23:25:39,707  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hadoop.HadoopTables: Table location loaded: 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/test1
   2021-06-09T23:25:39,800  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: CBO Succeeded; optimized logical plan.
   2021-06-09T23:25:39,800  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ppd.OpProcFactory: Processing for FS(4)
   2021-06-09T23:25:39,800  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ppd.OpProcFactory: Processing for SEL(3)
   2021-06-09T23:25:39,800  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ppd.OpProcFactory: Processing for UDTF(2)
   2021-06-09T23:25:39,800  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ppd.OpProcFactory: Processing for SEL(1)
   2021-06-09T23:25:39,800  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ppd.OpProcFactory: Processing for TS(0)
   2021-06-09T23:25:39,823  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.TezCompiler: Cycle free: true
   2021-06-09T23:25:39,828  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
parse.CalcitePlanner: Completed plan generation
   2021-06-09T23:25:39,829  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Semantic Analysis Completed (retrial = false)
   2021-06-09T23:25:39,829  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:col1, 
type:int, comment:null), FieldSchema(name:col2, type:string, comment:null), 
FieldSchema(name:col3, type:string, comment:null), FieldSchema(name:col4, 
type:string, comment:null)], properties:null)
   2021-06-09T23:25:39,829  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Completed compiling 
command(queryId=shenme_google_com_20210609232537_9d2ce449-4ebc-403b-8f74-8b08ba9f62d1);
 Time taken: 1.979 seconds
   2021-06-09T23:25:39,829  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
reexec.ReExecDriver: Execution #1 of query
   2021-06-09T23:25:39,829  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Concurrency mode is disabled, not creating a lock manager
   2021-06-09T23:25:39,829  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Executing 
command(queryId=shenme_google_com_20210609232537_9d2ce449-4ebc-403b-8f74-8b08ba9f62d1):
 INSERT INTO TABLE hadoop_table_1
     VALUES (7, "John", "Basketball", "Senior")
   2021-06-09T23:25:39,829  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Query ID = 
shenme_google_com_20210609232537_9d2ce449-4ebc-403b-8f74-8b08ba9f62d1
   2021-06-09T23:25:39,829  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Total jobs = 1
   2021-06-09T23:25:39,830  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Starting task [Stage-0:DDL] in serial mode
   2021-06-09T23:25:39,852  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
mr.Catalogs: Catalog is not configured
   2021-06-09T23:25:40,093  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hadoop.HadoopTables: Table location loaded: 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/test1
   2021-06-09T23:25:40,093  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hive.HiveIcebergSerDe: Using schema from existing table 
{"type":"struct","fields":[{"id":1,"name":"id","required":true,"type":"int"},{"id":2,"name":"name","required":false,"type":"string"},{"id":3,"name":"major","required":false,"type":"string"},{"id":4,"name":"grade","required":false,"type":"string"}]}
   2021-06-09T23:25:40,094  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
mr.Catalogs: Catalog is not configured
   2021-06-09T23:25:40,327  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hadoop.HadoopTables: Table location loaded: 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/test1
   2021-06-09T23:25:40,327  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hive.HiveIcebergSerDe: Using schema from existing table 
{"type":"struct","fields":[{"id":1,"name":"id","required":true,"type":"int"},{"id":2,"name":"name","required":false,"type":"string"},{"id":3,"name":"major","required":false,"type":"string"},{"id":4,"name":"grade","required":false,"type":"string"}]}
   2021-06-09T23:25:40,328  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
mr.Catalogs: Catalog is not configured
   2021-06-09T23:25:40,528  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hadoop.HadoopTables: Table location loaded: 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/test1
   2021-06-09T23:25:40,528  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
hive.HiveIcebergSerDe: Using schema from existing table 
{"type":"struct","fields":[{"id":1,"name":"id","required":true,"type":"int"},{"id":2,"name":"name","required":false,"type":"string"},{"id":3,"name":"major","required":false,"type":"string"},{"id":4,"name":"grade","required":false,"type":"string"}]}
   2021-06-09T23:25:40,597  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Starting task [Stage-1:DDL] in serial mode
   2021-06-09T23:25:40,604  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Launching Job 1 out of 1
   2021-06-09T23:25:40,604  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Starting task [Stage-2:MAPRED] in serial mode
   2021-06-09T23:25:40,686  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Context: New scratch dir is 
hdfs://shenme-cluster-3-m/tmp/hive/shenme_google_com/521d08fa-9fda-4ca3-8bab-3d0b60ddc6de/hive_2021-06-09_23-25-37_866_2611149539519878546-1
   2021-06-09T23:25:40,691  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
tez.TezSessionPoolManager: The current user: shenme_google_com, session user: 
shenme_google_com
   2021-06-09T23:25:40,691  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
tez.TezSessionPoolManager: Current queue name is null incoming queue name is 
null
   2021-06-09T23:25:40,691  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
exec.Task: Subscribed to counters: [] for queryId: 
shenme_google_com_20210609232537_9d2ce449-4ebc-403b-8f74-8b08ba9f62d1
   2021-06-09T23:25:40,691  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
exec.Task: Session is already open
   2021-06-09T23:25:40,694  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
tez.DagUtils: Resource modification time: 1623281030625 for 
hdfs://shenme-cluster-3-m/tmp/hive/shenme_google_com/_tez_session_dir/521d08fa-9fda-4ca3-8bab-3d0b60ddc6de-resources/iceberg-hive-runtime.jar
   2021-06-09T23:25:40,694  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
exec.Task: Dag name: INSERT INTO TABLE hadoop_table_1..."Senior") (Stage-2)
   2021-06-09T23:25:40,698  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Context: New scratch dir is 
hdfs://shenme-cluster-3-m/tmp/hive/shenme_google_com/521d08fa-9fda-4ca3-8bab-3d0b60ddc6de/hive_2021-06-09_23-25-37_866_2611149539519878546-1
   2021-06-09T23:25:40,758  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
tez.DagUtils: Vertex has custom input? false
   2021-06-09T23:25:40,759  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
exec.SerializationUtilities: Serializing MapWork using kryo
   2021-06-09T23:25:40,789  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
exec.Utilities: Serialized plan (via RPC) - name: Map 1 size: 301.80KB
   2021-06-09T23:25:40,891  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
client.TezClient: Submitting dag to TezSession, 
sessionName=HIVE-521d08fa-9fda-4ca3-8bab-3d0b60ddc6de, 
applicationId=application_1622522135523_0073, dagName=INSERT INTO TABLE 
hadoop_table_1..."Senior") (Stage-2), callerContext={ context=HIVE, 
callerType=HIVE_QUERY_ID, 
callerId=shenme_google_com_20210609232537_9d2ce449-4ebc-403b-8f74-8b08ba9f62d1 }
   2021-06-09T23:25:40,991  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
client.TezClient: Submitted dag to TezSession, 
sessionName=HIVE-521d08fa-9fda-4ca3-8bab-3d0b60ddc6de, 
applicationId=application_1622522135523_0073, dagId=dag_1622522135523_0073_2, 
dagName=INSERT INTO TABLE hadoop_table_1..."Senior") (Stage-2)
   2021-06-09T23:25:41,513  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
SessionState: Status: Running (Executing on YARN cluster with App id 
application_1622522135523_0073)
   
   2021-06-09T23:25:41,515  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
SessionState: Map 1: 0/1
   2021-06-09T23:25:44,542  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
SessionState: Map 1: 0(+1)/1
   2021-06-09T23:25:47,576  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
SessionState: Map 1: 0(+1)/1
   2021-06-09T23:25:48,220  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
SessionState: Map 1: 1/1
   2021-06-09T23:25:48,223  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Starting task [Stage-4:DDL] in serial mode
   2021-06-09T23:25:48,224  INFO [521d08fa-9fda-4ca3-8bab-3d0b60ddc6de main] 
ql.Driver: Completed executing 
command(queryId=shenme_google_com_20210609232537_9d2ce449-4ebc-403b-8f74-8b08ba9f62d1);
 Time taken: 8.395 seconds
   ```
   2. Version
   
![image](https://user-images.githubusercontent.com/84943281/121573545-3c7fe400-c9da-11eb-9bad-5b20e070302b.png)
   
   3. Explain Plan
   ```
   hive> EXPLAIN INSERT INTO TABLE hadoop_table_1 VALUES (8, "Eve", "Zoology", 
"Senior");
   OK
   Plan optimized by CBO.
   
   Stage-0
     Alter Table Operator:
       old name:default.hadoop_table_1,type:drop props
   Stage-1
     Pre Insert operator:{"Pre-Insert task":{}}
   Stage-4
     Insert operator:{"Commit-Insert-Hook":{}}
       Stage-2
         Map 1
         File Output Operator [FS_4]
           table:{"name:":"default.hadoop_table_1"}
           Select Operator [SEL_3] (rows=1 width=8)
             Output:["_col0","_col1","_col2","_col3"]
             UDTF Operator [UDTF_2] (rows=1 width=48)
               function name:inline
               Select Operator [SEL_1] (rows=1 width=48)
                 Output:["_col0"]
                 TableScan [TS_0] (rows=1 width=10)
                   
_dummy_database@_dummy_table,_dummy_table,Tbl:COMPLETE,Col:COMPLETE
   
   Time taken: 1.592 seconds, Fetched: 22 row(s)
   ```
   4. Everytime when I insert the data, there'll be a _tmp file, but it's empty.
   ```
   gsutil ls -la 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/_tmp.test1/
   
   gsutil cat 
gs://gcs-bucket-shenme-test-service-7f1dbd01-5cc9-4393-a54a-e9c2b9c9/hive-warehouse/_tmp.test1/#1623347746784286
   ```
   
![image](https://user-images.githubusercontent.com/84943281/121574915-98973800-c9db-11eb-9eec-6aba62c10479.png)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to