Leoyzen opened a new issue, #7824:
URL: https://github.com/apache/hudi/issues/7824

   **Describe the problem you faced**
   When enabling metadata on a table that previously had no metadata table, an NPE occurs.
   
   **To Reproduce**
   
   Steps to reproduce the behavior:
   
   1. Set up a table with metadata disabled.
   2. Restart the job with the metadata option enabled.
   3. An NPE occurs.
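
The log under **Stacktrace** warns that the metadata table cannot be initialized while operations are in progress, and the NPE surfaces while an inflight compaction is being rolled back. To check whether another table is in the same state before step 2, the pending instants can be pulled out of such a log. The following is a minimal Python sketch, assuming the instant format printed in this report (`[==>TIME__action__STATE]`). The helper names `pending_instants` and `summarize` are hypothetical and not part of Hudi.

```python
import re
from collections import Counter

# Format of a pending instant as printed in the log of this report,
# e.g. "[==>20230131214246466__compaction__INFLIGHT]" (assumed stable).
INSTANT = re.compile(r"\[==>(\d+)__([a-z]+)__(REQUESTED|INFLIGHT)\]")


def pending_instants(log_text):
    """Return (instant_time, action, state) tuples in order of appearance."""
    return INSTANT.findall(log_text)


def summarize(instants):
    """Count pending instants per (action, state) pair."""
    return Counter((action, state) for _, action, state in instants)


# Shortened excerpt of the warning quoted in this report.
sample = (
    "Cannot initialize metadata table as operation(s) are in progress "
    "on the dataset: "
    "[[==>20230131214246466__compaction__INFLIGHT], "
    "[==>20230201001718618__rollback__INFLIGHT], "
    "[==>20230201003302911__deltacommit__INFLIGHT]]"
)

instants = pending_instants(sample)
for instant_time, action, state in instants:
    print(instant_time, action, state)
```

An empty result only means that no pending instant was printed in the given text. It does not prove that the timeline is clean.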
   
   **Expected behavior**
   
   No NPE
   
   **Environment Description**
   
   * Hudi version : 0.13.0-rc1
   
   * Spark version : N/A
   
   * Hive version : 3.1.2
   
   * Hadoop version : 3.1.3
   
   * Storage (HDFS/S3/GCS..) : OSS
   
   * Running on Docker? (yes/no) : yes, HA VVP
   
   
   **Additional context**
   
   
   **Stacktrace**
   
   ```LOG
   2023-02-01 00:36:05,435 WARN  
org.apache.hudi.metadata.HoodieBackedTableMetadata           [] - Metadata 
table was not found at path 
oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test/.hoodie/metadata
   2023-02-01 00:36:05,436 INFO  
org.apache.hudi.common.table.view.FileSystemViewManager      [] - Creating View 
Manager with storage type :REMOTE_FIRST
   2023-02-01 00:36:05,436 INFO  
org.apache.hudi.common.table.view.FileSystemViewManager      [] - Creating 
remote first table view
   2023-02-01 00:36:05,438 INFO  
org.apache.hudi.client.transaction.lock.LockManager          [] - LockProvider 
org.apache.hudi.client.transaction.lock.FileSystemBasedLockProvider
   2023-02-01 00:36:05,618 INFO  
org.apache.hudi.common.table.HoodieTableMetaClient           [] - Loading 
HoodieTableMetaClient from 
oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test
   2023-02-01 00:36:05,635 INFO  org.apache.hudi.common.table.HoodieTableConfig 
              [] - Loading table properties from 
oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test/.hoodie/hoodie.properties
   2023-02-01 00:36:05,647 INFO  
org.apache.hudi.common.table.HoodieTableMetaClient           [] - Finished 
Loading Table of type MERGE_ON_READ(version=1, baseFileFormat=PARQUET) from 
oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test
   2023-02-01 00:36:05,776 INFO  
org.apache.hudi.common.table.timeline.HoodieActiveTimeline   [] - Loaded 
instants upto : Option{val=[==>20230201003302911__deltacommit__INFLIGHT]}
   2023-02-01 00:36:05,778 WARN  
org.apache.hudi.metadata.HoodieBackedTableMetadataWriter     [] - Cannot 
initialize metadata table as operation(s) are in progress on the dataset: 
[[==>20230131214246466__compaction__INFLIGHT], 
[==>20230131214554966__compaction__REQUESTED], 
[==>20230131215522662__compaction__REQUESTED], 
[==>20230131220449842__compaction__REQUESTED], 
[==>20230201000246991__compaction__REQUESTED], 
[==>20230201001718618__rollback__INFLIGHT], 
[==>20230201003302911__deltacommit__INFLIGHT]]
   2023-02-01 00:36:05,778 INFO  
org.apache.hudi.common.table.HoodieTableMetaClient           [] - Loading 
HoodieTableMetaClient from 
oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test
   2023-02-01 00:36:05,789 INFO  org.apache.hudi.common.table.HoodieTableConfig 
              [] - Loading table properties from 
oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test/.hoodie/hoodie.properties
   2023-02-01 00:36:05,798 INFO  
org.apache.hudi.common.table.HoodieTableMetaClient           [] - Finished 
Loading Table of type MERGE_ON_READ(version=1, baseFileFormat=PARQUET) from 
oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test
   2023-02-01 00:36:05,799 INFO  
org.apache.hudi.common.table.HoodieTableMetaClient           [] - Loading 
HoodieTableMetaClient from 
oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test/.hoodie/metadata
   2023-02-01 00:36:05,814 WARN  
org.apache.hudi.metadata.HoodieBackedTableMetadata           [] - Metadata 
table was not found at path 
oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test/.hoodie/metadata
   2023-02-01 00:36:05,938 INFO  org.apache.hadoop.hive.conf.HiveConf           
              [] - Found configuration file 
jar:file:../usrlib/ververica-connector-hudi-1.15-vvr-6.0-hive312-0.13.0-rc1-SNAPSHOT-jar-with-dependencies-20230131234928.jar!/hive-site.xml
   2023-02-01 00:36:06,099 WARN  org.apache.hadoop.hive.conf.HiveConf           
              [] - HiveConf of name 
hive.dummyparam.test.server.specific.config.override does not exist
   2023-02-01 00:36:06,100 WARN  org.apache.hadoop.hive.conf.HiveConf           
              [] - HiveConf of name 
hive.dummyparam.test.server.specific.config.metastoresite does not exist
   
   
   ...........
   
   
   
   
   
   2023-02-01 00:36:24,740 INFO  
org.apache.hudi.sink.StreamWriteOperatorCoordinator          [] - Executor 
executes action [handle write metadata event for instant ] success!
   2023-02-01 00:36:24,800 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Sink: 
compact_commit (1/1) #0 (b732feaa948ea68f9bf1c0df9689f8f4) switched from 
INITIALIZING to RUNNING.
   2023-02-01 00:36:24,886 INFO  org.apache.hudi.client.BaseHoodieWriteClient   
              [] - Generate a new instant time: 20230201003624886 action: 
deltacommit
   2023-02-01 00:36:25,295 INFO  
org.apache.hudi.common.table.timeline.HoodieActiveTimeline   [] - Loaded 
instants upto : Option{val=[==>20230201003624920__rollback__REQUESTED]}
   2023-02-01 00:36:25,297 INFO  
org.apache.hudi.client.heartbeat.HoodieHeartbeatClient       [] - Received 
request to start heartbeat for instant time 20230201003624886
   2023-02-01 00:36:25,329 INFO  
org.apache.hudi.common.table.timeline.HoodieActiveTimeline   [] - Creating a 
new instant [==>20230201003624886__deltacommit__REQUESTED]
   2023-02-01 00:36:25,364 INFO  
org.apache.hudi.common.table.timeline.HoodieActiveTimeline   [] - Checking for 
file exists 
?oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test/.hoodie/20230201003624886.deltacommit.requested
   2023-02-01 00:36:25,394 INFO  
org.apache.hudi.common.table.timeline.HoodieActiveTimeline   [] - Create new 
file for toInstant 
?oss://dengine-lake-zjk/cloudcode_pre/tmp_hudi_hive_test/.hoodie/20230201003624886.deltacommit.inflight
   2023-02-01 00:36:25,435 INFO  
org.apache.hudi.sink.StreamWriteOperatorCoordinator          [] - Create 
instant [20230201003624886] for table [tbl_resp_sink] with type [MERGE_ON_READ]
   2023-02-01 00:36:25,439 INFO  
org.apache.hudi.sink.StreamWriteOperatorCoordinator          [] - Executor 
executes action [initialize instant ] success!
   2023-02-01 00:36:25,746 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - 
compact_plan_generate (1/1) #0 (06c8d064a1b42253bf7dba8786f881a0) switched from 
INITIALIZING to FAILED on 
job-84b59a23-8dfd-41e9-968d-e83d5280df5d-taskmanager-1-1 @ 
bd-flink011088139163.na63.tbsite.net (dataPort=23011).
   java.lang.NullPointerException: null
        at 
org.apache.hudi.metadata.HoodieBackedTableMetadataWriter.update(HoodieBackedTableMetadataWriter.java:933)
 ~[?:?]
        at 
org.apache.hudi.table.action.BaseActionExecutor.lambda$writeTableMetadata$2(BaseActionExecutor.java:76)
 ~[?:?]
        at org.apache.hudi.common.util.Option.ifPresent(Option.java:97) ~[?:?]
        at 
org.apache.hudi.table.action.BaseActionExecutor.writeTableMetadata(BaseActionExecutor.java:76)
 ~[?:?]
        at 
org.apache.hudi.table.action.rollback.BaseRollbackActionExecutor.finishRollback(BaseRollbackActionExecutor.java:246)
 ~[?:?]
        at 
org.apache.hudi.table.action.rollback.BaseRollbackActionExecutor.runRollback(BaseRollbackActionExecutor.java:114)
 ~[?:?]
        at 
org.apache.hudi.table.action.rollback.BaseRollbackActionExecutor.execute(BaseRollbackActionExecutor.java:135)
 ~[?:?]
        at 
org.apache.hudi.table.HoodieFlinkMergeOnReadTable.rollback(HoodieFlinkMergeOnReadTable.java:128)
 ~[?:?]
        at 
org.apache.hudi.table.HoodieTable.rollbackInflightInstant(HoodieTable.java:626) 
~[?:?]
        at 
org.apache.hudi.table.HoodieTable.rollbackInflightCompaction(HoodieTable.java:600)
 ~[?:?]
        at 
org.apache.hudi.table.HoodieTable.rollbackInflightCompaction(HoodieTable.java:584)
 ~[?:?]
        at 
org.apache.hudi.util.CompactionUtil.lambda$rollbackCompaction$1(CompactionUtil.java:190)
 ~[?:?]
        at java.util.ArrayList.forEach(ArrayList.java:1249) ~[?:1.8.0_102]
        at 
org.apache.hudi.util.CompactionUtil.rollbackCompaction(CompactionUtil.java:188) 
~[?:?]
        at 
org.apache.hudi.sink.compact.CompactionPlanOperator.open(CompactionPlanOperator.java:75)
 ~[?:?]
        at 
org.apache.flink.streaming.runtime.tasks.RegularOperatorChain.initializeStateAndOpenOperators(RegularOperatorChain.java:107)
 ~[flink-dist-1.15-vvr-6.0.4-SNAPSHOT.jar:1.15-vvr-6.0.4-SNAPSHOT]
        at 
org.apache.flink.streaming.runtime.tasks.StreamTask.restoreGates(StreamTask.java:851)
 ~[flink-dist-1.15-vvr-6.0.4-SNAPSHOT.jar:1.15-vvr-6.0.4-SNAPSHOT]
        at 
org.apache.flink.streaming.runtime.tasks.StreamTaskActionExecutor$1.call(StreamTaskActionExecutor.java:55)
 ~[flink-dist-1.15-vvr-6.0.4-SNAPSHOT.jar:1.15-vvr-6.0.4-SNAPSHOT]
        at 
org.apache.flink.streaming.runtime.tasks.StreamTask.restoreInternal(StreamTask.java:798)
 ~[flink-dist-1.15-vvr-6.0.4-SNAPSHOT.jar:1.15-vvr-6.0.4-SNAPSHOT]
        at 
org.apache.flink.streaming.runtime.tasks.StreamTask.restore(StreamTask.java:765)
 ~[flink-dist-1.15-vvr-6.0.4-SNAPSHOT.jar:1.15-vvr-6.0.4-SNAPSHOT]
        at 
org.apache.flink.runtime.taskmanager.Task.runWithSystemExitMonitoring(Task.java:954)
 ~[flink-dist-1.15-vvr-6.0.4-SNAPSHOT.jar:1.15-vvr-6.0.4-SNAPSHOT]
        at 
org.apache.flink.runtime.taskmanager.Task.restoreAndInvoke(Task.java:923) 
~[flink-dist-1.15-vvr-6.0.4-SNAPSHOT.jar:1.15-vvr-6.0.4-SNAPSHOT]
        at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:746) 
~[flink-dist-1.15-vvr-6.0.4-SNAPSHOT.jar:1.15-vvr-6.0.4-SNAPSHOT]
        at org.apache.flink.runtime.taskmanager.Task.run(Task.java:568) 
~[flink-dist-1.15-vvr-6.0.4-SNAPSHOT.jar:1.15-vvr-6.0.4-SNAPSHOT]
        at java.lang.Thread.run(Thread.java:834) ~[?:1.8.0_102]
   2023-02-01 00:36:25,780 INFO  
org.apache.flink.runtime.executiongraph.failover.flip1.RestartPipelinedRegionFailoverStrategy
 [] - Calculating tasks to restart to recover the failed task 
d14c81370450f18851be93e3825a2d06_0.
   2023-02-01 00:36:25,785 INFO  org.apache.flink.runtime.jobmaster.JobMaster   
              [] - 14 tasks should be restarted to recover the failed task 
d14c81370450f18851be93e3825a2d06_0. 
   2023-02-01 00:36:25,785 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - Job 
1c7f0fe9-c059-491d-86eb-322554d24d0f (84b59a238dfd41e9968de83d5280df5d) 
switched from state RUNNING to RESTARTING.
   2023-02-01 00:36:25,787 WARN  
org.apache.hudi.sink.StreamWriteOperatorCoordinator          [] - Reset the 
event for task [2]
   2023-02-01 00:36:25,787 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - bucket_write: 
tbl_resp_sink (3/4) #0 (88f02c5d12f53da0b571729af1aae985) switched from RUNNING 
to CANCELING.
   2023-02-01 00:36:25,788 WARN  
org.apache.hudi.sink.StreamWriteOperatorCoordinator          [] - Reset the 
event for task [3]
   2023-02-01 00:36:25,788 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - bucket_write: 
tbl_resp_sink (4/4) #0 (4101dbc91d55a0711db769e51450689c) switched from RUNNING 
to CANCELING.
   2023-02-01 00:36:25,788 WARN  
org.apache.hudi.sink.StreamWriteOperatorCoordinator          [] - Reset the 
event for task [0]
   2023-02-01 00:36:25,788 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph       [] - bucket_write: 
tbl_resp_sink (1/4) #0 (bdcc0b8232bc1939fd6d29256bab09c3) switched from RUNNING 
to CANCELING.
   2023-02-01 00:36:25,788 WARN  
org.apache.hudi.sink.StreamWriteOperatorCoordinator          [] - Reset the 
event for task [1]
   ```
   
   

