sidnakoppa commented on issue #652: Reading Merge_on_read table| Failing SchemaParseException: Empty name
URL: https://github.com/apache/incubator-hudi/issues/652#issuecomment-486529691

Thanks for the quick response. I wrote the data as follows:

> s.write.format("com.uber.hoodie")
> .option(HoodieWriteConfig.TABLE_NAME,"emp_mor")
> .option(DataSourceWriteOptions.STORAGE_TYPE_OPT_KEY,"MERGE_ON_READ")
> .option(DataSourceWriteOptions.RECORDKEY_FIELD_OPT_KEY,"emp_id")
> .option("hoodie.upsert.shuffle.parallelism",4)
> .mode(SaveMode.Append)
> .save("/apps/hive/warehouse/emp_mor")

I then ran the sync tool (HiveSyncTool) with the parameters below:

> java -cp /home/designer/hoodie/hoodie-hive-bundle-0.4.5-SNAPSHOT.jar::runsyn/* com.uber.hoodie.hive.HiveSyncTool
> --user root --database default --jdbc-url jdbc:hive2://10.20.0.10:10000 --pass root
> --table emp_mor --base-path hdfs://nn10.htrunk.com/apps/hive/warehouse/emp_mor/

**Output**

```
Trying to sync hoodie table emp_mor with base path /apps/hive/warehouse/emp_mor/ of type MERGE_ON_READ
Found the last compaction commit as Optional.empty
Found the last delta commit Optional[[20190424193634__deltacommit__COMPLETED]]
- Table emp_mor is not found. Creating it
```

The table was created as _emp_mor_ (definition below), but I was expecting it to be created as **emp_mor_rt**:

> CREATE EXTERNAL TABLE `emp_mor`(
> `_hoodie_commit_time` string COMMENT '',
> `_hoodie_commit_seqno` string COMMENT '',
> `_hoodie_record_key` string COMMENT '',
> `_hoodie_partition_path` string COMMENT '',
> `_hoodie_file_name` string COMMENT '',
> `emp_id` int COMMENT '',
> `emp_name` string COMMENT '',
> `ts` timestamp COMMENT '')
> ROW FORMAT SERDE
> 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe'
> STORED AS INPUTFORMAT
> 'com.uber.hoodie.hadoop.HoodieInputFormat'
> OUTPUTFORMAT
> 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
> LOCATION
> 'hdfs://nn10.htrunk.com:8020/apps/hive/warehouse/emp_mor/default'

So I created the realtime table in Hive as shown below, but reading it fails with `SchemaParseException: Empty name`.

```
CREATE EXTERNAL TABLE `emp_mor_rt`(
`_hoodie_commit_time` string COMMENT '',
`_hoodie_commit_seqno` string COMMENT '',
`_hoodie_record_key` string COMMENT '',
`_hoodie_partition_path` string COMMENT '',
`_hoodie_file_name` string COMMENT '',
`emp_id` int COMMENT '',
`emp_name` string COMMENT '',
`ts` timestamp COMMENT '')
ROW FORMAT SERDE
'com.uber.hoodie.hadoop.realtime.HoodieParquetSerde'
STORED AS INPUTFORMAT
'com.uber.hoodie.hadoop.realtime.HoodieRealtimeInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
LOCATION
'hdfs://nn10.htrunk.com:8020/apps/hive/warehouse/emp_mor/default'
```
