[ https://issues.apache.org/jira/browse/HIVE-23871?focusedWorklogId=460455&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-460455 ]
ASF GitHub Bot logged work on HIVE-23871:
-----------------------------------------

Author: ASF GitHub Bot
Created on: 17/Jul/20 18:23
Start Date: 17/Jul/20 18:23
Worklog Time Spent: 10m
Work Description: pgaref commented on a change in pull request #1273:
URL: https://github.com/apache/hive/pull/1273#discussion_r456604112

##########
File path: ql/src/test/results/clientpositive/llap/load_micromanaged_delim.q.out
##########
@@ -0,0 +1,186 @@
+#### A masked pattern was here ####
+PREHOOK: type: CREATETABLE
+#### A masked pattern was here ####
+PREHOOK: Output: database:default
+PREHOOK: Output: default@delim_table_ext
+#### A masked pattern was here ####
+POSTHOOK: type: CREATETABLE
+#### A masked pattern was here ####
+POSTHOOK: Output: database:default
+POSTHOOK: Output: default@delim_table_ext
+PREHOOK: query: describe formatted delim_table_ext
+PREHOOK: type: DESCTABLE
+PREHOOK: Input: default@delim_table_ext
+POSTHOOK: query: describe formatted delim_table_ext
+POSTHOOK: type: DESCTABLE
+POSTHOOK: Input: default@delim_table_ext
+# col_name data_type comment
+id int
+name string
+safety int
+
+# Detailed Table Information
+Database: default
+#### A masked pattern was here ####
+Retention: 0
+#### A masked pattern was here ####
+Table Type: EXTERNAL_TABLE
+Table Parameters:
+ EXTERNAL TRUE
+ bucketing_version 2
+ numFiles 1
+ totalSize 52
+#### A masked pattern was here ####
+
+# Storage Information
+SerDe Library: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
+InputFormat: org.apache.hadoop.mapred.TextInputFormat
+OutputFormat: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
+Compressed: No
+Num Buckets: -1
+Bucket Columns: []
+Sort Columns: []
+Storage Desc Params:
+ field.delim \t
+ serialization.format \t
+PREHOOK: query: SELECT * FROM delim_table_ext
+PREHOOK: type: QUERY
+PREHOOK: Input: default@delim_table_ext
+#### A masked pattern was here ####
+POSTHOOK: query: SELECT * FROM delim_table_ext
+POSTHOOK: type: QUERY
+POSTHOOK: Input: default@delim_table_ext
+#### A masked pattern was here ####
+1 Acura 4
+2 Toyota 3
+3 Tesla 5
+4 Honda 5
+11 Mazda 2
+PREHOOK: query: CREATE TABLE delim_table_micro(id INT, name STRING, safety INT) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS TEXTFILE TBLPROPERTIES('transactional'='true', "transactional_properties"="insert_only")
+PREHOOK: type: CREATETABLE
+PREHOOK: Output: database:default
+PREHOOK: Output: default@delim_table_micro
+POSTHOOK: query: CREATE TABLE delim_table_micro(id INT, name STRING, safety INT) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS TEXTFILE TBLPROPERTIES('transactional'='true', "transactional_properties"="insert_only")
+POSTHOOK: type: CREATETABLE
+POSTHOOK: Output: database:default
+POSTHOOK: Output: default@delim_table_micro
+#### A masked pattern was here ####
+PREHOOK: type: LOAD
+#### A masked pattern was here ####
+PREHOOK: Output: default@delim_table_micro
+#### A masked pattern was here ####
+POSTHOOK: type: LOAD
+#### A masked pattern was here ####
+POSTHOOK: Output: default@delim_table_micro
+PREHOOK: query: describe formatted delim_table_micro
+PREHOOK: type: DESCTABLE
+PREHOOK: Input: default@delim_table_micro
+POSTHOOK: query: describe formatted delim_table_micro
+POSTHOOK: type: DESCTABLE
+POSTHOOK: Input: default@delim_table_micro
+# col_name data_type comment
+id int
+name string
+safety int
+
+# Detailed Table Information
+Database: default
+#### A masked pattern was here ####
+Retention: 0
+#### A masked pattern was here ####
+Table Type: MANAGED_TABLE
+Table Parameters:
+ bucketing_version 2
+ numFiles 1
+ numRows 0
+ rawDataSize 0
+ totalSize 52
+ transactional true
+ transactional_properties insert_only
+#### A masked pattern was here ####
+
+# Storage Information
+SerDe Library: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
+InputFormat: org.apache.hadoop.mapred.TextInputFormat
+OutputFormat: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
+Compressed: No
+Num Buckets: -1
+Bucket Columns: []
+Sort Columns: []
+PREHOOK: query: SELECT * FROM delim_table_micro
+PREHOOK: type: QUERY
+PREHOOK: Input: default@delim_table_micro
+#### A masked pattern was here ####
+POSTHOOK: query: SELECT * FROM delim_table_micro
+POSTHOOK: type: QUERY
+POSTHOOK: Input: default@delim_table_micro
+#### A masked pattern was here ####
+NULL NULL NULL
+NULL NULL NULL
+NULL NULL NULL
+NULL NULL NULL
+NULL NULL NULL
+PREHOOK: query: CREATE TRANSACTIONAL TABLE delim_table_trans(id INT, name STRING, safety INT) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS TEXTFILE
+PREHOOK: type: CREATETABLE
+PREHOOK: Output: database:default
+PREHOOK: Output: default@delim_table_trans
+POSTHOOK: query: CREATE TRANSACTIONAL TABLE delim_table_trans(id INT, name STRING, safety INT) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS TEXTFILE
+POSTHOOK: type: CREATETABLE
+POSTHOOK: Output: database:default
+POSTHOOK: Output: default@delim_table_trans
+#### A masked pattern was here ####
+PREHOOK: type: LOAD
+#### A masked pattern was here ####
+PREHOOK: Output: default@delim_table_trans
+#### A masked pattern was here ####
+POSTHOOK: type: LOAD
+#### A masked pattern was here ####
+POSTHOOK: Output: default@delim_table_trans
+PREHOOK: query: describe formatted delim_table_trans
+PREHOOK: type: DESCTABLE
+PREHOOK: Input: default@delim_table_trans
+POSTHOOK: query: describe formatted delim_table_trans
+POSTHOOK: type: DESCTABLE
+POSTHOOK: Input: default@delim_table_trans
+# col_name data_type comment
+id int
+name string
+safety int
+
+# Detailed Table Information
+Database: default
+#### A masked pattern was here ####
+Retention: 0
+#### A masked pattern was here ####
+Table Type: MANAGED_TABLE
+Table Parameters:
+ bucketing_version 2
+ numFiles 1
+ numRows 0
+ rawDataSize 0
+ totalSize 52
+ transactional true
+ transactional_properties insert_only
+#### A masked pattern was here ####
+
+# Storage Information
+SerDe Library: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
+InputFormat: org.apache.hadoop.mapred.TextInputFormat
+OutputFormat: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
+Compressed: No
+Num Buckets: -1
+Bucket Columns: []
+Sort Columns: []
+PREHOOK: query: SELECT * FROM delim_table_trans
+PREHOOK: type: QUERY
+PREHOOK: Input: default@delim_table_trans
+#### A masked pattern was here ####
+POSTHOOK: query: SELECT * FROM delim_table_trans
+POSTHOOK: type: QUERY
+POSTHOOK: Input: default@delim_table_trans
+#### A masked pattern was here ####
+NULL NULL NULL
+NULL NULL NULL
+NULL NULL NULL
+NULL NULL NULL
+NULL NULL NULL

Review comment:
This is exactly the behavior this patch fixes! Apparently I ran the test only before the changes -- this is now updated

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Issue Time Tracking
-------------------

Worklog Id: (was: 460455)
Time Spent: 40m (was: 0.5h)

> ObjectStore should properly handle MicroManaged Table properties
> ----------------------------------------------------------------
>
> Key: HIVE-23871
> URL: https://issues.apache.org/jira/browse/HIVE-23871
> Project: Hive
> Issue Type: Bug
> Components: Metastore
> Reporter: Panagiotis Garefalakis
> Assignee: Panagiotis Garefalakis
> Priority: Major
> Labels: pull-request-available
> Attachments: table1
>
> Time Spent: 40m
> Remaining Estimate: 0h
>
> HIVE-23281 optimizes StorageDescriptor conversion as part of the ObjectStore
> by skipping particular Table properties like SkewInfo, bucketCols, ordering
> etc.
> However, it does that for all Transactional Tables – not only ACID – causing
> MicroManaged Tables to behave abnormally.
> MicroManaged (insert_only) tables may miss needed properties such as Storage
> Desc Params – that may define how lines are delimited (like in the example
> below):
> To repro the issue:
> {code:java}
> CREATE TRANSACTIONAL TABLE delim_table_trans(id INT, name STRING, safety INT)
> ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS TEXTFILE;
> LOAD DATA INPATH 'table1' OVERWRITE INTO TABLE delim_table_trans;
> describe formatted delim_table_trans;
> SELECT * FROM delim_table_trans;
> {code}
> Result:
> {code:java}
> Table Type: MANAGED_TABLE
> Table Parameters:
> bucketing_version 2
> numFiles 1
> numRows 0
> rawDataSize 0
> totalSize 72
> transactional true
> transactional_properties insert_only
> #### A masked pattern was here ####
>
> # Storage Information
> SerDe Library: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
>
> InputFormat: org.apache.hadoop.mapred.TextInputFormat
> OutputFormat:
> org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
> Compressed: No
> Num Buckets: -1
> Bucket Columns: []
> Sort Columns: []
> PREHOOK: query: SELECT * FROM delim_table_trans
> PREHOOK: type: QUERY
> PREHOOK: Input: default@delim_table_trans
> #### A masked pattern was here ####
> POSTHOOK: query: SELECT * FROM delim_table_trans
> POSTHOOK: type: QUERY
> POSTHOOK: Input: default@delim_table_trans
> #### A masked pattern was here ####
> NULL NULL NULL
> NULL NULL NULL
> NULL NULL NULL
> NULL NULL NULL
> NULL NULL NULL
> NULL NULL NULL
> {code}

--
This message was sent by Atlassian Jira
(v8.3.4#803005)