[ https://issues.apache.org/jira/browse/HIVE-25453?focusedWorklogId=645256&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-645256 ]
ASF GitHub Bot logged work on HIVE-25453:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 01/Sep/21 13:03
            Start Date: 01/Sep/21 13:03
    Worklog Time Spent: 10m
      Work Description: szlta commented on a change in pull request #2586:
URL: https://github.com/apache/hive/pull/2586#discussion_r700191437

##########
File path: llap-server/src/java/org/apache/hadoop/hive/llap/io/api/impl/LlapRecordReader.java
##########
@@ -158,8 +167,11 @@ private LlapRecordReader(MapWork mapWork, JobConf job, FileSplit split,
     rbCtx = ctx != null ? ctx : LlapInputFormat.createFakeVrbCtx(mapWork);

     isAcidScan = AcidUtils.isFullAcidScan(jobConf);
-    TypeDescription schema = OrcInputFormat.getDesiredRowTypeDescr(
-        job, isAcidScan, Integer.MAX_VALUE);
+
+    String icebergOrcSchema = job.get(ColumnProjectionUtils.ICEBERG_ORC_SCHEMA_STRING);

Review comment:
Yeah, it's unfortunate. This is the equivalent of the non-LLAP but vectorized ORC case at
https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/orc/VectorizedOrcInputFormat.java#L80

Hive sets IOConstants.SCHEMA_EVOLUTION_COLUMNS and SCHEMA_EVOLUTION_COLUMNS_TYPES during compile and pushes them down to execution. These are simple string representations of Hive types, so they are not ORC-specific, and they relate to the LOGICAL schema.

Iceberg, in order to support broader schema evolution, produces the FILE schema based on file info and logical type info, and the result is an ORC TypeDescription instance. (See VectorizedReadUtils.handleIcebergProjection.)

I have found no easy way to transform this object back into Hive types; the conversion only exists in the other direction: OrcInputFormat.typeDescriptionsFromHiveTypeProperty()
Issue Time Tracking
-------------------

    Worklog Id:     (was: 645256)
    Time Spent: 2.5h  (was: 2h 20m)

> Add LLAP IO support for Iceberg ORC tables
> ------------------------------------------
>
>                 Key: HIVE-25453
>                 URL: https://issues.apache.org/jira/browse/HIVE-25453
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Ádám Szita
>            Assignee: Ádám Szita
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 2.5h
>  Remaining Estimate: 0h
>

--
This message was sent by Atlassian Jira
(v8.3.4#803005)