[ https://issues.apache.org/jira/browse/HIVE-4727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sarvesh Sakalanaga updated HIVE-4727:
-------------------------------------
    Attachment: Hive-4727.0.patch

Patch available.

> Optimize ORC StringTreeReader::nextVector to not create dictionary of strings
> for each call to nextVector
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-4727
>                 URL: https://issues.apache.org/jira/browse/HIVE-4727
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Sarvesh Sakalanaga
>            Assignee: Sarvesh Sakalanaga
>            Priority: Minor
>         Attachments: Hive-4727.0.patch
>
>
> Currently, ORC StringTreeReader::nextVector creates a dictionary of strings on
> each call to nextVector. This hurts performance because every call incurs a
> large memory allocation and deallocation. Since the dictionary does not
> change within a stripe, StringTreeReader::nextVector should be optimized to
> create this dictionary only when a stripe is read.
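
The idea in the description can be illustrated with a minimal Java sketch. This is not the attached patch and not Hive's actual StringTreeReader; the class DictionaryReaderSketch, its methods startStripe and getDictionaryBuilds, and the simplified nextVector signature are all hypothetical names chosen for illustration. It assumes the dictionary can be decoded once when a stripe is opened and then shared by reference across batches.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical sketch only: illustrates caching a string dictionary per stripe.
// It does not mirror the real Hive/ORC reader classes or their signatures.
final class DictionaryReaderSketch {
    private byte[][] dictionary;       // decoded entries for the current stripe
    private int dictionaryBuilds = 0;  // counts how often the dictionary was rebuilt

    // Called once when a new stripe is opened: decode the dictionary here.
    void startStripe(String[] stripeEntries) {
        dictionary = new byte[stripeEntries.length][];
        for (int i = 0; i < stripeEntries.length; i++) {
            dictionary[i] = stripeEntries[i].getBytes(StandardCharsets.UTF_8);
        }
        dictionaryBuilds++;
    }

    // Called once per batch: only resolves ids against the cached dictionary.
    // Entries are shared by reference, so no per-batch string data is allocated.
    byte[][] nextVector(int[] ids) {
        if (dictionary == null) {
            throw new IllegalStateException("startStripe must be called first");
        }
        byte[][] batch = new byte[ids.length][];
        for (int i = 0; i < ids.length; i++) {
            batch[i] = dictionary[ids[i]];
        }
        return batch;
    }

    int getDictionaryBuilds() {
        return dictionaryBuilds;
    }

    public static void main(String[] args) {
        DictionaryReaderSketch reader = new DictionaryReaderSketch();
        reader.startStripe(new String[] {"apple", "banana", "cherry"});
        reader.nextVector(new int[] {0, 2, 2});
        reader.nextVector(new int[] {1, 0});
        // Two batches were read, but the dictionary was decoded only once.
        System.out.println("dictionary builds: " + reader.getDictionaryBuilds());
    }
}
```

In this sketch the per-batch cost is limited to one array of references, while the decoding cost is paid once per stripe. Whether the actual patch shares entries by reference or copies them into the output vector is not stated in this message.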