[ https://issues.apache.org/jira/browse/HIVE-4727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ashutosh Chauhan updated HIVE-4727:
-----------------------------------
       Resolution: Fixed
    Fix Version/s: vectorization-branch
           Status: Resolved  (was: Patch Available)

Committed to branch. Thanks, Sarvesh!

> Optimize ORC StringTreeReader::nextVector to not create dictionary of strings for each call to nextVector
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-4727
>                 URL: https://issues.apache.org/jira/browse/HIVE-4727
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Sarvesh Sakalanaga
>            Assignee: Sarvesh Sakalanaga
>            Priority: Minor
>             Fix For: vectorization-branch
>
>         Attachments: Hive-4727.0.patch
>
>
> Currently, ORC StringTreeReader::nextVector creates a dictionary of strings on
> every call to nextVector. This hurts performance because each call allocates
> and then discards a large amount of memory. Since the dictionary does not
> change within a stripe, StringTreeReader::nextVector should be optimized to
> create this dictionary only once, when the stripe is read.
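
The optimization described in the issue can be illustrated with a minimal Java sketch. It is not the code from Hive-4727.0.patch and it does not use Hive's real classes: CachedDictionaryReader, startStripe, nextVector(int[]) and getDictionaryBuilds are hypothetical names chosen for illustration, and the dictionary is assumed to arrive as an already-decoded String array. The point is only that the dictionary is materialized once per stripe and each batch then refers to the cached entries instead of rebuilding them.

```java
import java.nio.charset.StandardCharsets;

/**
 * Hypothetical sketch of per-stripe dictionary caching.
 * This is NOT Hive's StringTreeReader; all names here are assumptions.
 */
class CachedDictionaryReader {
    private byte[][] dictionary;       // built once per stripe, reused by every batch
    private int dictionaryBuilds = 0;  // illustration only: counts how often we rebuild

    /** Called once when a new stripe is opened; this is the only place that allocates the dictionary. */
    void startStripe(String[] stripeDictionary) {
        dictionary = new byte[stripeDictionary.length][];
        for (int i = 0; i < stripeDictionary.length; i++) {
            dictionary[i] = stripeDictionary[i].getBytes(StandardCharsets.UTF_8);
        }
        dictionaryBuilds++;
    }

    /** Fills one batch by pointing at cached dictionary entries; no per-call dictionary rebuild. */
    byte[][] nextVector(int[] dictionaryIds) {
        if (dictionary == null) {
            throw new IllegalStateException("startStripe must be called before nextVector");
        }
        byte[][] batch = new byte[dictionaryIds.length][];
        for (int i = 0; i < dictionaryIds.length; i++) {
            batch[i] = dictionary[dictionaryIds[i]]; // shares the cached array, no copy
        }
        return batch;
    }

    int getDictionaryBuilds() {
        return dictionaryBuilds;
    }
}
```

In this sketch, calling nextVector many times within a stripe leaves getDictionaryBuilds() at 1; only the next startStripe call allocates a new dictionary. Sharing the cached arrays assumes that consumers treat batch contents as read-only.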