[ https://issues.apache.org/jira/browse/HIVE-21305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16802192#comment-16802192 ]
Prasanth Jayachandran commented on HIVE-21305:
----------------------------------------------

[~gopalv] LLAP sets ROWS_EMITTED as the VRB batch size, whereas Tez counts a VRB as 1 record, hence the difference.

[~rajesh.balamohan] This disables read-through cache behavior for ETL queries that read from a text table and write elsewhere (which could be a temp table as well).

> LLAP: Option to skip cache for ETL queries
> ------------------------------------------
>
>                 Key: HIVE-21305
>                 URL: https://issues.apache.org/jira/browse/HIVE-21305
>             Project: Hive
>          Issue Type: Improvement
>          Components: llap
>    Affects Versions: 4.0.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>            Priority: Major
>         Attachments: HIVE-21305.1.patch, HIVE-21305.2.patch, HIVE-21305.3.patch
>
>
> To keep ETL queries from polluting the cache, it would be good to detect such
> queries at compile time and optionally skip LLAP IO for them.
> org.apache.hadoop.hive.ql.parse.QBParseInfo.hasInsertTables() is the simplest
> way to catch ETL queries.
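
For illustration only, the compile-time check proposed in the issue description might look roughly like the sketch below. It is not the code from the attached patches. The ParseInfo interface is a hypothetical stand-in that mimics only the hasInsertTables() call named in the description, and the skipCacheForEtl flag and the useLlapIo method are assumed names, not Hive APIs or configuration keys.

```java
public class EtlCacheSkipSketch {

    // Hypothetical stand-in for the compiler's parse info. The issue names
    // QBParseInfo.hasInsertTables(); this interface only mimics that one call.
    public interface ParseInfo {
        boolean hasInsertTables();
    }

    // Returns true if the query should read through LLAP IO (and its cache).
    // skipCacheForEtl is an assumed opt-in flag, not a real Hive setting.
    public static boolean useLlapIo(ParseInfo info, boolean skipCacheForEtl) {
        // A query with insert targets is treated as ETL; when the option
        // is on, it bypasses LLAP IO so it does not pollute the cache.
        boolean isEtl = info.hasInsertTables();
        return !(skipCacheForEtl && isEtl);
    }

    public static void main(String[] args) {
        ParseInfo selectOnly = () -> false;  // e.g. a plain SELECT
        ParseInfo insertQuery = () -> true;  // e.g. INSERT ... SELECT

        System.out.println("select, option on:  " + useLlapIo(selectOnly, true));
        System.out.println("insert, option on:  " + useLlapIo(insertQuery, true));
        System.out.println("insert, option off: " + useLlapIo(insertQuery, false));
    }
}
```

With the option off, every query keeps the existing read-through behavior, which matches the description's wording that the skip is optional.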