[ https://issues.apache.org/jira/browse/HIVE-15422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15747847#comment-15747847 ]
Hive QA commented on HIVE-15422:
--------------------------------

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12843175/HIVE-15422.3.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 12 failed/errored test(s), 10796 tests executed
*Failed tests:*
{noformat}
TestMiniSparkOnYarnCliDriver - did not produce a TEST-*.xml file (likely timed out) (batchId=161)
	[scriptfile1.q,vector_outer_join5.q,file_with_header_footer.q,bucket4.q,input16_cc.q,bucket5.q,infer_bucket_sort_merge.q,constprog_partitioner.q,orc_merge2.q,reduce_deduplicate.q,schemeAuthority2.q,load_fs2.q,orc_merge8.q,orc_merge_incompat2.q,infer_bucket_sort_bucketed_table.q,vector_outer_join4.q,disable_merge_for_bucketing.q,vector_inner_join.q,orc_merge7.q]
TestVectorizedColumnReaderBase - did not produce a TEST-*.xml file (likely timed out) (batchId=251)
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[auto_sortmerge_join_2] (batchId=44)
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[sample2] (batchId=5)
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[sample4] (batchId=15)
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[sample6] (batchId=61)
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[sample7] (batchId=60)
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[sample9] (batchId=38)
org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[transform_ppr2] (batchId=135)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[metadataonly1] (batchId=150)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[stats_based_fetch_decision] (batchId=151)
org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver[explainanalyze_4] (batchId=93)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/2572/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/2572/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-2572/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 12 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12843175 - PreCommit-HIVE-Build

> HiveInputFormat::pushProjectionsAndFilters paths comparison generates huge
> number of objects for partitioned dataset
> --------------------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-15422
>                 URL: https://issues.apache.org/jira/browse/HIVE-15422
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Rajesh Balamohan
>            Assignee: Rajesh Balamohan
>            Priority: Minor
>         Attachments: HIVE-15422.1.patch, HIVE-15422.2.patch,
> HIVE-15422.3.patch, Profiler_Snapshot_HIVE-15422.png
>
>
> When executing the following query in LLAP (single instance) in a 5 node
> cluster, lots of GC pressure was observed.
> {noformat}
> select a.type, a.city , a.frequency, b.city, b.country, b.lat, b.lon
> from (select 'depart' as type, origin as city, count(origin) as frequency
> from flights
> group by origin
> order by frequency desc, type) as a
> left join airports as b on a.city = b.iata
> order by frequency desc;
> {noformat}
> Flights table has got around 7000+ partitions in S3. Profiling revealed large
> amount of objects created just in path comparisons in HiveInputFormat.
> HIVE-15405 reduces number of path comparisons at FileUtils, but it still ends
> up doing lots of comparisons in HiveInputFormat::pushProjectionsAndFilters.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)