[ https://issues.apache.org/jira/browse/HIVE-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ashutosh Chauhan updated HIVE-2780: ----------------------------------- Status: Open (was: Patch Available) My manually conflict-resolved patch resulted in failure in split_sample.q {code} [junit] java.lang.NullPointerException [junit] at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat$DefaultPercentSampler.sampling(CombineHiveInputFormat.java:596) [junit] at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.sampling(CombineHiveInputFormat.java:496) [junit] at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.sampleSplits(CombineHiveInputFormat.java:477) [junit] at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.getSplits(CombineHiveInputFormat.java:403) [junit] at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810) [junit] at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781) [junit] at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730) [junit] at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:448) [junit] at org.apache.hadoop.hive.ql.exec.ExecDriver.main(ExecDriver.java:682) [junit] at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) [junit] at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) [junit] at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) [junit] at java.lang.reflect.Method.invoke(Method.java:597) [junit] at org.apache.hadoop.util.RunJar.main(RunJar.java:156) [junit] Job Submission failed with exception 'java.lang.NullPointerException(null)' {code} Either my resolution wasn't correct or trunk has moved significantly. Navis, if you rebase the patch, I will take a look at this one quickly so that it doesnt go stale again. > Implement more restrictive table sampler > ---------------------------------------- > > Key: HIVE-2780 > URL: https://issues.apache.org/jira/browse/HIVE-2780 > Project: Hive > Issue Type: Improvement > Reporter: Navis > Assignee: Navis > Priority: Trivial > Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.1.patch, > ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.2.patch > > > Current table sampling scans whole block, making more rows included than > expected especially for small tables. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira