----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/24830/#review51419 -----------------------------------------------------------
ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java <https://reviews.apache.org/r/24830/#comment89734> There are other utility functions that extracts taskID/attemptID from file names. None of these methods throw exception if it could not find matches for the regex pattern. Example: getIdFromFilename() returns filename as Id if it cannot match pattern. I was also following the same convention. In this case, if there are no matches for copy file pattern it will return false and will fallback to old code path. The regex will still work if files are loaded using "LOAD DATA LOCAL INPATH" statement. With this statement, the file names will be like 1) filename.txt 2) filename_copy_1.txt 3) filename_copy_2.txt For this file pattern, there will be no match for taskId/attemptId extraction. Hence no files will be marked duplicate. We really don't have to worry about copy file names in this case as there will not be any duplicate elimination. ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcFileMergeMapper.java <https://reviews.apache.org/r/24830/#comment89735> Fixed it. - Prasanth_J On Aug. 19, 2014, 12:29 a.m., Prasanth_J wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/24830/ > ----------------------------------------------------------- > > (Updated Aug. 19, 2014, 12:29 a.m.) > > > Review request for hive and Gunther Hagleitner. > > > Repository: hive-git > > > Description > ------- > > ORC fast merge (HIVE-7509) will fail the merge task in case if any of the > precondition checks fail. Precondition check fail is good for "ALTER TABLE .. > CONCATENATE" but not for automatic trigger of merge task from conditional > resolver. In case if a partition has non-compatible ORC files for merging > then the merge task should ignore it and not fail the task. > > > Diffs > ----- > > ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 1d6a93a > ql/src/java/org/apache/hadoop/hive/ql/io/merge/MergeMapper.java beb4f7d > ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcFileMergeMapper.java > b36152a > ql/src/test/queries/clientnegative/orc_merge1.q b2d42cd > ql/src/test/queries/clientnegative/orc_merge2.q 2f62ee7 > ql/src/test/queries/clientnegative/orc_merge3.q 5158e2e > ql/src/test/queries/clientnegative/orc_merge4.q ad48572 > ql/src/test/queries/clientnegative/orc_merge5.q e94a8cc > ql/src/test/queries/clientpositive/orc_merge_incompat1.q PRE-CREATION > ql/src/test/queries/clientpositive/orc_merge_incompat2.q PRE-CREATION > ql/src/test/results/clientpositive/orc_merge_incompat1.q.out PRE-CREATION > ql/src/test/results/clientpositive/orc_merge_incompat2.q.out PRE-CREATION > > Diff: https://reviews.apache.org/r/24830/diff/ > > > Testing > ------- > > > Thanks, > > Prasanth_J > >