> On April 30, 2013, 8:07 p.m., Rohini Palaniswamy wrote: > > contrib/piggybank/java/src/main/java/org/apache/pig/piggybank/storage/avro/AvroStorageUtils.java, > > line 146 > > <https://reviews.apache.org/r/10351/diff/1/?file=279039#file279039line146> > > > > Doing a globStatus again on a known file (FileStatus) is inefficient. > > Better move this block to a separate method and use that for recursion
thanks for the pointer, it is much better! > On April 30, 2013, 8:07 p.m., Rohini Palaniswamy wrote: > > contrib/piggybank/java/src/main/java/org/apache/pig/piggybank/storage/avro/AvroStorageUtils.java, > > line 160 > > <https://reviews.apache.org/r/10351/diff/1/?file=279039#file279039line160> > > > > Pattern should be a private static variable. This pattern only takes > > into account globs of the form {x,y}. Hadoop glob status supports a lot > > more > > > > > > http://hadoop.apache.org/docs/current/api/org/apache/hadoop/fs/FileSystem.html#globStatus(org.apache.hadoop.fs.Path). > > > > > > Found this method in pig which would take care of the logic - > > LoadFunc.getPathStrings() . Use this for splitting paths. This should > > simplify the whole change > > > > > > Johnny Zhang wrote: > Thanks a lot for your comments, Rohini. I am going to update the revised > patch very soon today. just fixed those two in the revised patch - Johnny ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/10351/#review19974 ----------------------------------------------------------- On May 3, 2013, 12:27 a.m., Johnny Zhang wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/10351/ > ----------------------------------------------------------- > > (Updated May 3, 2013, 12:27 a.m.) > > > Review request for pig. > > > Description > ------- > > we want to support comma separated input paths in AvroStorage, for example > "test_dir1/test_glob1.avro,test_dir1/test_glob2.avro,test_dir1/test_glob3.avro" > "test_dir1/*, test_dir2/test_glob4.avro, test_dir2/test_glob5.avro" > > > This addresses bug PIG-3223. > https://issues.apache.org/jira/browse/PIG-3223 > > > Diffs > ----- > > > contrib/piggybank/java/src/main/java/org/apache/pig/piggybank/storage/avro/AvroStorageUtils.java > 0ac0225 > > contrib/piggybank/java/src/test/java/org/apache/pig/piggybank/test/storage/avro/TestAvroStorage.java > bd7a6d2 > > Diff: https://reviews.apache.org/r/10351/diff/ > > > Testing > ------- > > added two more test cases in TestAvroStorage.java and they all pass > > > Thanks, > > Johnny Zhang > >
