----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/17876/#review34044 -----------------------------------------------------------
http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/PigGenericMapBase.java <https://reviews.apache.org/r/17876/#comment63999> This is problematic style -- an undocumented implicit contract on mp.isEmpty() not changing after setup(), and the variable is private, so none of the implementors or this abstract class can do anything about its value. http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POPackage.java <https://reviews.apache.org/r/17876/#comment63998> outside the scope of this patch, but what's our story with sJobConfInternal? It's internal, but marked public, and plain old sJobConf is deprecated in favor of UdfContext.getUdfContext().getJobConf() we should at least have a comment in that code explaining what's going on and when you would use which... http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POPackage.java <https://reviews.apache.org/r/17876/#comment63997> let's move these string constants (batchsize, cachedbag type, etc) to PigConfiguration when we come across them - Dmitriy Ryaboy On Feb. 9, 2014, 2:17 a.m., Rohini Palaniswamy wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/17876/ > ----------------------------------------------------------- > > (Updated Feb. 9, 2014, 2:17 a.m.) > > > Review request for pig. > > > Bugs: PIG-3456 > https://issues.apache.org/jira/browse/PIG-3456 > > > Repository: pig > > > Description > ------- > > 1) DefaultTuple has a protected boolean isNull = false; which is never used. > Removing this gives ~3-5% improvement for big jobs > 2) Config checking with ThreadLocal conf is repeatedly done for each record. > For eg: createDataBag in POCombinerPackage. But initialized only for first > time in other places like POPackage, POJoinPackage, etc. > > > Diffs > ----- > > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/PigGenericMapBase.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POCombinerPackage.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/PODistinct.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POLoad.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POPackage.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POPartialAgg.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/POSort.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/builtin/Distinct.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/data/DefaultAbstractBag.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/src/org/apache/pig/data/DefaultTuple.java > 1562947 > > http://svn.apache.org/repos/asf/pig/trunk/test/org/apache/pig/test/TestTuple.java > 1562947 > > Diff: https://reviews.apache.org/r/17876/diff/ > > > Testing > ------- > > Full suite of unit tests pass > > > Thanks, > > Rohini Palaniswamy > >
