If I understand correctly, I should see output like "Total input paths (combined) to process : 7" when I run a Pig script. The "(combined)" part is missing from my output, so does that mean CombineFileInputFormat is not being used? Where can I find the Pig configuration? I think I need to check the value of "pig.splitCombination".

2012/1/11 Daniel Dai <[email protected]>

> Check PIG-1518.
>
> Daniel
>
> On Wed, Jan 11, 2012 at 11:01 AM, Marcel Holle <[email protected]> wrote:
>
> > How could I verify this information? Could you point me to a config or the
> > source code?
> >
> > 2012/1/11 Daniel Dai <[email protected]>
> >
> > > It is default in 0.8 as well.
> > >
> > > Daniel
> > >
> > > On Wed, Jan 11, 2012 at 10:43 AM, Marcel Holle <[email protected]> wrote:
> > >
> > > > Is there also a way to activate the CombineFileInputFormat in Pig 0.8.1?
> > > >
> > > > 2012/1/10 Alex Rovner <[email protected]>
> > > >
> > > > > In versions 9+ default is CombineFileInputFormat
> > > > >
> > > > > On Tue, Jan 10, 2012 at 8:10 PM, Marcel Holle <[email protected]> wrote:
> > > > >
> > > > > > How could I use the CombineFileInputFormat in Pig? I have a performance
> > > > > > issue with lots of small files which I want to get rid of. I think by
> > > > > > default the FileInputFormat is used.
