Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Pig Wiki" for change 
notification.

The "Pig070IncompatibleChanges" page has been changed by RichardDing.
http://wiki.apache.org/pig/Pig070IncompatibleChanges?action=diff&rev1=37&rev2=38

--------------------------------------------------

  || Removing Split by file feature || Split by File || Low to None || Input 
format of the loader would need to support this || We don't know that this 
feature was widely/ever used ||
  || Local files no longer accessible from cluster || Access to Local Files 
from Map-Reduce Mode || Low to None || Copy the file to the cluster using the 
copyFromLocal command prior to the load || This feature was not documented ||
  || Removing Custom Comparators || Removing Custom Comparators || Low to None 
|| None || This feature has been deprecated since the Pig 0.5.0 release. We 
don't have a single known use case ||
- || Using PigFileInputFormat and PigTextInputFormat || Changes to custom Load 
Functions || Low to medium || Custom loaders that use a text-based input format 
and want to support recursive file listing need to use these classes || This 
works around MAPREDUCE-1577. ||
+ || Using !PigFileInputFormat and !PigTextInputFormat || Changes to custom 
Load Functions || Low to medium || Custom loaders that use a text-based input 
format and want to support recursive file listing need to use these classes || 
This works around MAPREDUCE-1577. ||
  
  == Changes to the Load and Store Functions ==
  
@@ -90, +90 @@

  == Merge Join ==
  In Pig 0.6.0 there was a pre-condition for merge join: "The loadfunc for the 
right input of the join should implement the !SamplableLoader interface". In 
Pig 0.7.0 the !LoadFunc should instead implement the !OrderedLoadFunc 
interface. All other pre-conditions still hold.
  
+ == PigFileInputFormat and PigTextInputFormat ==
+ 
+ Given a load location, Pig 0.6.0 loaders recursively load all files under 
that location, including files in nested subdirectories. To get around 
MAPREDUCE-1577, Pig 0.7.0 adds the !PigFileInputFormat and !PigTextInputFormat 
classes. They are subclasses of the Hadoop !FileInputFormat and 
!TextInputFormat that override the listStatus method to support multi-level 
(recursive) directory and file listing. Any custom loader that uses 
!FileInputFormat or !TextInputFormat and wants to support recursive file 
listing should use the corresponding Pig version of the !InputFormat. 
+ 
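
The difference between single-level and multi-level listing can be sketched in plain Java. This is only an illustration of the idea, not the Pig or Hadoop code: it uses java.nio.file on a local file system in place of the Hadoop FileSystem API, and the class and method names (RecursiveListingSketch, listFlat, listRecursive) are hypothetical.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical sketch: contrasts a flat listing with a recursive one.
// It does not reproduce the listStatus override in the Pig classes.
final class RecursiveListingSketch {

    /** Single-level listing: only regular files directly under dir. */
    static List<Path> listFlat(Path dir) throws IOException {
        List<Path> files = new ArrayList<>();
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(dir)) {
            for (Path entry : entries) {
                if (Files.isRegularFile(entry)) {
                    files.add(entry);
                }
            }
        }
        Collections.sort(files);
        return files;
    }

    /** Multi-level listing: descends into every subdirectory. */
    static List<Path> listRecursive(Path dir) throws IOException {
        List<Path> files = new ArrayList<>();
        collect(dir, files);
        Collections.sort(files);
        return files;
    }

    // Depth-first walk; assumes the tree contains no symbolic-link cycles.
    private static void collect(Path dir, List<Path> out) throws IOException {
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(dir)) {
            for (Path entry : entries) {
                if (Files.isDirectory(entry)) {
                    collect(entry, out);
                } else if (Files.isRegularFile(entry)) {
                    out.add(entry);
                }
            }
        }
    }
}
```

For a location that holds top.txt and a/b/deep.txt, listFlat would return only top.txt, while listRecursive would return both files. The second behavior is what a loader gets by using the Pig versions of the input formats.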
