Check out Kevin Weil's FileSetInputFormat which, after my enhancement, is close to what you wanted.
Cheers On Sun, May 30, 2010 at 9:40 AM, Tiago Veloso <[email protected]> wrote: > I want to add as input to an hadoop Job all the files in the subfolders > (and folders under those). > > > Tiago Veloso > [email protected] > > > > On May 30, 2010, at 11:28 , Pierre ANCELOT wrote: > > > Maybe would it be simpler if you told us wht you're trying to acheive..? > > > > > > On Sun, May 30, 2010 at 10:44 AM, Tiago Veloso <[email protected]> > wrote: > > > >> Hi, > >> > >> I am writing an hadoop application that should process a Java Project. A > >> Java Project directory can have many subfolders(packages) and files(fava > >> files) mixed in with them. As well as unrelated packages (.svn and the > >> likes). > >> > >> Is it possible to configure hadoop to take as input all the folders of a > >> Java Project directory so it would capture all files? > >> > >> I know it's possible to flatten a Java Project directory. But I would > like > >> to avoid it. > >> > >> Tiago Veloso > >> [email protected] > >> > >> > >> > >> > > > > > > -- > > http://www.neko-consulting.com > > Ego sum quis ego servo > > "Je suis ce que je protège" > > "I am what I protect" > >
