Hi, > I am trying to use the hadoop's datajoin for joining two relation. According > to > the Readme file of datajoin, it gives the following syntax: > > $HADOOP_HOME/bin/hadoop jar hadoop-datajoin-examples.jar > org.apache.hadoop.contrib.utils.join.DataJoinJob datajoin/input > datajoin/output > Text 1 org.apache.hadoop.contrib.utils.join.SampleDataJoinMapper > org.apache.hadoop.contrib.utils.join.SampleDataJoinReducer > org.apache.hadoop.contrib.utils.join.SampleTaggedMapOutput Text > > > But I do not find hadoop-datajoin-examples.jar anywhere in my Hadoop_home. Can > anyone tell me how to produce it or where to find it?
Datajoin is a contrib module. So, you will typically find it under contrib/datajoin/. The name could something slightly different - it could have a version number and other things. Thanks Hemanth > > Thanks in advance. > > > >