DataJoinJob is contained in hadoop-0.20.2-datajoin.jar which is in your HADOOP_CLASSPATH
I think you should specify samplejoin.jar using -libjars instead of putting it directly after jar command: hadoop jar hadoop-0.20.2-datajoin.jar org.apache.hadoop.contrib.utils.join.DataJoinJob -libjars ./samplejoin.jar ... (same as your example) Cheers On Fri, Mar 26, 2010 at 3:24 PM, M B <machac...@gmail.com> wrote: > I may be having a setup issue with classpaths, would appreciate some help. > > I created a jar with all the Sample* classes in contrib/DataJoin. Here is > the listing of my samplejoin.jar file: > " zip.vim version v22 > " Browsing zipfile /home/hadoop/hadoop_tests/samplejoin.jar > " Select a file with cursor and press ENTER > META-INF/ > META-INF/MANIFEST.MF > org/ > org/apache/ > org/apache/hadoop/ > org/apache/hadoop/contrib/ > org/apache/hadoop/contrib/utils/ > org/apache/hadoop/contrib/utils/join/ > org/apache/hadoop/contrib/utils/join/SampleDataJoinReducer.class > org/apache/hadoop/contrib/utils/join/SampleTaggedMapOutput.class > org/apache/hadoop/contrib/utils/join/SampleDataJoinMapper.class > > When I go to run this, things start to run, but every Map try errors out > with: > "java.lang.RuntimeException: java.lang.ClassNotFoundException: > org.apache.hadoop.contrib.utils.join.SampleTaggedMapOutput" > > Here is the command: > hadoop jar ./samplejoin.jar > org.apache.hadoop.contrib.utils.join.DataJoinJob > datajoin/input datajoin/output Text 1 > org.apache.hadoop.contrib.utils.join.SampleDataJoinMapper > org.apache.hadoop.contrib.utils.join.SampleDataJoinReducer > org.apache.hadoop.contrib.utils.join.SampleTaggedMapOutput Text > > This is a new install of 0.20.2. > > HADOOP_CLASSPATH is set > to: /opt/hadoop-0.20.2/contrib/datajoin/hadoop-0.20.2-datajoin.jar > Any help would be appreciated. >