how to chain multiple jobs in hadoop streamming

2008-07-10 Thread xinfan meng
Does Hadoop suport chaiing multiple jobs with hadoop streaming mechanism? If so, how can I do that? Thanks. -- Best Wishes Meng Xinfan(蒙新泛) Institute of Computational Linguistics Department of Computer Science & Technology School of Electronic Engineering & Computer Science Peking University Beij

compute document frequency with hadoop-streaming

2008-06-08 Thread xinfan meng
In hadoopstreaming, we accept input from stdin. If we want to compute the document frequncy of words, the somplest way is to output words as keys and file name as values. then how can we get the input file name passed to this MapReduce job? Thanks. -- Best Wishes Meng Xinfan(蒙新泛) Institute of Com