Hi Everyone, I'm a newer to hadoop. I want to write a mapreduce program to implement the inverted index. My question is which input format should I use? It seems that the TextInputFormat's key is the offset of each line in the document. How can I get the document name(as the document id) that the map function processes? Thanks! He Hao
---------------------------------
@yahoo.cn 新域名、无限量,快来抢注!
