HI, I am developing the simple inverted index program frm the hadoop. My map function has the output: <word, doc> and the reducer has: <word, list(docs)>
Now I want to use one more mapreduce to remove stop and scrub words from this output. Also in the next stage I would like to have short summay associated with every word. How should I design my program from this stage? I mean how would I apply multiple mapreduce to this? What would be the better way to perform this? Thanks, Regards, - Aayush Garg, Phone: +41 76 482 240