HI,
I am developing the simple inverted index program frm the hadoop. My map
function has the output:
<word, doc>
and the reducer has:
<word, list(docs)>

Now I want to use one more mapreduce to remove stop and scrub words from
this output. Also in the next stage I would like to have short summay
associated with every word. How should I design my program from this stage?
I mean how would I apply multiple mapreduce to this? What would be the
better way to perform this?

Thanks,

Regards,
-
Aayush Garg,
Phone: +41 76 482 240

Reply via email to