you could write your word set to a file in DFS somewhere outside of
the input directory and read it at map init time (within the
configure() method). you could pass the path to file as a
configuration property.

HTH

Alejandro

On 3/19/07, Ilya Vishnevsky <[EMAIL PROTECTED]> wrote:
Hello! My question is about mapreduce. Is it possible to pass to the map
function some global information? For example I have a set of words and
a large set of documents. I want the map function to get each document
as value and emit pairs (word-frequency) for each word in the set, where
"frequency" is frequency of this word in the document. To do this I need
map function to have access to the set of words each time it runs. Is it
possible to do that?


Reply via email to