Jeroen, I'm also a noob but making slight progress.
JobConf always sends MapReduce output to a specified Path, but I think if you setOutputValueClass(Text.class) it may be possible to later change the destination from a file to a stream? Alternatively, run a separate job with only one reduce task, which will write the simplified output to a single file that you can then open as a stream (something like the sketch at the bottom of this message). Recent threads mention that there is no chaining of jobs yet, so shell scripting is another way to combine file results.

Hope that helps,

Peter W.

On Jun 27, 2007, at 6:16 AM, Jeroen Verhagen wrote:
Does working with Hadoop always involve having a set of files in one directory as input and producing a set of files in one directory as output? Are the names of the files in the input and output directories insignificant?

How do you handle the end result of a set of MapReduce tasks? If the result is a set of files, do you have to use another MapReduce task that writes not to a file (in the DFS, for example) but to a simple String, in order to display something on a webpage? Or do you have to read the resulting files directly?
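A minimal sketch of the single-reduce approach described above, using the 2007-era org.apache.hadoop.mapred API. The paths, the class name, and the assumption that the intermediate data is tab-separated text (key and value per line) are illustrative only; newer releases move setInputPath/setOutputPath onto FileInputFormat/FileOutputFormat. With one reduce task the job produces exactly one output file (part-00000), which can then be opened straight from the DFS as a stream and accumulated into a String:

import java.io.BufferedReader;
import java.io.InputStreamReader;

import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.JobClient;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.KeyValueTextInputFormat;

public class SingleFileResult {
    public static void main(String[] args) throws Exception {
        JobConf conf = new JobConf(SingleFileResult.class);
        conf.setJobName("collapse-to-one-file");

        // Intermediate data assumed to be tab-separated "key<TAB>value" text,
        // so the default identity mapper/reducer can pass it through unchanged.
        conf.setInputFormat(KeyValueTextInputFormat.class);
        conf.setOutputKeyClass(Text.class);
        conf.setOutputValueClass(Text.class);

        // One reduce task means exactly one output file: part-00000.
        conf.setNumReduceTasks(1);

        conf.setInputPath(new Path("/user/demo/intermediate")); // hypothetical paths
        Path outDir = new Path("/user/demo/final");
        conf.setOutputPath(outDir);

        JobClient.runJob(conf);

        // Open the single result file directly from the DFS as a stream
        // and read it into a String (e.g. for rendering on a webpage).
        FileSystem fs = FileSystem.get(conf);
        Path result = new Path(outDir, "part-00000");
        BufferedReader reader =
            new BufferedReader(new InputStreamReader(fs.open(result)));
        StringBuilder page = new StringBuilder();
        String line;
        while ((line = reader.readLine()) != null) {
            page.append(line).append('\n');
        }
        reader.close();
        System.out.println(page.toString());
    }
}

The key point is the last part: nothing forces you to run another MapReduce job just to consume the result. Any client with access to the DFS can open the part files with FileSystem.open() and read them as ordinary streams.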
