Actually, I want the output to be usable by other modules. So do those modules have to read the output from HDFS files? Or should I integrate them into MapReduce? Are there other ways?
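
For reference, reading the job output back from HDFS in another module looks roughly like the sketch below. It assumes the job wrote plain text (TextOutputFormat) under a hypothetical output directory /user/liu/output; the path and the class name OutputReader are made up for the example.

import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class OutputReader {
    public static void main(String[] args) throws IOException {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);

        // Each reducer writes one part-r-NNNNN file under the job's output directory.
        FileStatus[] parts = fs.globStatus(new Path("/user/liu/output/part-r-*"));
        if (parts == null) {
            return;  // hypothetical path, nothing to read
        }
        for (FileStatus part : parts) {
            BufferedReader reader =
                new BufferedReader(new InputStreamReader(fs.open(part.getPath())));
            try {
                String line;
                while ((line = reader.readLine()) != null) {
                    // TextOutputFormat separates key and value with a tab.
                    System.out.println(line);
                }
            } finally {
                reader.close();
            }
        }
    }
}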

--------------------------------------------------
From: "Jeff Zhang" <[email protected]>
Sent: Friday, November 27, 2009 10:00 PM
To: <[email protected]>
Subject: Re: Store mapreduce output into my own data structures

Hi Liu,

Why do you want to store the output in memory? You cannot use the output
outside of the reducer.
Actually, the output of the reducer starts out in memory; the OutputFormat
then writes that data to the file system or to another data store.
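
For illustration (this is not code from the thread), a custom OutputFormat for the new org.apache.hadoop.mapreduce API could look like the sketch below. Its RecordWriter puts the pairs into a static in-memory map just to show where the data ends up; the class and field names are made up, and the important caveat is that this map lives inside each reduce task's JVM, not in the driver program, so the driver still cannot read it. To make the output usable elsewhere, write() would have to send the data to HDFS or to an external store instead.

import java.io.IOException;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

import org.apache.hadoop.mapreduce.JobContext;
import org.apache.hadoop.mapreduce.OutputCommitter;
import org.apache.hadoop.mapreduce.OutputFormat;
import org.apache.hadoop.mapreduce.RecordWriter;
import org.apache.hadoop.mapreduce.TaskAttemptContext;
import org.apache.hadoop.mapreduce.lib.output.NullOutputFormat;

public class InMemoryOutputFormat<K, V> extends OutputFormat<K, V> {

    // Lives only inside the task's JVM; a real implementation would write to
    // an external store (HBase, a database, ...) here instead.
    private static final Map<Object, Object> STORE =
        new ConcurrentHashMap<Object, Object>();

    @Override
    public RecordWriter<K, V> getRecordWriter(TaskAttemptContext context)
            throws IOException, InterruptedException {
        return new RecordWriter<K, V>() {
            @Override
            public void write(K key, V value) {
                STORE.put(key, value);  // replace with a call to your data store
            }
            @Override
            public void close(TaskAttemptContext ctx) {
                // flush and close connections to the external store here
            }
        };
    }

    @Override
    public void checkOutputSpecs(JobContext context) {
        // nothing to validate in this sketch
    }

    @Override
    public OutputCommitter getOutputCommitter(TaskAttemptContext context)
            throws IOException, InterruptedException {
        // reuse the no-op committer that NullOutputFormat provides
        return new NullOutputFormat<K, V>().getOutputCommitter(context);
    }
}

With a class like this, FileOutputFormat.setOutputPath() is no longer needed; the job would simply call job.setOutputFormatClass(InMemoryOutputFormat.class).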


Jeff Zhang



2009/11/27 Liu Xianglong <[email protected]>

Hi, everyone. Has anyone used MapReduce to store the reduce output in
memory? I mean, currently the job's output path is set and the reduce
outputs are stored in files under that path (see the comments in the
following code):
    job.setOutputFormatClass(MyOutputFormat.class);
    // Can I implement my own OutputFormat to store these output key-value
    // pairs in my data structures, or are there other ways to do it?
    job.setOutputKeyClass(ImmutableBytesWritable.class);
    job.setOutputValueClass(Result.class);
    FileOutputFormat.setOutputPath(job, outputDir);

Is there any way to store them in variables or data structures? If so, how can I implement my OutputFormat? Any suggestions and code are welcome.

Another question: is there a way to set the number of map tasks? There seems
to be no API for this in Hadoop's new Job API, and I am not sure how to set
this number.
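
For illustration only (not an answer from the thread): in the new API the number of map tasks is derived from the number of input splits produced by the InputFormat, so it is usually influenced through the split size rather than set directly. A minimal sketch, with a hypothetical class and job name:

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;

public class SplitSizeExample {
    public static void main(String[] args) throws IOException {
        Configuration conf = new Configuration();
        Job job = new Job(conf, "split-size-example");  // hypothetical job name

        // The number of map tasks equals the number of input splits, so it is
        // steered through split sizes rather than set directly: smaller splits
        // mean more map tasks, larger splits mean fewer.
        FileInputFormat.setMinInputSplitSize(job, 64L * 1024 * 1024);   // 64 MB
        FileInputFormat.setMaxInputSplitSize(job, 128L * 1024 * 1024);  // 128 MB
    }
}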

Thanks!

Best Wishes!
_____________________________________________________________

刘祥龙  Liu Xianglong

