Thanks, Arbow. I checked the code and also ran some experiments. It turns out a big file can indeed be split in the middle of a line, but map/reduce still works properly, since the dfs layer hides the block layout from the map/reduce tasks.
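To make this concrete, here is a rough sketch of the convention a line-oriented record reader can follow so that each line is processed exactly once, no matter where the block boundaries fall. This is just an illustration I put together, not Hadoop's actual code (see InputFormatBase for the real thing); the class and method names below are made up:

import java.io.IOException;
import java.io.RandomAccessFile;

// Illustrative only -- NOT Hadoop's actual reader. Sketches the
// convention a line-oriented record reader can follow so that
// every line ends up in exactly one split.
public class SplitLineReader {
    private final RandomAccessFile file;
    private final long end; // first byte past this split
    private long pos;       // current byte offset in the file

    public SplitLineReader(String path, long start, long length)
            throws IOException {
        file = new RandomAccessFile(path, "r");
        end = start + length;
        pos = start;
        file.seek(pos);
        // Unless this split starts at offset 0, the previous split
        // owns the line crossing our start boundary, so skip past it.
        if (start != 0) {
            readToNewline(null);
        }
    }

    // Returns the next complete line, or null when the split is done.
    // A line is only *started* before 'end', but once started it is
    // read to completion even if it crosses the split boundary.
    public String nextLine() throws IOException {
        if (pos >= end) {
            return null;
        }
        StringBuilder sb = new StringBuilder();
        return readToNewline(sb) ? sb.toString() : null;
    }

    // Reads bytes up to and including the next '\n' (or EOF).
    // Returns false if EOF was hit before any byte was read.
    private boolean readToNewline(StringBuilder sb) throws IOException {
        int b = file.read();
        if (b == -1) {
            return false;
        }
        while (b != -1 && b != '\n') {
            if (sb != null) {
                // fine for ASCII; a real reader would decode bytes properly
                sb.append((char) b);
            }
            pos++;
            b = file.read();
        }
        if (b == '\n') {
            pos++;
        }
        return true;
    }
}

The trick is that split boundaries are soft: a split skips the partial line at its start (the previous split owns it) and reads past its own end to finish the last line it started, so no line is ever processed twice or cut in half.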
Lei

On 4/20/06, Arbow <[EMAIL PROTECTED]> wrote:
>
> Hi, Lei Chen:
>
> You can have a view on org.apache.hadoop.mapred.InputFormatBase, I
> think it will help you.
>
> On 4/20/06, Lei Chen <[EMAIL PROTECTED]> wrote:
> > Hi,
> > I am a new user of hadoop. This project looks cool.
> >
> > There is one question about the MapReduce. I want to process a big
> > file. To my understanding, hadoop will partition a big file into
> > blocks and each block is assigned to a worker. Then, how does hadoop
> > decide where to cut those big files? Does it guarantee that each line
> > in the input file will be assigned to one block and no line will be
> > divided into two parts in different blocks?
> >
> > Lei
