Re: split operator

2010-08-23 Thread Daniel Dai
a lot. > > -Gang > > > > > - Original Message > From: Daniel Dai > To: "pig-dev@hadoop.apache.org" > Sent: 2010/7/26 (Mon) 4:58:49 PM > Subject: Re: split operator > > Hi, Gang, > It is about multiquery optimization. In MRCompiler, we will create a

Re: split operator

2010-08-23 Thread Gang Luo
Date: 2010/7/26 (Mon) 4:58:49 PM Subject: Re: split operator Hi, Gang, It is about multiquery optimization. In MRCompiler, we will create a map-reduce boundary for split; later, in MultiQueryOptimizer, we will merge several splits into one map-reduce job. In this map-reduce job, we will nest several s

Re: split operator

2010-07-26 Thread Daniel Dai
74.pdf > > > -Gang > > > > - Original Message > From: Daniel Dai > To: "pig-dev@hadoop.apache.org" > Sent: 2010/7/26 (Mon) 2:09:25 PM > Subject: Re: split operator > > Hi, Gang, > Which part of the paper are you talking about? We don't do in-memor

Re: split operator

2010-07-26 Thread Gang Luo
-Gang - Original Message From: Daniel Dai To: "pig-dev@hadoop.apache.org" Sent: 2010/7/26 (Mon) 2:09:25 PM Subject: Re: split operator Hi, Gang, Which part of the paper are you talking about? We don't do in-memory split. We dump the split result to a temporary file and start a new

Re: split operator

2010-07-26 Thread Daniel Dai
Hi, Gang, Which part of the paper are you talking about? We don't do in-memory split. We dump the split result to a temporary file and start a new map-reduce job. Split does create a map-reduce boundary (though that is not entirely true: the multiquery optimizer may combine some of these jobs). Daniel
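
For readers following the thread, here is a toy Python sketch of the two behaviors described above. It is not Pig's code and does not use any Pig or Hadoop API. Every name in it (`split_to_temp_files`, `run_downstream_job`, `run_with_boundary`, `run_merged`) is hypothetical. It models a "job" as a plain function, the split boundary as one temporary file per branch, and the merged case as a single pass that applies each branch's downstream step inline.

```python
import os
import tempfile


def split_to_temp_files(records, branches):
    """Toy split: write each record to a temp file for every branch whose
    predicate it satisfies. Returns a dict of branch name -> file path."""
    paths, handles = {}, {}
    for name in branches:
        fd, path = tempfile.mkstemp(suffix="_" + name + ".txt")
        handles[name] = os.fdopen(fd, "w")
        paths[name] = path
    try:
        for rec in records:
            for name, pred in branches.items():
                if pred(rec):
                    handles[name].write(str(rec) + "\n")
    finally:
        for handle in handles.values():
            handle.close()
    return paths


def run_downstream_job(path, fn):
    """Toy 'new job': read one branch's temp file and apply fn per record."""
    with open(path) as f:
        return [fn(int(line)) for line in f]


def run_with_boundary(records, branches, fns):
    """Split to temp files first, then run one downstream job per branch."""
    paths = split_to_temp_files(records, branches)
    try:
        return {name: run_downstream_job(paths[name], fns[name])
                for name in branches}
    finally:
        for path in paths.values():
            os.remove(path)  # clean up the intermediate files


def run_merged(records, branches, fns):
    """Toy merged job: one pass over the input, no intermediate files."""
    out = {name: [] for name in branches}
    for rec in records:
        for name, pred in branches.items():
            if pred(rec):
                out[name].append(fns[name](rec))
    return out


if __name__ == "__main__":
    records = [1, 5, 12, 20]
    branches = {"small": lambda x: x < 10, "large": lambda x: x >= 10}
    fns = {"small": lambda x: x * 2, "large": lambda x: x + 1}
    print(run_with_boundary(records, branches, fns))
    print(run_merged(records, branches, fns))
```

Both functions return the same per-branch results. The only difference in this model is whether the split output is written to intermediate files before the downstream steps run.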