While we are here, controlling the number of regions while creating a new segment, with the cube size options - SMALL, MEDIUM, LARGE - sometimes seems too restrictive. Even with SMALL size cubes, number of reducers are sometimes too low, which adversely impacts the cube build time.
By modifying the cut size to an even smaller value, I was able to bring down the average reduce time of this step, from ~25mins to ~2mins. But I understand that setting very low cut sizes will lead to creation of very small regions, which isn't very desirable either. But too many smaller regions problem for new segments can be handled while merging the segments. So, does it make sense to make the setting of cube size option a bit more flexible ? Through either making it configurable or by providing more options(other than small, medium, large) while creating the cube. Thanks, Bijeet On Wed, Aug 5, 2015 at 8:21 PM, Bijeet Singh <[email protected]> wrote: > The number of reducers in this step, depends on the cube size selected > while creating the cube. You can try with a small size cube. > > But if you are getting only one reducer, even with a small cube, you'd > have to probably tweak the cut size in RangeKeyDistributionReducer to even > smaller values. That will help you increase the number of reducers. > > 2015-08-05 18:13 GMT+05:30 liangmeng <[email protected]>: > >> there is only one reducer, and reduce take too much time, is it possible >> to increase reducenum? >> >> >> >> 梁猛 >> 中国移动广东公司 网管维护中心 网管支撑室 >> 电话:13802880779 >> 邮箱: [email protected] ,[email protected] >> 地址:广东省广州市珠江新城珠江西路11号 广东全球通大厦北3楼 >> 邮编:510623 >> > >
