[ https://issues.apache.org/jira/browse/PIG-657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Olga Natkovich resolved PIG-657.
--------------------------------

    Resolution: Fixed

This is resolved with Pig 0.7.

> splitsize is ignored in PigInputFormat
> --------------------------------------
>
>                 Key: PIG-657
>                 URL: https://issues.apache.org/jira/browse/PIG-657
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Laukik Chitnis
>
> The way to control the number of mappers in Hadoop has been to specify a
> mapred.min.split.size parameter in the job conf. For example:
> mapred.min.split.size=1073741824,mapred.map.tasks=10
> However, even if this parameter is specified, Pig creates the number of
> mappers depending only on the number of blocks in the file. This is because
> the parameter is not used in the PigInputFormat.
> The parameter can actually be extracted from the job conf object. So, one way
> of doing this would be to pass a handle to the job conf object to the
> PigInputFormat or the custom slicer.
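
To illustrate the behaviour the reporter expected, here is a minimal, self-contained sketch of how a minimum split size read from the job conf could change the mapper count. It is not Pig's or Hadoop's actual code. The formula mirrors, as far as I understand it, the split-size rule of Hadoop's old-API FileInputFormat (max of the minimum size and the smaller of goal size and block size). The class SplitSizeSketch, its helper methods, the plain Map standing in for the job conf object, the 4 GiB file and the 128 MiB block size are all assumptions made for the example, and the small slop factor Hadoop applies to the last split is ignored.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: none of these names exist in Pig or Hadoop.
public class SplitSizeSketch {

    // Reads mapred.min.split.size from a stand-in for the job conf (default 1 byte).
    static long minSplitSize(Map<String, String> conf) {
        return Long.parseLong(conf.getOrDefault("mapred.min.split.size", "1"));
    }

    // Reads mapred.map.tasks, which is only a hint for the number of splits.
    static int requestedMapTasks(Map<String, String> conf) {
        return Integer.parseInt(conf.getOrDefault("mapred.map.tasks", "1"));
    }

    // Split size = max(minSize, min(goalSize, blockSize)).
    static long computeSplitSize(long goalSize, long minSize, long blockSize) {
        return Math.max(minSize, Math.min(goalSize, blockSize));
    }

    // Number of splits for one file, rounding up and ignoring any slop factor.
    static long countSplits(long fileSize, long blockSize, Map<String, String> conf) {
        int hint = Math.max(1, requestedMapTasks(conf));
        long goalSize = fileSize / hint;
        long splitSize = computeSplitSize(goalSize, minSplitSize(conf), blockSize);
        return (fileSize + splitSize - 1) / splitSize;
    }

    public static void main(String[] args) {
        long fileSize = 4L * 1024 * 1024 * 1024; // assumed 4 GiB input file
        long blockSize = 128L * 1024 * 1024;     // assumed 128 MiB block size

        // Only the map-task hint is set: one split per block.
        Map<String, String> blocksOnly = new HashMap<>();
        blocksOnly.put("mapred.map.tasks", "10");
        System.out.println("without min split size: "
                + countSplits(fileSize, blockSize, blocksOnly));

        // Settings from the issue description: 1 GiB minimum split size.
        Map<String, String> withMin = new HashMap<>(blocksOnly);
        withMin.put("mapred.min.split.size", "1073741824");
        System.out.println("with min split size:    "
                + countSplits(fileSize, blockSize, withMin));
    }
}
```

Under these assumptions the first case yields one split per block (32), which matches the behaviour described in the issue, while honouring the 1 GiB minimum reduces the count to 4. Note that mapred.map.tasks=10 does not force ten mappers in this rule; it only sets the goal size, which the block size and the minimum split size can override.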