MaoYuan Xian created HAMA-757:
---------------------------------
Summary: The partitioning job output should be un-splitable
Key: HAMA-757
URL: https://issues.apache.org/jira/browse/HAMA-757
Project: Hama
Issue Type: Bug
Components: bsp core
Affects Versions: 0.6.1
Reporter: MaoYuan Xian
When the output sequence files from partitioning job are large(bigger than two
hdfs file block size), the second round of the job (using these sequence file
as input) will start up more tasks than client want. Some times, this
uncertainty make the job exceed the cluster slot capacity.
In the real project, I implemented an new Inputformat which marked as
un-splitable to solve the problem. Is there any better way?
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira