[
https://issues.apache.org/jira/browse/HAMA-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15047998#comment-15047998
]
Edward J. Yoon edited comment on HAMA-970 at 12/9/15 4:17 AM:
--------------------------------------------------------------
Hi,
To launch more tasks than num of splits, you should use input partitioner -
https://github.com/apache/hama/blob/master/core/src/test/java/org/apache/hama/bsp/TestPartitioning.java
example, you should use input partitioner.
For example, if you have a 10MB file and set the number of tasks 10 with
partitioner, the framework automatically partition 10MB file into 10 files and
then launch your main BSP program with 10 tasks.
{quote}
Previously in my Input Paths, I was adding 2 files, one empty file and one 70
MB file. This is working but Hama only opens up 2 tasks, one for empty file
(which becomes the master) and one for 70 MB file (which becomes my only
slave). Now, since I want to divide the 70 MB file into 4-5 tasks if I try to
do this solution, I get an exception.
{quote}
You can do like this: 1) partition one 70MB file into 9 files (manually) and
then launch the BSP program with setNumOfTasks(10);
was (Author: udanax):
Hi,
To launch more tasks than num of splits, you should use input partitioner -
https://github.com/apache/hama/blob/master/core/src/test/java/org/apache/hama/bsp/TestPartitioning.java
example, you should use input partitioner.
For example, if you have a 10MB file and set the number of tasks 10 with
partitioner, the framework automatically partition 10MB file into 10 files and
then launch your main BSP program with 10 tasks.
{qoute}
Previously in my Input Paths, I was adding 2 files, one empty file and one 70
MB file. This is working but Hama only opens up 2 tasks, one for empty file
(which becomes the master) and one for 70 MB file (which becomes my only
slave). Now, since I want to divide the 70 MB file into 4-5 tasks if I try to
do this solution, I get an exception.
{qoute}
You can do like this: 1) partition one 70MB file into 9 files (manually) and
then launch the BSP program with setNumOfTasks(10);
> Exception can occur if the size of splits is bigger than numBSPTasks
> --------------------------------------------------------------------
>
> Key: HAMA-970
> URL: https://issues.apache.org/jira/browse/HAMA-970
> Project: Hama
> Issue Type: Bug
> Components: bsp core
> Affects Versions: 0.7.0
> Reporter: JongYoon Lim
> Priority: Trivial
> Attachments: HAMA-970.patch
>
>
> In JonInProgress, it's possble to get Exception in initTasks().
> {code:java}
> this.tasks = new TaskInProgress[numBSPTasks];
> for (int i = 0; i < splits.length; i++) {
> tasks[i] = new TaskInProgress(getJobID(), this.jobFile.toString(),
> splits[i], this.conf, this, i);
> }
> {code}
> I'm not sure that *numBSPTask* is always bigger than *splits.length*.
> So, I think it's better to use bigger value to assign the *tasks* array.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)