Dear Wiki user, You have subscribed to a wiki page or wiki category on "Hama Wiki" for change notification.
The "Partitioning" page has been changed by edwardyoon: http://wiki.apache.org/hama/Partitioning?action=diff&rev1=12&rev2=13 == Partition Function == + + * '''NOTE: if when the number of splits exceeds the maximum number of tasks?'''. In Hama BSP computing framework, the Partition function is used for obtaining scalability of a Bulk Synchronous Parallel processing, and determining how to distribute the slices of input data among BSP processors. Unlike Map/Reduce data processing model, many scientific algorithms based on Message-Passing Bulk Synchronous Parallel model often requires that a processor obtain “nearby or related” data from other processors in order to complete the computation. In this case, you can create your own Partition function for determining processor inter-communication and how to distribute the data. @@ -27, +29 @@ === Specify the partition files and directories === - - If the input is already partitioned, you can skip pre-partitioning step as following configuration: {{{
