Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hama Wiki" for change 
notification.

The "Partitioning" page has been changed by edwardyoon:
http://wiki.apache.org/hama/Partitioning?action=diff&rev1=5&rev2=6

- == User-defined partitioning ==
+ == Partitioning Function ==
  
- In the Hama BSP computing framework, the Partition function is used to make 
Bulk Synchronous Parallel processing scalable and to determine how the slices 
of input data are distributed among BSP processors. Unlike the MapReduce data 
processing model, many scientific algorithms based on the message-passing Bulk 
Synchronous Parallel model require that a processor obtain “nearby or 
related” data from other processors in order to complete its processing. In 
this case, processors determine their communication partners, or neighbors, 
using the Partition function.
+ In the Hama BSP computing framework, the Partition function is used to make 
Bulk Synchronous Parallel processing scalable and to determine how the slices 
of input data are distributed among BSP processors. Unlike the Map/Reduce data 
processing model, many scientific algorithms based on the message-passing Bulk 
Synchronous Parallel model require that a processor obtain “nearby or 
related” data from other processors in order to complete its processing. In 
this case, processors determine their communication partners, or neighbors, 
using the Partition function.
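As a rough illustration of the idea above, the following is a minimal sketch of a hash-based partition function. It does not use the Hama API: the class `PartitionSketch` and the method `partitionOf` are hypothetical names chosen for this example, and hashing is only one possible assignment strategy. The point is that every processor can compute, from a key alone, which task owns that key and therefore which peer it has to communicate with.

```java
import java.util.Arrays;

public class PartitionSketch {

    /**
     * Hypothetical partition function (not Hama's actual interface):
     * maps a key to a task index in the range [0, numTasks).
     * The sign bit of the hash code is masked so the result is never negative.
     */
    public static int partitionOf(Object key, int numTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numTasks;
    }

    public static void main(String[] args) {
        int numTasks = 4; // assumed number of BSP tasks
        // Any task can evaluate the same function to find the owner of a key,
        // i.e. the peer it must message to obtain data related to that key.
        for (String key : Arrays.asList("vertex-1", "vertex-2", "vertex-3")) {
            System.out.println(key + " -> task " + partitionOf(key, numTasks));
        }
    }
}
```

Because the function is deterministic, the task that stores a key and the task that looks it up always agree on its owner without exchanging any extra metadata.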
  
  Internally, input data partitioning works in the following sequence:
  
@@ -10, +10 @@

   * Each task of the "partitioning job" reads its assigned data block and 
rewrites it to particular partition files.
   * After prepartitioning is done, the BSP job is launched.
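The two steps visible above can be sketched roughly as follows. This is an in-memory illustration only, not Hama code: `PrePartitionSketch`, `prePartition`, and `partitionOf` are hypothetical names, the input blocks are plain lists of strings, and the lists returned stand in for partition files.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PrePartitionSketch {

    /** Hypothetical partition function: hash of the record modulo the partition count. */
    public static int partitionOf(String record, int numPartitions) {
        return (record.hashCode() & Integer.MAX_VALUE) % numPartitions;
    }

    /**
     * Mimics the partitioning job: every input block is read and each of its
     * records is rewritten into the bucket chosen by the partition function.
     * The buckets are in-memory stand-ins for partition files.
     */
    public static List<List<String>> prePartition(List<List<String>> blocks,
                                                  int numPartitions) {
        List<List<String>> partitions = new ArrayList<>();
        for (int i = 0; i < numPartitions; i++) {
            partitions.add(new ArrayList<>());
        }
        for (List<String> block : blocks) {      // one partitioning task per block
            for (String record : block) {
                partitions.get(partitionOf(record, numPartitions)).add(record);
            }
        }
        return partitions;
    }

    public static void main(String[] args) {
        List<List<String>> blocks = Arrays.asList(
                Arrays.asList("a", "b"),
                Arrays.asList("c", "d"));
        List<List<String>> partitions = prePartition(blocks, 2);
        // Only after all records have been rewritten would the BSP job start,
        // with each BSP task reading one partition.
        for (int i = 0; i < partitions.size(); i++) {
            System.out.println("partition " + i + ": " + partitions.get(i));
        }
    }
}
```

Records from different input blocks end up in the same bucket whenever the partition function maps them to the same index, which is what lets each BSP task later start from data that already belongs to it.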
  
- == Example ==
+ == Create your own Partitioner ==
+ 
+ 
+ == Partitioner Configuration ==
  
  {{{
    BSPJob job = new BSPJob(conf);
