-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/9301/
-----------------------------------------------------------

Review request for giraph.


Description
-------

With many workers and many input split threads, verifying that all input 
splits are finished once reading is done takes a long time, because every 
thread checks every input split.
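
To illustrate the cost, here is a minimal sketch of sharing one cursor over the 
split list among all threads of a worker, so that each split is looked at once 
per worker rather than once per thread. It is not the code in this patch: the 
class name SharedSplitCursor, its methods and the "/split/N" paths are 
hypothetical, and the real ZooKeeper check is replaced by a counter.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical sketch: one cursor per worker, shared by all its input threads. */
class SharedSplitCursor {
  private final List<String> splitPaths;
  /** Index of the next split that no thread has looked at yet. */
  private final AtomicInteger next = new AtomicInteger(0);
  /** Number of split checks performed (stands in for the real check). */
  private final AtomicInteger checks = new AtomicInteger(0);

  SharedSplitCursor(List<String> splitPaths) {
    this.splitPaths = splitPaths;
  }

  /** Returns the next unvisited split path, or null once all are handed out. */
  String nextSplit() {
    int i = next.getAndIncrement();
    if (i >= splitPaths.size()) {
      return null;
    }
    checks.incrementAndGet();
    return splitPaths.get(i);
  }

  int totalChecks() {
    return checks.get();
  }

  /** Runs numThreads threads over numSplits splits; returns total checks made. */
  static int run(int numSplits, int numThreads) {
    List<String> paths = new ArrayList<>();
    for (int i = 0; i < numSplits; i++) {
      paths.add("/split/" + i); // made-up paths, for illustration only
    }
    SharedSplitCursor cursor = new SharedSplitCursor(paths);
    ExecutorService pool = Executors.newFixedThreadPool(numThreads);
    try {
      List<Future<?>> futures = new ArrayList<>();
      for (int t = 0; t < numThreads; t++) {
        futures.add(pool.submit(() -> {
          while (cursor.nextSplit() != null) {
            // A real worker would check or reserve the split here.
          }
        }));
      }
      for (Future<?> f : futures) {
        f.get(); // wait for every thread to drain the cursor
      }
    } catch (InterruptedException | ExecutionException e) {
      throw new RuntimeException(e);
    } finally {
      pool.shutdown();
    }
    return cursor.totalChecks();
  }
}
```

In this sketch, SharedSplitCursor.run(100, 20) performs 100 checks in total, 
whereas 20 threads each scanning the full list would perform 2000.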


This addresses bug GIRAPH-498.
    https://issues.apache.org/jira/browse/GIRAPH-498


Diffs
-----

  giraph-core/src/main/java/org/apache/giraph/conf/GiraphConfiguration.java 796047d 
  giraph-core/src/main/java/org/apache/giraph/worker/BspServiceWorker.java f542344 
  giraph-core/src/main/java/org/apache/giraph/worker/EdgeInputSplitsCallable.java 7d40dfb 
  giraph-core/src/main/java/org/apache/giraph/worker/EdgeInputSplitsCallableFactory.java 1adcd73 
  giraph-core/src/main/java/org/apache/giraph/worker/InputSplitPathOrganizer.java bfaefd2 
  giraph-core/src/main/java/org/apache/giraph/worker/InputSplitsCallable.java d09ca2b 
  giraph-core/src/main/java/org/apache/giraph/worker/InputSplitsHandler.java PRE-CREATION 
  giraph-core/src/main/java/org/apache/giraph/worker/VertexInputSplitsCallable.java a4f98e1 
  giraph-core/src/main/java/org/apache/giraph/worker/VertexInputSplitsCallableFactory.java 0d617dc 
  giraph-core/src/test/java/org/apache/giraph/TestBspBasic.java 987f51c 
  giraph-hbase/.graph.csv.crc PRE-CREATION 
  giraph-hbase/graph.csv PRE-CREATION 

Diff: https://reviews.apache.org/r/9301/diff/


Testing
-------

mvn clean verify

Real application, using 200 workers and 20 input threads:
- trunk - about 560s for input split threads to finish, 720s for input superstep
- with this patch - about 310s for input split threads to finish, 500s for input superstep


Thanks,

Maja Kabiljo
