-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/9138/
-----------------------------------------------------------

Review request for giraph.


Description
-------

Adds status updates to saving of the vertices and improve the overall logic of 
when to print status of loading/storing the graph (every 250k vertices or 15 
secs). This will help us to see which are the slow workers when saving output. 
This updates the Hadoop status messages as well as prints to the task log. I 
also made this consistent for the loading.

Task log messages look like the following:

INFO    2013-01-29 12:51:46,044 [main] 
org.apache.giraph.worker.BspServiceWorker  - saveVertices: Saved 98751 out of 
1000000 vertices, on partition 2 out of 24
INFO    2013-01-29 12:52:25,539 [main] 
org.apache.giraph.worker.BspServiceWorker  - saveVertices: Saved 348752 out of 
1000000 vertices, on partition 8 out of 24
INFO    2013-01-29 12:53:28,062 [main] 
org.apache.giraph.worker.BspServiceWorker  - saveVertices: Saved 598753 out of 
1000000 vertices, on partition 14 out of 24

I added an option for dumping output to PageRankBenchmark to test this as well.


This addresses bug GIRAPH-492.
    https://issues.apache.org/jira/browse/GIRAPH-492


Diffs
-----

  giraph-core/src/main/java/org/apache/giraph/benchmark/PageRankBenchmark.java 
3ef471a711183dd147990c5f6bb07485a58f5a71 
  giraph-core/src/main/java/org/apache/giraph/bsp/CentralizedService.java 
83fba57af3f11f6c412ec59e2d121e57ea280d98 
  giraph-core/src/main/java/org/apache/giraph/bsp/CentralizedServiceMaster.java 
399dc72896673fbf75d2b7b933c4ea02a08f25ea 
  giraph-core/src/main/java/org/apache/giraph/bsp/CentralizedServiceWorker.java 
294c2c71017b5273240c9dce1f3de2be65aed289 
  giraph-core/src/main/java/org/apache/giraph/graph/FinishedSuperstepStats.java 
d888d1038026c83b5f82956b6872eb64a44dd700 
  giraph-core/src/main/java/org/apache/giraph/graph/GraphTaskManager.java 
401e07bb346e8ac43600992718f1f231db85aa7c 
  giraph-core/src/main/java/org/apache/giraph/master/BspServiceMaster.java 
7ad290244946b89c6a563a86536d8e898e1c0aec 
  giraph-core/src/main/java/org/apache/giraph/utils/LoggerUtils.java 
81dfd1d8d9b27442543db1259235fb1825f72f7e 
  giraph-core/src/main/java/org/apache/giraph/worker/BspServiceWorker.java 
d5ad62b39517f06884ef1c003c5e91348cbe2459 
  
giraph-core/src/main/java/org/apache/giraph/worker/VertexInputSplitsCallable.java
 7522027b7306aff17668e17f4895a39e23e8a590 

Diff: https://reviews.apache.org/r/9138/diff/


Testing
-------

Passsed unittests and tested on a real cluster with PageRankBenchmark.


Thanks,

Avery Ching

Reply via email to