GitHub user dlogothetis opened a pull request: https://github.com/apache/giraph/pull/91
Remove the use of atomic counter in LongByteMappingStore

I don't see any particular value in keeping this counter, which incurs high overhead as it is accessed on every entry. There are better ways to update it as values are added, but I don't think it's worth it.

Tests
- mvn -Phadoop_facebook clean install
- mvn -Phadoop_2 clean install
- Ran jobs that read a mapping; this reduces the time to load the mapping by up to 50%.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/dlogothetis/giraph mapping_perf

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/giraph/pull/91.patch

To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message:

    This closes #91

----
commit db149d968790fb42f5a87e45245843c3005b79ae
Author: Dionysios Logothetis <dlogothetis@...>
Date: 2018-10-24T21:05:55Z

    Remove atomic counter overhead in LongByteMappingStore

commit 497278cdc8aa24a18ccdc3bb701939cbe8de6fb7
Author: Dionysios Logothetis <dlogothetis@...>
Date: 2018-10-24T21:55:41Z

    Get size from the right map

----
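
For readers unfamiliar with the pattern, here is a minimal sketch of the idea described in this pull request: dropping a per-insert atomic counter and reading the entry count from the backing map instead. This is not the actual LongByteMappingStore code. The class names, the `addEntry` and `getStats` methods, and the `ConcurrentMap<Long, Byte>` field are all hypothetical, chosen only for illustration.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical "before" store: bumps a shared atomic counter on every insert. */
class CountingMappingStore {
  private final ConcurrentMap<Long, Byte> idToTarget = new ConcurrentHashMap<>();
  private final AtomicLong numEntries = new AtomicLong();

  void addEntry(long vertexId, byte target) {
    idToTarget.put(vertexId, target);
    // Extra contended write on the hot path, once per entry.
    numEntries.incrementAndGet();
  }

  long getStats() {
    return numEntries.get();
  }
}

/** Hypothetical "after" store: no counter, the size is read from the map on demand. */
class SizeDerivedMappingStore {
  private final ConcurrentMap<Long, Byte> idToTarget = new ConcurrentHashMap<>();

  void addEntry(long vertexId, byte target) {
    // Only the map itself is touched while loading.
    idToTarget.put(vertexId, target);
  }

  long getStats() {
    // Asked rarely (for example once after loading), so the cost moves off the hot path.
    return idToTarget.size();
  }
}
```

One behavioral difference in this sketch: the counter version counts every call to `addEntry`, including repeated keys, while the size-derived version counts distinct keys. Also, `ConcurrentHashMap.size()` is only exact once concurrent writers have finished, which is usually fine for a statistic read after loading completes.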