keith-turner commented on a change in pull request #46: ACCUMULO-4752 Create documentation on improving performance
URL: https://github.com/apache/accumulo-website/pull/46#discussion_r155314922

 ##########
 File path: _docs-2-0/troubleshooting/peformance.md
 ##########
 @@ -0,0 +1,39 @@
 +---
 +title: Performance
 +category: troubleshooting
 +order: 5
 +---
 +
 +Accumulo can be tuned to improve read and write performance.
 +
 +## Read performance
 +
 +1. Enable [caching] on tables to reduce reads to disk.
 +
 +1. Enable [bloom filters][bloom-filters] on tables to limit the number of disk lookups.
 +
 +1. Decrease the [major compaction ratio][compaction] of a table to decrease the number of
 +   files per tablet. Less files reduces the latency of reads.
 +
 +## Write performance
 +
 +1. Enable [native maps][native-maps] on tablet servers to prevent Java garbage collection pauses
 +   which can slow ingest.
 +
 +1. [Pre-split new tables][split] to distribute writes across multiple tablet servers.
 +
 +1. Ingest data using [multiple clients][multi-client] or [bulk ingest][bulk] to increase ingest throughput.
 +
 +1. Increase the [major compaction ratio][compaction] of a table to limit the number of major compactions
 +   which improves ingest peformance.
 +

 Review comment:
   Setting `table.file.compress.type=snappy` can increase write speed at the expense of using more disk space.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

With regards,
Apache Git Services
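
For reference, the setting suggested in the review comment can be applied per table from the Accumulo shell. This is a minimal sketch, not part of the pull request: the table name `mytable` is hypothetical, and it assumes the Snappy codec is available on the tablet servers.

```
# Switch the table's file compression codec to snappy (table name is hypothetical)
config -t mytable -s table.file.compress.type=snappy

# Show the current value of the property to confirm the change
config -t mytable -f table.file.compress.type
```

The property affects files written after the change; files already on disk keep their existing compression until they are rewritten by a compaction.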
