keith-turner commented on a change in pull request #46: ACCUMULO-4752 Create documentation on improving performance URL: https://github.com/apache/accumulo-website/pull/46#discussion_r155314670
########## File path: _docs-2-0/troubleshooting/peformance.md ########## @@ -0,0 +1,39 @@ +--- +title: Performance +category: troubleshooting +order: 5 +--- + +Accumulo can be tuned to improve read and write performance. + +## Read performance + +1. Enable [caching] on tables to reduce reads to disk. + +1. Enable [bloom filters][bloom-filters] on tables to limit the number of disk lookups. + +1. Decrease the [major compaction ratio][compaction] of a table to decrease the number of + files per tablet. Less files reduces the latency of reads. + Review comment: Adjusting `table.file.compress.blocksize` and `table.file.compress.blocksize.index` can also be helpful. Lowering `table.file.compress.blocksize` can result in better random seek performance. However it increases the index size in the file. If the indexes are too large to fit in cache, this can hinder performance. Also, as the index size increases the depth of the index tree in each file may increase. Increasing `table.file.compress.blocksize.index` can reduce the depth of the tree. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
