[
https://issues.apache.org/jira/browse/HBASE-10074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13838398#comment-13838398
]
Nick Dimiduk commented on HBASE-10074:
--------------------------------------
I don't read DocBook, so I cannot comment on the markup. However, the
content is great! Here are some nits.
bq. <para>The master as is is allergic to tons of regions
Mind adding some JIRA references here?
bq. tons of regions on a few RS can cause the store file index to rise raising
heap usage and...
"store file index to rise raising"? This sentence is confusing me.
bq. Keeping 5 regions per RS would be too low for a job, whereas 1000 will
generate too many maps.
How about "Hosting only 5 regions per RS will not provide enough task splits
for a MapReduce job, while 1000 regions will generate far too many map tasks"?
In section {{<section xml:id="ops.capacity.regions"><title>Determining region
count and size</title>}} you suggest "20-200 regions per RS" but previously you
said "20-100".
+1
> consolidate and improve capacity/sizing documentation
> -----------------------------------------------------
>
> Key: HBASE-10074
> URL: https://issues.apache.org/jira/browse/HBASE-10074
> Project: HBase
> Issue Type: Improvement
> Components: documentation
> Reporter: Sergey Shelukhin
> Assignee: Sergey Shelukhin
> Attachments: HBASE-10074.patch
>
>
> Region count description is in config section; region size description is in
> architecture sections; both of these have a lot of good technical details,
> but imho we could do better in terms of admin-centric advice.
> Currently, there's a nearly-empty capacity section; I'd like to rewrite it to
> consolidate capacity planning/sizing/region sizing information, and some
> basic configuration pertaining to it.
--
This message was sent by Atlassian JIRA
(v6.1#6144)