petersomogyi commented on code in PR #7155: URL: https://github.com/apache/hbase/pull/7155#discussion_r2215295859
########## src/main/asciidoc/_chapters/architecture.adoc: ########## @@ -781,15 +781,15 @@ You can supplement a scan (both bounded and unbounded) with RowFilter constructe ==== FirstKeyOnlyFilter This is primarily used for rowcount jobs. -See link:https://hbase.apache.org/apidocs/org/apache/hadoop/hbase/filter/FirstKeyOnlyFilter.html[FirstKeyOnlyFilter]. +See link:https://hbase.apache.org/devapidocs/org/apache/hadoop/hbase/filter/FirstKeyOnlyFilter.html[FirstKeyOnlyFilter]. [[architecture.master]] == Master `HMaster` is the implementation of the Master Server. The Master server is responsible for monitoring all RegionServer instances in the cluster, and is the interface for all metadata changes. In a distributed cluster, the Master typically runs on the <<arch.hdfs.nn>>. -J Mohamed Zahoor goes into some more detail on the Master Architecture in this blog posting, link:http://blog.zahoor.in/2012/08/hbase-hmaster-architecture/[HBase HMaster Architecture ]. +J Mohamed Zahoor goes into some more detail on the Master Architecture in this blog posting, link:http://web.archive.org/web/20191211053128/http://blog.zahoor.in:80/2012/08/hbase-hmaster-architecture/[HBase HMaster Architecture ]. Review Comment: ```suggestion J Mohamed Zahoor goes into some more detail on the Master Architecture in this blog posting, link:https://web.archive.org/web/20191211053128/http://blog.zahoor.in/2012/08/hbase-hmaster-architecture/[HBase HMaster Architecture]. ``` ########## src/main/asciidoc/_chapters/architecture.adoc: ########## @@ -1129,7 +1129,7 @@ If the BucketCache is deployed in off-heap mode, this memory is not managed by t This is why you'd use BucketCache in pre-2.0.0, so your latencies are less erratic, to mitigate GCs and heap fragmentation, and so you can safely use more memory. See Nick Dimiduk's link:http://www.n10k.com/blog/blockcache-101/[BlockCache 101] for comparisons running on-heap vs off-heap tests. -Also see link:https://people.apache.org/~stack/bc/[Comparing BlockCache Deploys] which finds that if your dataset fits inside your LruBlockCache deploy, use it otherwise if you are experiencing cache churn (or you want your cache to exist beyond the vagaries of java GC), use BucketCache. +Also see link:http://web.archive.org/web/20231109025243/http://people.apache.org/~stack/bc/[Comparing BlockCache Deploys] which finds that if your dataset fits inside your LruBlockCache deploy, use it otherwise if you are experiencing cache churn (or you want your cache to exist beyond the vagaries of java GC), use BucketCache. Review Comment: ```suggestion Also see link:https://web.archive.org/web/20231109025243/http://people.apache.org/~stack/bc/[Comparing BlockCache Deploys] which finds that if your dataset fits inside your LruBlockCache deploy, use it otherwise if you are experiencing cache churn (or you want your cache to exist beyond the vagaries of java GC), use BucketCache. ``` ########## src/main/asciidoc/_chapters/architecture.adoc: ########## @@ -1239,7 +1239,7 @@ The value allocated by `MaxDirectMemorySize` must not exceed physical RAM, and i You can see how much memory -- on-heap and off-heap/direct -- a RegionServer is configured to use and how much it is using at any one time by looking at the _Server Metrics: Memory_ tab in the UI. It can also be gotten via JMX. In particular the direct memory currently used by the server can be found on the `java.nio.type=BufferPool,name=direct` bean. -Terracotta has a link:http://terracotta.org/documentation/4.0/bigmemorygo/configuration/storage-options[good write up] on using off-heap memory in Java. +Terracotta has a link:https://web.archive.org/web/20170907032911/http://terracotta.org:80/documentation/4.0/bigmemorygo/configuration/storage-options[good write up] on using off-heap memory in Java. Review Comment: ```suggestion Terracotta has a link:https://web.archive.org/web/20170907032911/http://terracotta.org/documentation/4.0/bigmemorygo/configuration/storage-options[good write up] on using off-heap memory in Java. ``` ########## src/main/asciidoc/_chapters/configuration.adoc: ########## @@ -1264,7 +1264,7 @@ idea on the size you need by surveying RegionServer UIs; you'll see index block the top of the webpage). [[nagles]] -==== link:http://en.wikipedia.org/wiki/Nagle's_algorithm[Nagle's] or the small package problem +==== link:http://en.wikipedia.org/wiki/Nagle%27s_algorithm[Nagle's] or the small package problem Review Comment: ```suggestion ==== link:https://en.wikipedia.org/wiki/Nagle%27s_algorithm[Nagle's] or the small package problem ``` ########## src/main/asciidoc/_chapters/unit_testing.adoc: ########## @@ -388,4 +388,4 @@ A record is inserted, a Get is performed from the same table, and the insertion NOTE: Starting the mini-cluster takes about 20-30 seconds, but that should be appropriate for integration testing. -See the paper at link:http://blog.sematext.com/2010/08/30/hbase-case-study-using-hbasetestingutility-for-local-testing-development/[HBase Case-Study: Using HBaseTestingUtility for Local Testing and Development] (2010) for more information about HBaseTestingUtility. +See the paper at link:https://web.archive.org/web/20160214165147/http://blog.sematext.com:80/2010/08/30/hbase-case-study-using-hbasetestingutility-for-local-testing-development/[HBase Case-Study: Using HBaseTestingUtility for Local Testing and Development] (2010) for more information about HBaseTestingUtility. Review Comment: ```suggestion See the paper at link:https://web.archive.org/web/20160214165147/http://blog.sematext.com/2010/08/30/hbase-case-study-using-hbasetestingutility-for-local-testing-development/[HBase Case-Study: Using HBaseTestingUtility for Local Testing and Development] (2010) for more information about HBaseTestingUtility. ``` ########## src/main/asciidoc/_chapters/case_studies.adoc: ########## @@ -160,7 +160,7 @@ Investigation results of a self-described "we're not sure what's wrong, but it s === Case Study #3 (Performance Research 2010)) Investigation results of general cluster performance from 2010. -Although this research is on an older version of the codebase, this writeup is still very useful in terms of approach. http://hstack.org/hbase-performance-testing/ +Although this research is on an older version of the codebase, this writeup is still very useful in terms of approach. https://web.archive.org/web/20180503124332/http://hstack.org:80/hbase-performance-testing/ Review Comment: ```suggestion Although this research is on an older version of the codebase, this writeup is still very useful in terms of approach. https://web.archive.org/web/20180503124332/http://hstack.org/hbase-performance-testing/ ``` ########## src/main/asciidoc/_chapters/configuration.adoc: ########## @@ -593,7 +593,7 @@ Pseudo-distributed mode can run against the local filesystem or it can run again the _Hadoop Distributed File System_ (HDFS). Fully-distributed mode can ONLY run on HDFS. See the Hadoop link:https://hadoop.apache.org/docs/current/[documentation] for how to set up HDFS. A good walk-through for setting up HDFS on Hadoop 2 can be found at -http://www.alexjf.net/blog/distributed-systems/hadoop-yarn-installation-definitive-guide. +http://web.archive.org/web/20221007121526/https://www.alexjf.net/blog/distributed-systems/hadoop-yarn-installation-definitive-guide/. Review Comment: ```suggestion https://web.archive.org/web/20221007121526/https://www.alexjf.net/blog/distributed-systems/hadoop-yarn-installation-definitive-guide/. ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
