Will Berkeley has posted comments on this change. ( http://gerrit.cloudera.org:8080/12119 )
Change subject: [blog] a blogpost about location awareness in Kudu ...................................................................... Patch Set 1: (46 comments) Gerrit gets funny when there's a lot of comments on one file, it seems, so I'm posting this first batch and I will continue reviewing later. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md File _posts/2018-12-20-location-awareness.md: http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@7 PS1, Line 7: a high-level overview An overview is always high-level, so remove "high-level". http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@9 PS1, Line 9: principles behind the design decisions principles of the design http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@11 PS1, Line 11: ways how can be extended and enhanced in the future potential future enhancements and extensions http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@17 PS1, Line 17: the Remove. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@17 PS1, Line 17: to fit for http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@20 PS1, Line 20: data replicas http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@20 PS1, Line 20: Having servers in multiple racks in a Kudu cluster In a Kudu cluster consisting of multiple servers spread over several racks http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@21 PS1, Line 21: 's data Remove. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@21 PS1, Line 21: replicated Remove. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@21 PS1, Line 21: in http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@22 PS1, Line 22: becomes become http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@21 PS1, Line 21: for : reading and writing Remove. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@22 PS1, Line 22: if any subset of servers even if all the servers http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@25 PS1, Line 25: In other words, a single rack failure should not lead to data unavailability : in a Kudu cluster. Remove (redundant). http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@28 PS1, Line 28: In that sense, a rack is a fault domain (usually, a fault domain : defined is a set of hardware components that share a single point of failure). I don't think we need to use this concept very much so I wouldn't include it. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@31 PS1, Line 31: To generalize, replace 'rack' with any other physical aggregation of nodes Join this paragraph with the previous paragraph. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@31 PS1, Line 31: To generalize nit: I prefer "More generally", but "To generalize" is ok. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@31 PS1, Line 31: physical Remove. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@32 PS1, Line 32: chassis Is a chassis a common term for something multiple servers would run on? To me it sounds synonymous with a single physical host, unless the cluster is running in VMs and some VMs share a physical host. If that's the case, I think we ought to say "shared VM host" or some such instead of "chassis". http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@32 PS1, Line 32: site Does a site differ from a datacenter? http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@33 PS1, Line 33: It might be applicable even to a datacenter : given that the network latency between datacenters is negligible : (which, frankly, isn't quite realistic). Rephrase this more positively: "This even applies to a datacenter if the network latency between datacenters is low." http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@37 PS1, Line 37: At this point, the location awareness feature is a work in progress and may be : released with Kudu 1.9.0. I'm not sure what our publishing schedule is, but I think we ought not publish this until the feature is sure to be released with the next release and the next release is coming in the next month or two. In that case, we shouldn't include this sentence, and we should mention the release the feature will first appear in above the fold. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@49 PS1, Line 49: exploiting to exploit http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@49 PS1, Line 49: for to establish http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@50 PS1, Line 50: other components of the Hadoop ecosystem (e.g., HDFS) I think it's just HDFS. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@57 PS1, Line 57: location a location http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@57 PS1, Line 57: leader master Actually all masters assign a location, but it's the leader master's location that is used in placement decisions, etc. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@57 PS1, Line 57: for to http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@57 PS1, Line 57: Kudu The http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@57 PS1, Line 57: upon its registration when it registers http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@58 PS1, Line 58: being registered Remove. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@58 PS1, Line 58: location the location http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@59 PS1, Line 59: the http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@60 PS1, Line 60: for the specified IP address/hostname Remove. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@60 PS1, Line 60: a the corresponding http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@62 PS1, Line 62: next registration of the tablet server the tablet server re-registers, which only occurs if the master or tablet server restarts. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@62 PS1, Line 62: The assigned : location is not reported back to the tablet server since Kudu tablet servers : are location-agnostic as of now nit: I would rephrase as "Kudu tablet servers are location agnostic, at least for now, so the assigned location is not reported back to the tablet server. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@64 PS1, Line 64: : masters . Masters http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@65 PS1, Line 65: internally Explain how, briefly (maybe by just pointing the reader to the section below). http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@65 PS1, Line 65: provide connected clients with that information as well Ditto. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@67 PS1, Line 67: Kudu leader master assigns location for a client upon connecting to : a cluster (i.e. when the client invokes the `ConnectToMaster` RPC). : The assigned location is sent back to the client along with other : information in response to `ConnectToMaster` RPC. I think this is too detailed, but if you want to include it mention the `ConnectToMaster` RPCs above when you first introduce how tablet servers are assigned location when they register. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@73 PS1, Line 73: `--location_mapping_cmd` master's Swap these two words. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@76 PS1, Line 76: Does this paragraph belong in the next section? It repeats part of it. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@84 PS1, Line 84: 3 (three) My preference is to use the word "three", not the symbol "3", but either is OK. Putting both is unnecessary, I think. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@97 PS1, Line 97: By design, keeping the target replication factor for tablets has higher I'd first restate that re-replication and the placement of new replicas attempts to spread replicas across locations so that the failure of tablet servers in one location does not make tablets unavailable. http://gerrit.cloudera.org:8080/#/c/12119/1/_posts/2018-12-20-location-awareness.md@113 PS1, Line 113: is are -- To view, visit http://gerrit.cloudera.org:8080/12119 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: kudu Gerrit-Branch: gh-pages Gerrit-MessageType: comment Gerrit-Change-Id: I10b30a80d5661fb889a11285b8118cdea1a87cd2 Gerrit-Change-Number: 12119 Gerrit-PatchSet: 1 Gerrit-Owner: Alexey Serbin <aser...@cloudera.com> Gerrit-Reviewer: Will Berkeley <wdberke...@gmail.com> Gerrit-Comment-Date: Wed, 26 Dec 2018 16:49:48 +0000 Gerrit-HasComments: Yes