[jira] [Commented] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity
[ https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17050705#comment-17050705 ] Akshay Nehe commented on HADOOP-16899: -- Thank you so much [~aajisaka] for guiding me through the process. > Update HdfsDesign.md to reduce ambiguity > > > Key: HADOOP-16899 > URL: https://issues.apache.org/jira/browse/HADOOP-16899 > Project: Hadoop Common > Issue Type: Improvement > Components: documentation >Reporter: Akshay Nehe >Assignee: Akshay Nehe >Priority: Minor > Fix For: 3.3.0, 3.2.2 > > > A proposed update to > [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] > in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd > last line. > The sentence is leading to ambiguity of reader. > Considering the statement segmented in 3 parts by the commas: > # the first part talks about "one thirds of replicas"; > # the second part talks about "two thirds of replicas" > # the third part talking about "the other third" is leading to ambiguity > when one thirds and two thirds have already accounted for the whole. > Proposed solution: > Getting rid of the third part or rephrasing entire sentence to capture the > overall essence of the sentence. > In other words, replacing > _One third of replicas are on one node, two thirds of replicas are on one > rack, and the other third are evenly distributed across the remaining racks._ > with > -_One third of replicas are on one node, two thirds of replicas are on one > rack._- > {quote}{{Two replicas are on different nodes of one rack and the remaining > replica is on a node of one of the other racks.}} > {quote} > In addition to this, found 2 more sentences in same paragraph that will be > corrected. > 1. > {quote}However, it does reduce the aggregate network bandwidth used when > reading data since a block is placed in only two unique racks rather than > three. > {quote} > will replace: does reduce -> does not reduce > 2. > {quote}With this policy, the replicas of a file do not evenly distribute > across the racks. > {quote} > will replace: replicas of a file -> replicas of a block > Please suggest if any additional meaning is getting lost with this > replacement. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Commented] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity
[ https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17050372#comment-17050372 ] Akshay Nehe commented on HADOOP-16899: -- Updated details (in Description) on 3 sentences that will be updated. > Update HdfsDesign.md to reduce ambiguity > > > Key: HADOOP-16899 > URL: https://issues.apache.org/jira/browse/HADOOP-16899 > Project: Hadoop Common > Issue Type: Improvement > Components: documentation >Reporter: Akshay Nehe >Priority: Minor > > A proposed update to > [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] > in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd > last line. > The sentence is leading to ambiguity of reader. > Considering the statement segmented in 3 parts by the commas: > # the first part talks about "one thirds of replicas"; > # the second part talks about "two thirds of replicas" > # the third part talking about "the other third" is leading to ambiguity > when one thirds and two thirds have already accounted for the whole. > Proposed solution: > Getting rid of the third part or rephrasing entire sentence to capture the > overall essence of the sentence. > In other words, replacing > _One third of replicas are on one node, two thirds of replicas are on one > rack, and the other third are evenly distributed across the remaining racks._ > with > -_One third of replicas are on one node, two thirds of replicas are on one > rack._- > {quote}{{Two replicas are on different nodes of one rack and the remaining > replica is on a node of one of the other racks.}} > {quote} > In addition to this, found 2 more sentences in same paragraph that will be > corrected. > 1. > {quote}However, it does reduce the aggregate network bandwidth used when > reading data since a block is placed in only two unique racks rather than > three. > {quote} > will replace: does reduce -> does not reduce > 2. > {quote}With this policy, the replicas of a file do not evenly distribute > across the racks. > {quote} > will replace: replicas of a file -> replicas of a block > Please suggest if any additional meaning is getting lost with this > replacement. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity
[ https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akshay Nehe updated HADOOP-16899: - Description: A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line. The sentence is leading to ambiguity of reader. Considering the statement segmented in 3 parts by the commas: # the first part talks about "one thirds of replicas"; # the second part talks about "two thirds of replicas" # the third part talking about "the other third" is leading to ambiguity when one thirds and two thirds have already accounted for the whole. Proposed solution: Getting rid of the third part or rephrasing entire sentence to capture the overall essence of the sentence. In other words, replacing _One third of replicas are on one node, two thirds of replicas are on one rack, and the other third are evenly distributed across the remaining racks._ with -_One third of replicas are on one node, two thirds of replicas are on one rack._- {quote}{{Two replicas are on different nodes of one rack and the remaining replica is on a node of one of the other racks.}} {quote} In addition to this, found 2 more sentences in same paragraph that will be corrected. 1. {quote}However, it does reduce the aggregate network bandwidth used when reading data since a block is placed in only two unique racks rather than three. {quote} will replace: does reduce -> does not reduce 2. {quote}With this policy, the replicas of a file do not evenly distribute across the racks. {quote} will replace: replicas of a file -> replicas of a block Please suggest if any additional meaning is getting lost with this replacement. was: A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line. The sentence is leading to ambiguity of reader. Considering the statement segmented in 3 parts by the commas: # the first part talks about "one thirds of replicas"; # the second part talks about "two thirds of replicas" # the third part talking about "the other third" is leading to ambiguity when one thirds and two thirds have already accounted for the whole. Proposed solution: Getting rid of the third part or rephrasing entire sentence to capture the overall essence of the sentence. In other words, replacing _One third of replicas are on one node, two thirds of replicas are on one rack, and the other third are evenly distributed across the remaining racks._ with -_One third of replicas are on one node, two thirds of replicas are on one rack._- {quote}{{Two replicas are on different nodes of one rack and the remaining replica is on a node of one of the other racks.}} {quote} In addition to this, found 2 more sentences in same paragraph that can be corrected. 1. {quote}However, it does reduce the aggregate network bandwidth used when reading data since a block is placed in only two unique racks rather than three. {quote} will replace: does reduce -> does not reduce 2. {quote}With this policy, the replicas of a file do not evenly distribute across the racks. {quote} will replace: replicas of a file -> replicas of a block Please suggest if any additional meaning is getting lost with this replacement. > Update HdfsDesign.md to reduce ambiguity > > > Key: HADOOP-16899 > URL: https://issues.apache.org/jira/browse/HADOOP-16899 > Project: Hadoop Common > Issue Type: Improvement > Components: documentation >Reporter: Akshay Nehe >Priority: Minor > > A proposed update to > [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] > in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd > last line. > The sentence is leading to ambiguity of reader. > Considering the statement segmented in 3 parts by the commas: > # the first part talks about "one thirds of replicas"; > # the second part talks about "two thirds of replicas" > # the third part talking about "the other third" is leading to ambiguity > when one thirds and two thirds have already accounted for the whole. > Proposed solution: > Getting rid of the third part or rephrasing entire sentence to capture the > overall essence of the sentence. > In other words, replacing > _One third of replicas are on one node, two thirds of replicas are on one > rack, and the other third are evenly distributed across the remaining racks._ > with > -_One third of replicas are on one node, two thirds of replicas are
[jira] [Updated] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity
[ https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akshay Nehe updated HADOOP-16899: - Description: A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line. The sentence is leading to ambiguity of reader. Considering the statement segmented in 3 parts by the commas: # the first part talks about "one thirds of replicas"; # the second part talks about "two thirds of replicas" # the third part talking about "the other third" is leading to ambiguity when one thirds and two thirds have already accounted for the whole. Proposed solution: Getting rid of the third part or rephrasing entire sentence to capture the overall essence of the sentence. In other words, replacing _One third of replicas are on one node, two thirds of replicas are on one rack, and the other third are evenly distributed across the remaining racks._ with -_One third of replicas are on one node, two thirds of replicas are on one rack._- {quote}{{Two replicas are on different nodes of one rack and the remaining replica is on a node of one of the other racks.}} {quote} In addition to this, found 2 more sentences in same paragraph that can be corrected. 1. {quote}However, it does reduce the aggregate network bandwidth used when reading data since a block is placed in only two unique racks rather than three. {quote} will replace: does reduce -> does not reduce 2. {quote}With this policy, the replicas of a file do not evenly distribute across the racks. {quote} will replace: replicas of a file -> replicas of a block Please suggest if any additional meaning is getting lost with this replacement. was: A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line. The sentence is leading to ambiguity of reader. Considering the statement segmented in 3 parts by the commas: # the first part talks about "one thirds of replicas"; # the second part talks about "two thirds of replicas" # the third part talking about "the other third" is leading to ambiguity when one thirds and two thirds have already accounted for the whole. Proposed solution: Getting rid of the third part or rephrasing entire sentence to capture the overall essence of the sentence. In other words, replacing _One third of replicas are on one node, two thirds of replicas are on one rack, and the other third are evenly distributed across the remaining racks._ with _One third of replicas are on one node, two thirds of replicas are on one rack._ Please suggest if any additional meaning is getting lost with this replacement. > Update HdfsDesign.md to reduce ambiguity > > > Key: HADOOP-16899 > URL: https://issues.apache.org/jira/browse/HADOOP-16899 > Project: Hadoop Common > Issue Type: Improvement > Components: documentation >Reporter: Akshay Nehe >Priority: Minor > > A proposed update to > [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] > in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd > last line. > The sentence is leading to ambiguity of reader. > Considering the statement segmented in 3 parts by the commas: > # the first part talks about "one thirds of replicas"; > # the second part talks about "two thirds of replicas" > # the third part talking about "the other third" is leading to ambiguity > when one thirds and two thirds have already accounted for the whole. > Proposed solution: > Getting rid of the third part or rephrasing entire sentence to capture the > overall essence of the sentence. > In other words, replacing > _One third of replicas are on one node, two thirds of replicas are on one > rack, and the other third are evenly distributed across the remaining racks._ > with > -_One third of replicas are on one node, two thirds of replicas are on one > rack._- > {quote}{{Two replicas are on different nodes of one rack and the remaining > replica is on a node of one of the other racks.}} > {quote} > In addition to this, found 2 more sentences in same paragraph that can be > corrected. > 1. > {quote}However, it does reduce the aggregate network bandwidth used when > reading data since a block is placed in only two unique racks rather than > three. > {quote} > will replace: does reduce -> does not reduce > 2. > {quote}With this policy, the replicas of a file do not evenly distribute > across the racks. > {quote} > will
[jira] [Commented] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity
[ https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17049868#comment-17049868 ] Akshay Nehe commented on HADOOP-16899: -- Based on comment by [~aajisaka] on the PR, considering replacing the line with: _Two replicas are on one rack, and the remaining replica is on one of the other racks._ instead of previously indicated update. An additional recommended update will be included for the sentence 2 sentences before the previously mentioned. Quoting Akira: The following sentence is not directly related to your PR, however, it can be fixed at the same time. {quote}However, it does reduce the aggregate network bandwidth used when reading data since a block is placed in only two unique racks rather than three. With this policy, the replicas of a file do not evenly distribute across the racks. {quote} * it does reduce -> it does not reduce If a block is placed in three unique racks, the probability of rack-local read will increase and the network bandwidth will be reduced when reading the data. Therefore I think 'does' should be changed to 'does not'. Once decided to make both changes, will update the Jira Description. > Update HdfsDesign.md to reduce ambiguity > > > Key: HADOOP-16899 > URL: https://issues.apache.org/jira/browse/HADOOP-16899 > Project: Hadoop Common > Issue Type: Improvement > Components: documentation >Reporter: Akshay Nehe >Priority: Minor > > A proposed update to > [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] > in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd > last line. > The sentence is leading to ambiguity of reader. > Considering the statement segmented in 3 parts by the commas: > # the first part talks about "one thirds of replicas"; > # the second part talks about "two thirds of replicas" > # the third part talking about "the other third" is leading to ambiguity > when one thirds and two thirds have already accounted for the whole. > Proposed solution: > Getting rid of the third part or rephrasing entire sentence to capture the > overall essence of the sentence. > In other words, replacing > _One third of replicas are on one node, two thirds of replicas are on one > rack, and the other third are evenly distributed across the remaining racks._ > with > _One third of replicas are on one node, two thirds of replicas are on one > rack._ > Please suggest if any additional meaning is getting lost with this > replacement. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Updated] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity
[ https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akshay Nehe updated HADOOP-16899: - Description: A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line. The sentence is leading to ambiguity of reader. Considering the statement segmented in 3 parts by the commas: # the first part talks about "one thirds of replicas"; # the second part talks about "two thirds of replicas" # the third part talking about "the other third" is leading to ambiguity when one thirds and two thirds have already accounted for the whole. Proposed solution: Getting rid of the third part or rephrasing entire sentence to capture the overall essence of the sentence. In other words, replacing _One third of replicas are on one node, two thirds of replicas are on one rack, and the other third are evenly distributed across the remaining racks._ with _One third of replicas are on one node, two thirds of replicas are on one rack._ Please suggest if any additional meaning is getting lost with this replacement. was: A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com] in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line. The sentence is leading to ambiguity of reader. Proposed solution: Getting rid of the third part of sentence (after 2nd comma) or rephrase entire sentence to capture the overall essence of the sentence. In other words, replacing _One third of replicas are on one node, two thirds of replicas are on one rack, and the other third are evenly distributed across the remaining racks._ with _One third of replicas are on one node, two thirds of replicas are on one rack._ Please suggest if any additional meaning is getting lost with this replacement. > Update HdfsDesign.md to reduce ambiguity > > > Key: HADOOP-16899 > URL: https://issues.apache.org/jira/browse/HADOOP-16899 > Project: Hadoop Common > Issue Type: Improvement > Components: documentation >Reporter: Akshay Nehe >Priority: Minor > > A proposed update to > [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/] > in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd > last line. > The sentence is leading to ambiguity of reader. > Considering the statement segmented in 3 parts by the commas: > # the first part talks about "one thirds of replicas"; > # the second part talks about "two thirds of replicas" > # the third part talking about "the other third" is leading to ambiguity > when one thirds and two thirds have already accounted for the whole. > Proposed solution: > Getting rid of the third part or rephrasing entire sentence to capture the > overall essence of the sentence. > In other words, replacing > _One third of replicas are on one node, two thirds of replicas are on one > rack, and the other third are evenly distributed across the remaining racks._ > with > _One third of replicas are on one node, two thirds of replicas are on one > rack._ > Please suggest if any additional meaning is getting lost with this > replacement. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org
[jira] [Created] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity
Akshay Nehe created HADOOP-16899: Summary: Update HdfsDesign.md to reduce ambiguity Key: HADOOP-16899 URL: https://issues.apache.org/jira/browse/HADOOP-16899 Project: Hadoop Common Issue Type: Improvement Components: documentation Reporter: Akshay Nehe A proposed update to [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com] in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd last line. The sentence is leading to ambiguity of reader. Proposed solution: Getting rid of the third part of sentence (after 2nd comma) or rephrase entire sentence to capture the overall essence of the sentence. In other words, replacing _One third of replicas are on one node, two thirds of replicas are on one rack, and the other third are evenly distributed across the remaining racks._ with _One third of replicas are on one node, two thirds of replicas are on one rack._ Please suggest if any additional meaning is getting lost with this replacement. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org