[jira] [Commented] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity

2020-03-03 Thread Akshay Nehe (Jira)


[ 
https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17050705#comment-17050705
 ] 

Akshay Nehe commented on HADOOP-16899:
--

Thank you so much [~aajisaka] for guiding me through the process.

> Update HdfsDesign.md to reduce ambiguity
> 
>
> Key: HADOOP-16899
> URL: https://issues.apache.org/jira/browse/HADOOP-16899
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: documentation
>Reporter: Akshay Nehe
>Assignee: Akshay Nehe
>Priority: Minor
> Fix For: 3.3.0, 3.2.2
>
>
> A proposed update to 
> [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
>  in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
> last line.
> The sentence is leading to ambiguity of reader.
> Considering the statement segmented in 3 parts by the commas:
>  # the first part talks about "one thirds of replicas";
>  # the second part talks about "two thirds of replicas"
>  # the third part talking about "the other third" is leading to ambiguity 
> when one thirds and two thirds have already accounted for the whole.
> Proposed solution:
> Getting rid of the third part or rephrasing entire sentence to capture the 
> overall essence of the sentence.
> In other words, replacing 
> _One third of replicas are on one node, two thirds of replicas are on one 
> rack, and the other third are evenly distributed across the remaining racks._
> with
> -_One third of replicas are on one node, two thirds of replicas are on one 
> rack._-
> {quote}{{Two replicas are on different nodes of one rack and the remaining 
> replica is on a node of one of the other racks.}}
> {quote}
> In addition to this, found 2 more sentences in same paragraph that will be 
> corrected.
> 1.
> {quote}However, it does reduce the aggregate network bandwidth used when 
> reading data since a block is placed in only two unique racks rather than 
> three.
> {quote}
> will replace: does reduce -> does not reduce
> 2. 
> {quote}With this policy, the replicas of a file do not evenly distribute 
> across the racks.
> {quote}
> will replace: replicas of a file -> replicas of a block 
> Please suggest if any additional meaning is getting lost with this 
> replacement.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org



[jira] [Commented] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity

2020-03-03 Thread Akshay Nehe (Jira)


[ 
https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17050372#comment-17050372
 ] 

Akshay Nehe commented on HADOOP-16899:
--

Updated details (in Description) on 3 sentences that will be updated.

> Update HdfsDesign.md to reduce ambiguity
> 
>
> Key: HADOOP-16899
> URL: https://issues.apache.org/jira/browse/HADOOP-16899
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: documentation
>Reporter: Akshay Nehe
>Priority: Minor
>
> A proposed update to 
> [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
>  in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
> last line.
> The sentence is leading to ambiguity of reader.
> Considering the statement segmented in 3 parts by the commas:
>  # the first part talks about "one thirds of replicas";
>  # the second part talks about "two thirds of replicas"
>  # the third part talking about "the other third" is leading to ambiguity 
> when one thirds and two thirds have already accounted for the whole.
> Proposed solution:
> Getting rid of the third part or rephrasing entire sentence to capture the 
> overall essence of the sentence.
> In other words, replacing 
> _One third of replicas are on one node, two thirds of replicas are on one 
> rack, and the other third are evenly distributed across the remaining racks._
> with
> -_One third of replicas are on one node, two thirds of replicas are on one 
> rack._-
> {quote}{{Two replicas are on different nodes of one rack and the remaining 
> replica is on a node of one of the other racks.}}
> {quote}
> In addition to this, found 2 more sentences in same paragraph that will be 
> corrected.
> 1.
> {quote}However, it does reduce the aggregate network bandwidth used when 
> reading data since a block is placed in only two unique racks rather than 
> three.
> {quote}
> will replace: does reduce -> does not reduce
> 2. 
> {quote}With this policy, the replicas of a file do not evenly distribute 
> across the racks.
> {quote}
> will replace: replicas of a file -> replicas of a block 
> Please suggest if any additional meaning is getting lost with this 
> replacement.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org



[jira] [Updated] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity

2020-03-03 Thread Akshay Nehe (Jira)


 [ 
https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akshay Nehe updated HADOOP-16899:
-
Description: 
A proposed update to 
[https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
 in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
last line.

The sentence is leading to ambiguity of reader.

Considering the statement segmented in 3 parts by the commas:
 # the first part talks about "one thirds of replicas";
 # the second part talks about "two thirds of replicas"
 # the third part talking about "the other third" is leading to ambiguity when 
one thirds and two thirds have already accounted for the whole.

Proposed solution:

Getting rid of the third part or rephrasing entire sentence to capture the 
overall essence of the sentence.

In other words, replacing 

_One third of replicas are on one node, two thirds of replicas are on one rack, 
and the other third are evenly distributed across the remaining racks._

with

-_One third of replicas are on one node, two thirds of replicas are on one 
rack._-
{quote}{{Two replicas are on different nodes of one rack and the remaining 
replica is on a node of one of the other racks.}}
{quote}
In addition to this, found 2 more sentences in same paragraph that will be 
corrected.

1.
{quote}However, it does reduce the aggregate network bandwidth used when 
reading data since a block is placed in only two unique racks rather than three.
{quote}
will replace: does reduce -> does not reduce

2. 
{quote}With this policy, the replicas of a file do not evenly distribute across 
the racks.
{quote}
will replace: replicas of a file -> replicas of a block 

Please suggest if any additional meaning is getting lost with this replacement.

  was:
A proposed update to 
[https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
 in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
last line.

The sentence is leading to ambiguity of reader.

Considering the statement segmented in 3 parts by the commas:
 # the first part talks about "one thirds of replicas";
 # the second part talks about "two thirds of replicas"
 # the third part talking about "the other third" is leading to ambiguity when 
one thirds and two thirds have already accounted for the whole.

Proposed solution:

Getting rid of the third part or rephrasing entire sentence to capture the 
overall essence of the sentence.

In other words, replacing 

_One third of replicas are on one node, two thirds of replicas are on one rack, 
and the other third are evenly distributed across the remaining racks._

with

-_One third of replicas are on one node, two thirds of replicas are on one 
rack._-
{quote}{{Two replicas are on different nodes of one rack and the remaining 
replica is on a node of one of the other racks.}}
{quote}
In addition to this, found 2 more sentences in same paragraph that can be 
corrected.

1.
{quote}However, it does reduce the aggregate network bandwidth used when 
reading data since a block is placed in only two unique racks rather than three.
{quote}
will replace: does reduce -> does not reduce

2. 
{quote}With this policy, the replicas of a file do not evenly distribute across 
the racks.
{quote}
will replace: replicas of a file -> replicas of a block 

Please suggest if any additional meaning is getting lost with this replacement.


> Update HdfsDesign.md to reduce ambiguity
> 
>
> Key: HADOOP-16899
> URL: https://issues.apache.org/jira/browse/HADOOP-16899
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: documentation
>Reporter: Akshay Nehe
>Priority: Minor
>
> A proposed update to 
> [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
>  in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
> last line.
> The sentence is leading to ambiguity of reader.
> Considering the statement segmented in 3 parts by the commas:
>  # the first part talks about "one thirds of replicas";
>  # the second part talks about "two thirds of replicas"
>  # the third part talking about "the other third" is leading to ambiguity 
> when one thirds and two thirds have already accounted for the whole.
> Proposed solution:
> Getting rid of the third part or rephrasing entire sentence to capture the 
> overall essence of the sentence.
> In other words, replacing 
> _One third of replicas are on one node, two thirds of replicas are on one 
> rack, and the other third are evenly distributed across the remaining racks._
> with
> -_One third of replicas are on one node, two thirds of replicas are 

[jira] [Updated] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity

2020-03-03 Thread Akshay Nehe (Jira)


 [ 
https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akshay Nehe updated HADOOP-16899:
-
Description: 
A proposed update to 
[https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
 in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
last line.

The sentence is leading to ambiguity of reader.

Considering the statement segmented in 3 parts by the commas:
 # the first part talks about "one thirds of replicas";
 # the second part talks about "two thirds of replicas"
 # the third part talking about "the other third" is leading to ambiguity when 
one thirds and two thirds have already accounted for the whole.

Proposed solution:

Getting rid of the third part or rephrasing entire sentence to capture the 
overall essence of the sentence.

In other words, replacing 

_One third of replicas are on one node, two thirds of replicas are on one rack, 
and the other third are evenly distributed across the remaining racks._

with

-_One third of replicas are on one node, two thirds of replicas are on one 
rack._-
{quote}{{Two replicas are on different nodes of one rack and the remaining 
replica is on a node of one of the other racks.}}
{quote}
In addition to this, found 2 more sentences in same paragraph that can be 
corrected.

1.
{quote}However, it does reduce the aggregate network bandwidth used when 
reading data since a block is placed in only two unique racks rather than three.
{quote}
will replace: does reduce -> does not reduce

2. 
{quote}With this policy, the replicas of a file do not evenly distribute across 
the racks.
{quote}
will replace: replicas of a file -> replicas of a block 

Please suggest if any additional meaning is getting lost with this replacement.

  was:
A proposed update to 
[https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
 in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
last line.

The sentence is leading to ambiguity of reader.

Considering the statement segmented in 3 parts by the commas:
 # the first part talks about "one thirds of replicas";
 # the second part talks about "two thirds of replicas"
 # the third part talking about "the other third" is leading to ambiguity when 
one thirds and two thirds have already accounted for the whole.

Proposed solution:

Getting rid of the third part or rephrasing entire sentence to capture the 
overall essence of the sentence.

In other words, replacing 

_One third of replicas are on one node, two thirds of replicas are on one rack, 
and the other third are evenly distributed across the remaining racks._

with

_One third of replicas are on one node, two thirds of replicas are on one rack._

Please suggest if any additional meaning is getting lost with this replacement.


> Update HdfsDesign.md to reduce ambiguity
> 
>
> Key: HADOOP-16899
> URL: https://issues.apache.org/jira/browse/HADOOP-16899
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: documentation
>Reporter: Akshay Nehe
>Priority: Minor
>
> A proposed update to 
> [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
>  in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
> last line.
> The sentence is leading to ambiguity of reader.
> Considering the statement segmented in 3 parts by the commas:
>  # the first part talks about "one thirds of replicas";
>  # the second part talks about "two thirds of replicas"
>  # the third part talking about "the other third" is leading to ambiguity 
> when one thirds and two thirds have already accounted for the whole.
> Proposed solution:
> Getting rid of the third part or rephrasing entire sentence to capture the 
> overall essence of the sentence.
> In other words, replacing 
> _One third of replicas are on one node, two thirds of replicas are on one 
> rack, and the other third are evenly distributed across the remaining racks._
> with
> -_One third of replicas are on one node, two thirds of replicas are on one 
> rack._-
> {quote}{{Two replicas are on different nodes of one rack and the remaining 
> replica is on a node of one of the other racks.}}
> {quote}
> In addition to this, found 2 more sentences in same paragraph that can be 
> corrected.
> 1.
> {quote}However, it does reduce the aggregate network bandwidth used when 
> reading data since a block is placed in only two unique racks rather than 
> three.
> {quote}
> will replace: does reduce -> does not reduce
> 2. 
> {quote}With this policy, the replicas of a file do not evenly distribute 
> across the racks.
> {quote}
> will 

[jira] [Commented] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity

2020-03-02 Thread Akshay Nehe (Jira)


[ 
https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17049868#comment-17049868
 ] 

Akshay Nehe commented on HADOOP-16899:
--

Based on comment by [~aajisaka] on the PR, considering replacing the line with:

_Two replicas are on one rack, and the remaining replica is on one of the other 
racks._

instead of previously indicated update.

An additional recommended update will be included for the sentence 2 sentences 
before the previously mentioned.

Quoting Akira:

The following sentence is not directly related to your PR, however, it can be 
fixed at the same time.
{quote}However, it does reduce the aggregate network bandwidth used when 
reading data since a block is placed in only two unique racks rather than 
three. With this policy, the replicas of a file do not evenly distribute across 
the racks.
{quote} * it does reduce -> it does not reduce

If a block is placed in three unique racks, the probability of rack-local read 
will increase and the network bandwidth will be reduced when reading the data. 
Therefore I think 'does' should be changed to 'does not'.

 

Once decided to make both changes, will update the Jira Description.

> Update HdfsDesign.md to reduce ambiguity
> 
>
> Key: HADOOP-16899
> URL: https://issues.apache.org/jira/browse/HADOOP-16899
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: documentation
>Reporter: Akshay Nehe
>Priority: Minor
>
> A proposed update to 
> [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
>  in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
> last line.
> The sentence is leading to ambiguity of reader.
> Considering the statement segmented in 3 parts by the commas:
>  # the first part talks about "one thirds of replicas";
>  # the second part talks about "two thirds of replicas"
>  # the third part talking about "the other third" is leading to ambiguity 
> when one thirds and two thirds have already accounted for the whole.
> Proposed solution:
> Getting rid of the third part or rephrasing entire sentence to capture the 
> overall essence of the sentence.
> In other words, replacing 
> _One third of replicas are on one node, two thirds of replicas are on one 
> rack, and the other third are evenly distributed across the remaining racks._
> with
> _One third of replicas are on one node, two thirds of replicas are on one 
> rack._
> Please suggest if any additional meaning is getting lost with this 
> replacement.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org



[jira] [Updated] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity

2020-03-02 Thread Akshay Nehe (Jira)


 [ 
https://issues.apache.org/jira/browse/HADOOP-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akshay Nehe updated HADOOP-16899:
-
Description: 
A proposed update to 
[https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
 in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
last line.

The sentence is leading to ambiguity of reader.

Considering the statement segmented in 3 parts by the commas:
 # the first part talks about "one thirds of replicas";
 # the second part talks about "two thirds of replicas"
 # the third part talking about "the other third" is leading to ambiguity when 
one thirds and two thirds have already accounted for the whole.

Proposed solution:

Getting rid of the third part or rephrasing entire sentence to capture the 
overall essence of the sentence.

In other words, replacing 

_One third of replicas are on one node, two thirds of replicas are on one rack, 
and the other third are evenly distributed across the remaining racks._

with

_One third of replicas are on one node, two thirds of replicas are on one rack._

Please suggest if any additional meaning is getting lost with this replacement.

  was:
A proposed update to 
[https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com]
 in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
last line.

The sentence is leading to ambiguity of reader.

Proposed solution:

Getting rid of the third part of sentence (after 2nd comma) or rephrase entire 
sentence to capture the overall essence of the sentence.

In other words, replacing 

_One third of replicas are on one node, two thirds of replicas are on one rack, 
and the other third are evenly distributed across the remaining racks._

with

_One third of replicas are on one node, two thirds of replicas are on one rack._

Please suggest if any additional meaning is getting lost with this replacement.


> Update HdfsDesign.md to reduce ambiguity
> 
>
> Key: HADOOP-16899
> URL: https://issues.apache.org/jira/browse/HADOOP-16899
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: documentation
>Reporter: Akshay Nehe
>Priority: Minor
>
> A proposed update to 
> [https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com/]
>  in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
> last line.
> The sentence is leading to ambiguity of reader.
> Considering the statement segmented in 3 parts by the commas:
>  # the first part talks about "one thirds of replicas";
>  # the second part talks about "two thirds of replicas"
>  # the third part talking about "the other third" is leading to ambiguity 
> when one thirds and two thirds have already accounted for the whole.
> Proposed solution:
> Getting rid of the third part or rephrasing entire sentence to capture the 
> overall essence of the sentence.
> In other words, replacing 
> _One third of replicas are on one node, two thirds of replicas are on one 
> rack, and the other third are evenly distributed across the remaining racks._
> with
> _One third of replicas are on one node, two thirds of replicas are on one 
> rack._
> Please suggest if any additional meaning is getting lost with this 
> replacement.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org



[jira] [Created] (HADOOP-16899) Update HdfsDesign.md to reduce ambiguity

2020-03-02 Thread Akshay Nehe (Jira)
Akshay Nehe created HADOOP-16899:


 Summary: Update HdfsDesign.md to reduce ambiguity
 Key: HADOOP-16899
 URL: https://issues.apache.org/jira/browse/HADOOP-16899
 Project: Hadoop Common
  Issue Type: Improvement
  Components: documentation
Reporter: Akshay Nehe


A proposed update to 
[https://github.com/apache/hadoop/blob/trunk/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/HdfsDesign.md|http://example.com]
 in the section "Replica Placement: The First Baby Steps" 4th paragraph, 2nd 
last line.

The sentence is leading to ambiguity of reader.

Proposed solution:

Getting rid of the third part of sentence (after 2nd comma) or rephrase entire 
sentence to capture the overall essence of the sentence.

In other words, replacing 

_One third of replicas are on one node, two thirds of replicas are on one rack, 
and the other third are evenly distributed across the remaining racks._

with

_One third of replicas are on one node, two thirds of replicas are on one rack._

Please suggest if any additional meaning is getting lost with this replacement.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org