[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Hadoop Flags: Reviewed > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Fix For: 3.1.0 > > Attachments: HDFS-12778-HDFS-9806.001.patch, > HDFS-12778-HDFS-9806.002.patch, HDFS-12778-HDFS-9806.003.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Resolution: Fixed Status: Resolved (was: Patch Available) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch, > HDFS-12778-HDFS-9806.002.patch, HDFS-12778-HDFS-9806.003.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Patch Available (was: Open) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch, > HDFS-12778-HDFS-9806.002.patch, HDFS-12778-HDFS-9806.003.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Open (was: Patch Available) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch, > HDFS-12778-HDFS-9806.002.patch, HDFS-12778-HDFS-9806.003.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Attachment: (was: HDFS-12778-HDFS-9806.003.patch) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch, > HDFS-12778-HDFS-9806.002.patch, HDFS-12778-HDFS-9806.003.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Attachment: HDFS-12778-HDFS-9806.003.patch Updated patch fixing the findbugs and checkstyle issues. The failed tests pass locally except {{TestCheckpoint}}, which is unrelated. > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch, > HDFS-12778-HDFS-9806.002.patch, HDFS-12778-HDFS-9806.003.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Open (was: Patch Available) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch, > HDFS-12778-HDFS-9806.002.patch, HDFS-12778-HDFS-9806.003.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Patch Available (was: Open) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch, > HDFS-12778-HDFS-9806.002.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Attachment: HDFS-12778-HDFS-9806.002.patch Thanks for taking a look [~elgoiri]. Posting a new patch with the additional test cases ({{testNumberOfProvidedLocations}} and {{testNumberOfProvidedLocationsManyBlocks}}). > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch, > HDFS-12778-HDFS-9806.002.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Open (was: Patch Available) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Attachment: HDFS-12778-HDFS-9806.001.patch > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Open (was: Patch Available) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Attachment: (was: HDFS-12778-HDFS-9806.001.patch) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Patch Available (was: Open) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Patch Available (was: Open) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Open (was: Patch Available) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Attachment: HDFS-12778-HDFS-9806.001.patch Attaching a patch where {{getBlockLocations}} on a PROVIDED file will return the default number of replicas configured ({{dfs.replication}}). More precisely, the number of locations returned for PROVIDED files = (number of local replicas) + min({{dfs.replication}} - (number of local replicas), number of datanodes configured with PROVIDED storage type). The patch also fixes the affected unit tests in {{TestNameNodeProvidedImplementation}}. > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-12778) [READ] Report multiple locations for PROVIDED blocks
[ https://issues.apache.org/jira/browse/HDFS-12778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Virajith Jalaparti updated HDFS-12778: -- Status: Patch Available (was: Open) > [READ] Report multiple locations for PROVIDED blocks > > > Key: HDFS-12778 > URL: https://issues.apache.org/jira/browse/HDFS-12778 > Project: Hadoop HDFS > Issue Type: Sub-task >Reporter: Virajith Jalaparti >Assignee: Virajith Jalaparti > Attachments: HDFS-12778-HDFS-9806.001.patch > > > On {{getBlockLocations}}, only one Datanode is returned as the location for > all PROVIDED blocks. This can hurt the performance of applications which > typically 3 locations per block. We need to return multiple Datanodes for > each PROVIDED block for better application performance/resilience. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org