[ https://issues.apache.org/jira/browse/HAMA-531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225870#comment-13225870 ]
Edward J. Yoon commented on HAMA-531: ------------------------------------- In my opinion, each task should load the locally assigned data and transfer the data with optimized way across the network. > Data re-partitioning in BSPJobClient > ------------------------------------ > > Key: HAMA-531 > URL: https://issues.apache.org/jira/browse/HAMA-531 > Project: Hama > Issue Type: Improvement > Reporter: Edward J. Yoon > > The re-partitioning the data is a very expensive operation. By the way, > currently, we processes read/write operations sequentially using HDFS api in > BSPJobClient from client-side. This causes potential too many open files > error, contains HDFS overheads, and shows slow performance. > We have to find another way to re-partitioning data. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira