[ https://issues.apache.org/jira/browse/HAMA-783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13860181#comment-13860181 ]
Edward J. Yoon commented on HAMA-783:
-------------------------------------

According to my tests, this patch reduces memory usage by about 20% to 60%. I'll commit this tomorrow.

> Efficient InMemory Storage for Vertices
> ---------------------------------------
>
>                 Key: HAMA-783
>                 URL: https://issues.apache.org/jira/browse/HAMA-783
>             Project: Hama
>          Issue Type: Improvement
>          Components: graph
>            Reporter: Edward J. Yoon
>             Fix For: 0.7.0
>
>         Attachments: patch.txt
>
>
> Currently there are ListVerticesInfo, DiskVerticesInfo and DirectMemory, but
> I personally think we have to do a big redesign of the vertices storage and
> the graph job runner.
> Actually, the size of a split is not that large, maybe 60 ~ 200MB. Hence, I
> don't think DiskVerticesInfo will be really helpful. Instead, we can keep the
> vertices in serialized form in memory, as Spark does.
> Update:
> 1) We also need to consider periodic checkpointing for fault tolerance.
> 2) If DiskVerticesInfo shows good performance, we can use just
> DiskVerticesInfo.
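
For illustration only, here is a minimal sketch of the "serialized in memory" idea from the description: vertices are appended to one byte buffer instead of being held as one Java object each. This is not the attached patch and not Hama's VerticesInfo API. The class name SerializedVertexStore, its methods, and the record layout (long id, double value, long edge targets) are all assumptions made up for this example.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not Hama code: all vertices live in one growing
// byte buffer rather than as individual objects on the heap.
class SerializedVertexStore {
  private final ByteArrayOutputStream buffer = new ByteArrayOutputStream();
  private final DataOutputStream out = new DataOutputStream(buffer);
  private int count = 0;

  // Append one vertex as: id, value, edge count, then each edge target id.
  void add(long id, double value, long[] edges) {
    try {
      out.writeLong(id);
      out.writeDouble(value);
      out.writeInt(edges.length);
      for (long target : edges) {
        out.writeLong(target);
      }
      count++;
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  // Number of vertices stored so far.
  int size() {
    return count;
  }

  // Bytes currently occupied by the serialized vertices.
  int bytesUsed() {
    return buffer.size();
  }

  // Walk the buffer in insertion order and collect the vertex ids,
  // skipping over each vertex's value and edge list.
  List<Long> readIds() {
    List<Long> ids = new ArrayList<>();
    try (DataInputStream in =
        new DataInputStream(new ByteArrayInputStream(buffer.toByteArray()))) {
      for (int i = 0; i < count; i++) {
        ids.add(in.readLong());
        in.readDouble();
        int edgeCount = in.readInt();
        for (int j = 0; j < edgeCount; j++) {
          in.readLong();
        }
      }
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
    return ids;
  }
}
```

In this layout a vertex costs 20 bytes plus 8 bytes per edge, with no per-object header. Note that readIds() copies the buffer via toByteArray(); a real implementation would presumably read in place, and would also need a way to update vertex values between supersteps, which this sketch does not attempt.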