Claudio Martella created GIRAPH-625:
---------------------------------------

             Summary: DiskBackedMessageStore can merge fileStores in the 
background.
                 Key: GIRAPH-625
                 URL: https://issues.apache.org/jira/browse/GIRAPH-625
             Project: Giraph
          Issue Type: Improvement
            Reporter: Claudio Martella


If the number of messages is large compared to the number of messages kept in 
memory by the DiskBackedMessageStore, it can result in a large number of files. 
Reading messages for each vertex, requires linearly scanning multiple files at 
the same time, hence producing a lot of seeks by the disk head.

While computing the vertices, as the messages for the next superstep flow in, 
we can keep the number of filestores low by merging them in the background with 
a thread. The procedure is similar to compaction in NoSQL stores, and the 
merging of the shuffle & sort of M/R.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to