Hej, My name is Mohit Daga. I am a PhD student at KTH - Sweden. I study theory of distributed graph algorithms. In the theory community, a model called the Congested Clique model [0] is exactly similar to the vertex centric model of Giraph. Through this email, I would like to pitch a project idea for this year's GSOC. In the year 2013, I did a GSOC project with another org: BRLCAD [1]. I discussed doing a project with Giraph last year too, but due to a pandemic related issue, I could not finish the proposal. I sincerely apologise for that.
The main problems that are studied in the theory of distributed graph algorithms - community (henceforth, theory community) are minimum cuts, the shortest paths, minimum spanning trees etc. Now choosing "the one" out of them is tricky, unless we find some applications. I discussed this last year with *Dionysios* and *Claudio*. I got some suggestions from them. The shortest path problem has an application in **designing recommender system** [2]. Plus additionally, we already have many results in the theory community for the shortest path problems in the congested clique model (the model that is vertex centric, and exactly similar to Giraph). **To add to the excitement, a 2021 results claim O(log^2 log n) round algorithm [3] for all pair the shortest path (APSP)**. Here, rounds are similar to the iterations. My work is mainly in the theoretical side. My PhD graduation is around November 2023. IMHO, the recent papers appearing in A* conferences in theory community (eg. PODC, STOC, FOCS, and SODA), in relation with the distributed graph algorithm could lead to some very good practical results. A lot of them, I believe, have not been tried (happy to be corrected), also they are very recent. A result [5] on APSP is so simple, and uses **randomization** to give provable guarantees on finding weighted APSP. Though appears for a different model. In this email, I just wanted to basically say a "Hi". I shall work more towards preparing an application, in the coming weeks. In the meanwhile, any comments or suggestions would be super useful. Cheers! Mohit Daga [0] https://www.cs.tau.ac.il/~roshman/papers/podc14_clique.pdf [1] https://brlcad.org/wiki/User:Level_zero/index [2] https://towardsdatascience.com/shortest-path-similarity-a-fresh-breath-to-item-based-recommendations-9ac3d6ba7240 [3] https://dl.acm.org/doi/pdf/10.1145/3409964.3461784 [4] https://giraph.apache.org/intro.html [5] https://arxiv.org/pdf/1811.03337.pdf [6] PODC https://www.podc.org/ STOC http://acm-stoc.org/ FOCS: http://ieee-focs.org/ SODA https://www.siam.org/conferences/cm/conference/soda21