Updated Gelly Roadmap

2016-04-26 Thread Vasiliki Kalavri
Hi all,

as promised, I have updated the Gelly roadmap [1].
Below, I am describing and reasoning about the changes I made. Please, let
me know whether you agree and if you have any other ideas for further
improvements and feature additions.

*1. Operators for highly skewed graphs*:
I have removed this item completely. It referred to Andra's master thesis
which has been completed. We had a discussion back then (see [2]), but no
activity after that.

*2. Scala API*:
I removed this item because it's done 

*3. Graph Streaming*:
I removed this item also. We have built an experimental API for graph
streaming with Paris and KTH students. The code is available in [3]. If you
think it would be a valuable addition as a library to Flink, we can start a
separate discussion thread about it.

*4. Library Methods*:
- Affinity Propagation: this is WIP in FLINK-1707
- HITS + Adsorption: Removed. They were started by TUB students and not
finished. I propose to only revisit these only if someone asks for them.
- Strongly CC + DIA: Removed. Nobody has worked on them AFAIK.

*5. Graph partitioning*:
This is still relevant in my opinion and thus I kept it in the updated
roadmap.

*6. Partition-centric iterations*:
We have created a POC implementation with KTH students [4]. In my opinion,
it would be nice to add this to Gelly and most of the work has already been
done, so I kept it.

*7. Generic Iterations*:
This requires caching intermediate results. Anyone has a status update on
that?

*8. Performance evaluation*:
I'm currently working on this, integrating Gelly with the Graphalytics
benchmark. My WIP is in [5] in case you want to get involved :)

*9. Bipartite support*:
This is still relevant in my opinion. Someone had started working on it,
but has been inactive for a while. I pinged the JIRA (FLINK-2254).

>From the wishlist:
- *Neo4j* input/output formats have been implemented as an external project
by Martin Junghanns. Shall we go ahead and them to project-flink? We should
definitely link to this from the third-party packages.

- *TinkerPop*: Discussion started from the TinkerPop community, but there
was not much activity from our side [6]. I wrote my opinion then and I am
very much in favor. Anyone else wants to share their thoughts?

Looking forward to your input,
-Vasia.


[1]: https://cwiki.apache.org/confluence/display/FLINK/Flink+Gelly
[2]:
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/Proposal-Addition-to-Gelly-td7436.html
[3]: https://github.com/vasia/gelly-streaming
[4]: https://github.com/vasia/gelly-partition-centric
[5]: https://github.com/vasia/graphalytics-platforms-gelly
[6]:
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/Apache-Tinkerpop-amp-Geode-Integration-td9252.html


Re: Gelly Roadmap

2015-06-03 Thread Ufuk Celebi

On 03 Jun 2015, at 23:51, Stephan Ewen se...@apache.org wrote:

 It may make sense to link the JIRAs from the roadmap, or add comments what
 people are working on and what is in the queue.

Linking JIRAs from the roadmap would solve all these points. ;) So +1 for issue 
links.

Re: Gelly Roadmap

2015-06-03 Thread Vasiliki Kalavri
Hey,

yes, this is kind of an ambitious and long-term roadmap :)

As Andra said, tasks 1-2-3-4 are already work in progress.
Regarding the rest, we could discuss on the priority or leave it on the
personal interest of contributors.

Personally, I think bipartite support should be quite easy to add and a
very nice feature to have.
Then, I would love to see some support for partitioning / partition-centric
iterations and generic iterations of course.


-Vasia.

On 3 June 2015 at 14:58, Andra Lungu lungu.an...@gmail.com wrote:

 Hi Stephan,

 Well the operators fro skewed graphs and the Gelly streaming project will
 be done by the end of the semester. Otherwise Daniel and I won't graduate
 :);
 The Scala API is ongoing work (there is a link to the corresponding JIRA in
 the roadmap);
 The Library methods are in the making; First two are being written; the
 second two are potential tasks for a Shivani, an intern I am supervising at
 DIMA;
 Graph Partitioning is also programmed for the near future. There is a JIRA
 there with the corresponding conversations;
 The generic iterations are still being discussed. We reached a conclusion
 there, but as far as I know nobody is working on it yet;
 Performance evaluation will probably be done with graphalytics, but that
 and the bipartite graph support are future tasks (there is nobody working
 on them yet);

 The wish-list is exactly as its name states, a wish-list :) All the other
 items are either currently addressed or will be addressed in the
 foreseeable future.

 The order is more or less the one on the roadmap. What's on the bottom is
 more of a set of wishes, whereas the tasks at the beginning are considered
 a priority and are being addressed.

 Andra

 On Wed, Jun 3, 2015 at 2:45 PM, Stephan Ewen se...@apache.org wrote:

  Wow, this is an extremely big and ambitious roadmap.
 
  Are all these things realistically possible in the foreseeable future? Or
  is this more a wish-list?
 
  Do you have an idea what issues to approach in what order?
 
  On Wed, May 20, 2015 at 8:58 PM, Andra Lungu lungu.an...@gmail.com
  wrote:
 
   The Roadmap is now available as a wiki page.
   https://cwiki.apache.org/confluence/display/FLINK/Flink+Gelly
  
   We're still happy to hear comments/suggestions, if any :)
  
   On Wed, May 20, 2015 at 8:43 PM, Andra Lungu lungu.an...@gmail.com
   wrote:
  
Thanks, Stephan!
   
On Wed, May 20, 2015 at 8:42 PM, Stephan Ewen se...@apache.org
  wrote:
   
All right, you should have permissions now.
   
On Wed, May 20, 2015 at 8:37 PM, Andra Lungu lungu.an...@gmail.com
 
wrote:
   
 Sure, but first I need permissions! :)

 *NOTE*: Due to spamming, we can not give every confluence user
 edit
 permissions to the wiki. Just write to the dev@flink.apache.org
  (you
can
 also email to rmetzger at apache.org) mailing list to get edit
 permissions.

 My user is lungu.andra

 Thanks!

 On Wed, May 20, 2015 at 4:27 PM, Vasiliki Kalavri 
 vasilikikala...@gmail.com
  wrote:

  Thank you for your feedback and ideas everyone!
 
  @Andra, how about moving the roadmap to the wiki?
 
  On 20 May 2015 at 15:48, Kostas Tzoumas ktzou...@apache.org
   wrote:
 
   :-D
  
   Great!
  
   On Tue, May 19, 2015 at 4:00 PM, Andra Lungu 
   lungu.an...@gmail.com

   wrote:
  
Hi Kostas,
   
We're way ahead of you! The first draft of the blog post is
 internally
reviewed as we speak ;)
   
   
On Tue, May 19, 2015 at 3:49 PM, Kostas Tzoumas 
ktzou...@apache.org
 
wrote:
   
 This is very cool!

 Would also love to see a blog post on Gelly at some point,
perhaps
 finishing off with the roadmap

 On Tue, May 19, 2015 at 10:55 AM, Andra Lungu 
 lungu.an...@gmail.com
  
 wrote:

  Apache TinkerPop made it to the wish-list :)
  Thanks, Fabian!
 
  On Tue, May 19, 2015 at 9:42 AM, Flavio Pompermaier 
 pomperma...@okkam.it
  wrote:
 
   You're definitely right Fabian +1
  
   On Mon, May 18, 2015 at 2:03 PM, Fabian Hueske 
  fhue...@gmail.com
  wrote:
  
Integration with Apache TinkerPop3 could also be
interesting.
TinkerPop3 is an API for transactional and
 analytical
graph
 processing
   and
supported by several Graph engines/databases.
   
It might be interesting to see if/how Gelly's and
TinkerPop's
 concepts
match and think about whether it makes sense to
   integrate
the
  two
   projects
in some way.
   
2015-05-18 12:20 GMT+02:00 Gianmarco De Francisci
   Morales

   g...@apache.org
:
   
 Hi,
 

Re: Gelly Roadmap

2015-06-03 Thread Stephan Ewen
Wow, this is an extremely big and ambitious roadmap.

Are all these things realistically possible in the foreseeable future? Or
is this more a wish-list?

Do you have an idea what issues to approach in what order?

On Wed, May 20, 2015 at 8:58 PM, Andra Lungu lungu.an...@gmail.com wrote:

 The Roadmap is now available as a wiki page.
 https://cwiki.apache.org/confluence/display/FLINK/Flink+Gelly

 We're still happy to hear comments/suggestions, if any :)

 On Wed, May 20, 2015 at 8:43 PM, Andra Lungu lungu.an...@gmail.com
 wrote:

  Thanks, Stephan!
 
  On Wed, May 20, 2015 at 8:42 PM, Stephan Ewen se...@apache.org wrote:
 
  All right, you should have permissions now.
 
  On Wed, May 20, 2015 at 8:37 PM, Andra Lungu lungu.an...@gmail.com
  wrote:
 
   Sure, but first I need permissions! :)
  
   *NOTE*: Due to spamming, we can not give every confluence user edit
   permissions to the wiki. Just write to the dev@flink.apache.org (you
  can
   also email to rmetzger at apache.org) mailing list to get edit
   permissions.
  
   My user is lungu.andra
  
   Thanks!
  
   On Wed, May 20, 2015 at 4:27 PM, Vasiliki Kalavri 
   vasilikikala...@gmail.com
wrote:
  
Thank you for your feedback and ideas everyone!
   
@Andra, how about moving the roadmap to the wiki?
   
On 20 May 2015 at 15:48, Kostas Tzoumas ktzou...@apache.org
 wrote:
   
 :-D

 Great!

 On Tue, May 19, 2015 at 4:00 PM, Andra Lungu 
 lungu.an...@gmail.com
  
 wrote:

  Hi Kostas,
 
  We're way ahead of you! The first draft of the blog post is
   internally
  reviewed as we speak ;)
 
 
  On Tue, May 19, 2015 at 3:49 PM, Kostas Tzoumas 
  ktzou...@apache.org
   
  wrote:
 
   This is very cool!
  
   Would also love to see a blog post on Gelly at some point,
  perhaps
   finishing off with the roadmap
  
   On Tue, May 19, 2015 at 10:55 AM, Andra Lungu 
   lungu.an...@gmail.com

   wrote:
  
Apache TinkerPop made it to the wish-list :)
Thanks, Fabian!
   
On Tue, May 19, 2015 at 9:42 AM, Flavio Pompermaier 
   pomperma...@okkam.it
wrote:
   
 You're definitely right Fabian +1

 On Mon, May 18, 2015 at 2:03 PM, Fabian Hueske 
fhue...@gmail.com
wrote:

  Integration with Apache TinkerPop3 could also be
  interesting.
  TinkerPop3 is an API for transactional and analytical
  graph
   processing
 and
  supported by several Graph engines/databases.
 
  It might be interesting to see if/how Gelly's and
  TinkerPop's
   concepts
  match and think about whether it makes sense to
 integrate
  the
two
 projects
  in some way.
 
  2015-05-18 12:20 GMT+02:00 Gianmarco De Francisci
 Morales
  
 g...@apache.org
  :
 
   Hi,
  
   Great job putting together this roadmap!
  
   Regarding 1) you may find this work relevant:
   http://arxiv.org/abs/1503.00626v1
  
   Regarding 2) the semi-streaming model allows
 polylog(n)
passes.
   If you allow only one pass, it's a streaming model.
   In addition, I think models such as the
 turnstile/fully
dynamic
   (with
   addition and deletion) would be more appropriate than
  the
   incremental
  one.
  
   Regarding 5) (when combined with 2) there are some
 good
recent
 algorithms
   for partitioning that you might want to look at.
   http://dl.acm.org/citation.cfm?id=2556213
   http://dl.acm.org/citation.cfm?id=2339722
  
   Cheers,
  
   --
   Gianmarco
  
   On 18 May 2015 at 12:42, Vasiliki Kalavri 
   vasilikikala...@gmail.com

   wrote:
  
Hello everyone,
   
thank you Andra for putting together this nice
  document
   and
 for
  sharing!
   
Gelly is already in a state that matches the basic
  functionality
   of
existing distributed graph libraries. We're very
 happy
   for
 this
   and
  also
very grateful to everyone who has contributed so
 far!
   
We are also very excited to keep on extending Gelly
  and
   our
   vision
is
  to
turn it into something much more than yet another
   library
 for
 running
PageRank-like algorithms. The document that Andra
 has
shared
 contains
ongoing work and our ideas on mid- and long-term
  plans,
   in
  order
   to
  make
this happen :)
   
We would love to get the community's feedback on
  this, so
  please
feel
   free
to 

Re: Gelly Roadmap

2015-05-20 Thread Kostas Tzoumas
:-D

Great!

On Tue, May 19, 2015 at 4:00 PM, Andra Lungu lungu.an...@gmail.com wrote:

 Hi Kostas,

 We're way ahead of you! The first draft of the blog post is internally
 reviewed as we speak ;)


 On Tue, May 19, 2015 at 3:49 PM, Kostas Tzoumas ktzou...@apache.org
 wrote:

  This is very cool!
 
  Would also love to see a blog post on Gelly at some point, perhaps
  finishing off with the roadmap
 
  On Tue, May 19, 2015 at 10:55 AM, Andra Lungu lungu.an...@gmail.com
  wrote:
 
   Apache TinkerPop made it to the wish-list :)
   Thanks, Fabian!
  
   On Tue, May 19, 2015 at 9:42 AM, Flavio Pompermaier 
  pomperma...@okkam.it
   wrote:
  
You're definitely right Fabian +1
   
On Mon, May 18, 2015 at 2:03 PM, Fabian Hueske fhue...@gmail.com
   wrote:
   
 Integration with Apache TinkerPop3 could also be interesting.
 TinkerPop3 is an API for transactional and analytical graph
  processing
and
 supported by several Graph engines/databases.

 It might be interesting to see if/how Gelly's and TinkerPop's
  concepts
 match and think about whether it makes sense to integrate the two
projects
 in some way.

 2015-05-18 12:20 GMT+02:00 Gianmarco De Francisci Morales 
g...@apache.org
 :

  Hi,
 
  Great job putting together this roadmap!
 
  Regarding 1) you may find this work relevant:
  http://arxiv.org/abs/1503.00626v1
 
  Regarding 2) the semi-streaming model allows polylog(n) passes.
  If you allow only one pass, it's a streaming model.
  In addition, I think models such as the turnstile/fully dynamic
  (with
  addition and deletion) would be more appropriate than the
  incremental
 one.
 
  Regarding 5) (when combined with 2) there are some good recent
algorithms
  for partitioning that you might want to look at.
  http://dl.acm.org/citation.cfm?id=2556213
  http://dl.acm.org/citation.cfm?id=2339722
 
  Cheers,
 
  --
  Gianmarco
 
  On 18 May 2015 at 12:42, Vasiliki Kalavri 
  vasilikikala...@gmail.com
   
  wrote:
 
   Hello everyone,
  
   thank you Andra for putting together this nice document and for
 sharing!
  
   Gelly is already in a state that matches the basic
 functionality
  of
   existing distributed graph libraries. We're very happy for this
  and
 also
   very grateful to everyone who has contributed so far!
  
   We are also very excited to keep on extending Gelly and our
  vision
   is
 to
   turn it into something much more than yet another library for
running
   PageRank-like algorithms. The document that Andra has shared
contains
   ongoing work and our ideas on mid- and long-term plans, in
 order
  to
 make
   this happen :)
  
   We would love to get the community's feedback on this, so
 please
   feel
  free
   to comment on the doc or discuss individual points here.
  
   Cheers,
   Vasia.
  
   On 18 May 2015 at 11:01, Andra Lungu lungu.an...@gmail.com
   wrote:
  
Dear Squirrels,
   
The following document is intended to give you an insight on
   where
   Flink's
Graph API, Gelly, is at the moment and especially on the
   direction
in
   which
it's heading in the near future.
   
   
   
  
 

   
  
 
 https://docs.google.com/document/d/11ovK1xZZTl3Qinw6Ud1rqpaDhTvRHDv8PWBzcHITFkM/edit?usp=sharing
   
We would love to hear about your comments/ suggestions/
  additions
to
  the
wish-list :)
   
Cheers,
Andra
   
  
 

   
  
 



Re: Gelly Roadmap

2015-05-20 Thread Andra Lungu
Thanks, Stephan!

On Wed, May 20, 2015 at 8:42 PM, Stephan Ewen se...@apache.org wrote:

 All right, you should have permissions now.

 On Wed, May 20, 2015 at 8:37 PM, Andra Lungu lungu.an...@gmail.com
 wrote:

  Sure, but first I need permissions! :)
 
  *NOTE*: Due to spamming, we can not give every confluence user edit
  permissions to the wiki. Just write to the dev@flink.apache.org (you can
  also email to rmetzger at apache.org) mailing list to get edit
  permissions.
 
  My user is lungu.andra
 
  Thanks!
 
  On Wed, May 20, 2015 at 4:27 PM, Vasiliki Kalavri 
  vasilikikala...@gmail.com
   wrote:
 
   Thank you for your feedback and ideas everyone!
  
   @Andra, how about moving the roadmap to the wiki?
  
   On 20 May 2015 at 15:48, Kostas Tzoumas ktzou...@apache.org wrote:
  
:-D
   
Great!
   
On Tue, May 19, 2015 at 4:00 PM, Andra Lungu lungu.an...@gmail.com
wrote:
   
 Hi Kostas,

 We're way ahead of you! The first draft of the blog post is
  internally
 reviewed as we speak ;)


 On Tue, May 19, 2015 at 3:49 PM, Kostas Tzoumas 
 ktzou...@apache.org
  
 wrote:

  This is very cool!
 
  Would also love to see a blog post on Gelly at some point,
 perhaps
  finishing off with the roadmap
 
  On Tue, May 19, 2015 at 10:55 AM, Andra Lungu 
  lungu.an...@gmail.com
   
  wrote:
 
   Apache TinkerPop made it to the wish-list :)
   Thanks, Fabian!
  
   On Tue, May 19, 2015 at 9:42 AM, Flavio Pompermaier 
  pomperma...@okkam.it
   wrote:
  
You're definitely right Fabian +1
   
On Mon, May 18, 2015 at 2:03 PM, Fabian Hueske 
   fhue...@gmail.com
   wrote:
   
 Integration with Apache TinkerPop3 could also be
 interesting.
 TinkerPop3 is an API for transactional and analytical graph
  processing
and
 supported by several Graph engines/databases.

 It might be interesting to see if/how Gelly's and
 TinkerPop's
  concepts
 match and think about whether it makes sense to integrate
 the
   two
projects
 in some way.

 2015-05-18 12:20 GMT+02:00 Gianmarco De Francisci Morales 
g...@apache.org
 :

  Hi,
 
  Great job putting together this roadmap!
 
  Regarding 1) you may find this work relevant:
  http://arxiv.org/abs/1503.00626v1
 
  Regarding 2) the semi-streaming model allows polylog(n)
   passes.
  If you allow only one pass, it's a streaming model.
  In addition, I think models such as the turnstile/fully
   dynamic
  (with
  addition and deletion) would be more appropriate than the
  incremental
 one.
 
  Regarding 5) (when combined with 2) there are some good
   recent
algorithms
  for partitioning that you might want to look at.
  http://dl.acm.org/citation.cfm?id=2556213
  http://dl.acm.org/citation.cfm?id=2339722
 
  Cheers,
 
  --
  Gianmarco
 
  On 18 May 2015 at 12:42, Vasiliki Kalavri 
  vasilikikala...@gmail.com
   
  wrote:
 
   Hello everyone,
  
   thank you Andra for putting together this nice document
  and
for
 sharing!
  
   Gelly is already in a state that matches the basic
 functionality
  of
   existing distributed graph libraries. We're very happy
  for
this
  and
 also
   very grateful to everyone who has contributed so far!
  
   We are also very excited to keep on extending Gelly and
  our
  vision
   is
 to
   turn it into something much more than yet another
  library
for
running
   PageRank-like algorithms. The document that Andra has
   shared
contains
   ongoing work and our ideas on mid- and long-term plans,
  in
 order
  to
 make
   this happen :)
  
   We would love to get the community's feedback on this,
 so
 please
   feel
  free
   to comment on the doc or discuss individual points
 here.
  
   Cheers,
   Vasia.
  
   On 18 May 2015 at 11:01, Andra Lungu 
   lungu.an...@gmail.com
   wrote:
  
Dear Squirrels,
   
The following document is intended to give you an
  insight
on
   where
   Flink's
Graph API, Gelly, is at the moment and especially on
  the
   direction
in
   which
it's heading in the near future.
   
   
   
  
 

   
  
 

   
  
 
 https://docs.google.com/document/d/11ovK1xZZTl3Qinw6Ud1rqpaDhTvRHDv8PWBzcHITFkM/edit?usp=sharing
   
 

Re: Gelly Roadmap

2015-05-20 Thread Stephan Ewen
All right, you should have permissions now.

On Wed, May 20, 2015 at 8:37 PM, Andra Lungu lungu.an...@gmail.com wrote:

 Sure, but first I need permissions! :)

 *NOTE*: Due to spamming, we can not give every confluence user edit
 permissions to the wiki. Just write to the dev@flink.apache.org (you can
 also email to rmetzger at apache.org) mailing list to get edit
 permissions.

 My user is lungu.andra

 Thanks!

 On Wed, May 20, 2015 at 4:27 PM, Vasiliki Kalavri 
 vasilikikala...@gmail.com
  wrote:

  Thank you for your feedback and ideas everyone!
 
  @Andra, how about moving the roadmap to the wiki?
 
  On 20 May 2015 at 15:48, Kostas Tzoumas ktzou...@apache.org wrote:
 
   :-D
  
   Great!
  
   On Tue, May 19, 2015 at 4:00 PM, Andra Lungu lungu.an...@gmail.com
   wrote:
  
Hi Kostas,
   
We're way ahead of you! The first draft of the blog post is
 internally
reviewed as we speak ;)
   
   
On Tue, May 19, 2015 at 3:49 PM, Kostas Tzoumas ktzou...@apache.org
 
wrote:
   
 This is very cool!

 Would also love to see a blog post on Gelly at some point, perhaps
 finishing off with the roadmap

 On Tue, May 19, 2015 at 10:55 AM, Andra Lungu 
 lungu.an...@gmail.com
  
 wrote:

  Apache TinkerPop made it to the wish-list :)
  Thanks, Fabian!
 
  On Tue, May 19, 2015 at 9:42 AM, Flavio Pompermaier 
 pomperma...@okkam.it
  wrote:
 
   You're definitely right Fabian +1
  
   On Mon, May 18, 2015 at 2:03 PM, Fabian Hueske 
  fhue...@gmail.com
  wrote:
  
Integration with Apache TinkerPop3 could also be interesting.
TinkerPop3 is an API for transactional and analytical graph
 processing
   and
supported by several Graph engines/databases.
   
It might be interesting to see if/how Gelly's and TinkerPop's
 concepts
match and think about whether it makes sense to integrate the
  two
   projects
in some way.
   
2015-05-18 12:20 GMT+02:00 Gianmarco De Francisci Morales 
   g...@apache.org
:
   
 Hi,

 Great job putting together this roadmap!

 Regarding 1) you may find this work relevant:
 http://arxiv.org/abs/1503.00626v1

 Regarding 2) the semi-streaming model allows polylog(n)
  passes.
 If you allow only one pass, it's a streaming model.
 In addition, I think models such as the turnstile/fully
  dynamic
 (with
 addition and deletion) would be more appropriate than the
 incremental
one.

 Regarding 5) (when combined with 2) there are some good
  recent
   algorithms
 for partitioning that you might want to look at.
 http://dl.acm.org/citation.cfm?id=2556213
 http://dl.acm.org/citation.cfm?id=2339722

 Cheers,

 --
 Gianmarco

 On 18 May 2015 at 12:42, Vasiliki Kalavri 
 vasilikikala...@gmail.com
  
 wrote:

  Hello everyone,
 
  thank you Andra for putting together this nice document
 and
   for
sharing!
 
  Gelly is already in a state that matches the basic
functionality
 of
  existing distributed graph libraries. We're very happy
 for
   this
 and
also
  very grateful to everyone who has contributed so far!
 
  We are also very excited to keep on extending Gelly and
 our
 vision
  is
to
  turn it into something much more than yet another
 library
   for
   running
  PageRank-like algorithms. The document that Andra has
  shared
   contains
  ongoing work and our ideas on mid- and long-term plans,
 in
order
 to
make
  this happen :)
 
  We would love to get the community's feedback on this, so
please
  feel
 free
  to comment on the doc or discuss individual points here.
 
  Cheers,
  Vasia.
 
  On 18 May 2015 at 11:01, Andra Lungu 
  lungu.an...@gmail.com
  wrote:
 
   Dear Squirrels,
  
   The following document is intended to give you an
 insight
   on
  where
  Flink's
   Graph API, Gelly, is at the moment and especially on
 the
  direction
   in
  which
   it's heading in the near future.
  
  
  
 

   
  
 

   
  
 
 https://docs.google.com/document/d/11ovK1xZZTl3Qinw6Ud1rqpaDhTvRHDv8PWBzcHITFkM/edit?usp=sharing
  
   We would love to hear about your comments/ suggestions/
 additions
   to
 the
   wish-list :)
  
   Cheers,
   Andra
  
 

   
  
 

   
  
 



Re: Gelly Roadmap

2015-05-20 Thread Andra Lungu
The Roadmap is now available as a wiki page.
https://cwiki.apache.org/confluence/display/FLINK/Flink+Gelly

We're still happy to hear comments/suggestions, if any :)

On Wed, May 20, 2015 at 8:43 PM, Andra Lungu lungu.an...@gmail.com wrote:

 Thanks, Stephan!

 On Wed, May 20, 2015 at 8:42 PM, Stephan Ewen se...@apache.org wrote:

 All right, you should have permissions now.

 On Wed, May 20, 2015 at 8:37 PM, Andra Lungu lungu.an...@gmail.com
 wrote:

  Sure, but first I need permissions! :)
 
  *NOTE*: Due to spamming, we can not give every confluence user edit
  permissions to the wiki. Just write to the dev@flink.apache.org (you
 can
  also email to rmetzger at apache.org) mailing list to get edit
  permissions.
 
  My user is lungu.andra
 
  Thanks!
 
  On Wed, May 20, 2015 at 4:27 PM, Vasiliki Kalavri 
  vasilikikala...@gmail.com
   wrote:
 
   Thank you for your feedback and ideas everyone!
  
   @Andra, how about moving the roadmap to the wiki?
  
   On 20 May 2015 at 15:48, Kostas Tzoumas ktzou...@apache.org wrote:
  
:-D
   
Great!
   
On Tue, May 19, 2015 at 4:00 PM, Andra Lungu lungu.an...@gmail.com
 
wrote:
   
 Hi Kostas,

 We're way ahead of you! The first draft of the blog post is
  internally
 reviewed as we speak ;)


 On Tue, May 19, 2015 at 3:49 PM, Kostas Tzoumas 
 ktzou...@apache.org
  
 wrote:

  This is very cool!
 
  Would also love to see a blog post on Gelly at some point,
 perhaps
  finishing off with the roadmap
 
  On Tue, May 19, 2015 at 10:55 AM, Andra Lungu 
  lungu.an...@gmail.com
   
  wrote:
 
   Apache TinkerPop made it to the wish-list :)
   Thanks, Fabian!
  
   On Tue, May 19, 2015 at 9:42 AM, Flavio Pompermaier 
  pomperma...@okkam.it
   wrote:
  
You're definitely right Fabian +1
   
On Mon, May 18, 2015 at 2:03 PM, Fabian Hueske 
   fhue...@gmail.com
   wrote:
   
 Integration with Apache TinkerPop3 could also be
 interesting.
 TinkerPop3 is an API for transactional and analytical
 graph
  processing
and
 supported by several Graph engines/databases.

 It might be interesting to see if/how Gelly's and
 TinkerPop's
  concepts
 match and think about whether it makes sense to integrate
 the
   two
projects
 in some way.

 2015-05-18 12:20 GMT+02:00 Gianmarco De Francisci Morales
 
g...@apache.org
 :

  Hi,
 
  Great job putting together this roadmap!
 
  Regarding 1) you may find this work relevant:
  http://arxiv.org/abs/1503.00626v1
 
  Regarding 2) the semi-streaming model allows polylog(n)
   passes.
  If you allow only one pass, it's a streaming model.
  In addition, I think models such as the turnstile/fully
   dynamic
  (with
  addition and deletion) would be more appropriate than
 the
  incremental
 one.
 
  Regarding 5) (when combined with 2) there are some good
   recent
algorithms
  for partitioning that you might want to look at.
  http://dl.acm.org/citation.cfm?id=2556213
  http://dl.acm.org/citation.cfm?id=2339722
 
  Cheers,
 
  --
  Gianmarco
 
  On 18 May 2015 at 12:42, Vasiliki Kalavri 
  vasilikikala...@gmail.com
   
  wrote:
 
   Hello everyone,
  
   thank you Andra for putting together this nice
 document
  and
for
 sharing!
  
   Gelly is already in a state that matches the basic
 functionality
  of
   existing distributed graph libraries. We're very happy
  for
this
  and
 also
   very grateful to everyone who has contributed so far!
  
   We are also very excited to keep on extending Gelly
 and
  our
  vision
   is
 to
   turn it into something much more than yet another
  library
for
running
   PageRank-like algorithms. The document that Andra has
   shared
contains
   ongoing work and our ideas on mid- and long-term
 plans,
  in
 order
  to
 make
   this happen :)
  
   We would love to get the community's feedback on
 this, so
 please
   feel
  free
   to comment on the doc or discuss individual points
 here.
  
   Cheers,
   Vasia.
  
   On 18 May 2015 at 11:01, Andra Lungu 
   lungu.an...@gmail.com
   wrote:
  
Dear Squirrels,
   
The following document is intended to give you an
  insight
on
   where
   Flink's
Graph API, Gelly, is at the moment and especially on
  the
   direction
in
   which

Re: Gelly Roadmap

2015-05-19 Thread Andra Lungu
Apache TinkerPop made it to the wish-list :)
Thanks, Fabian!

On Tue, May 19, 2015 at 9:42 AM, Flavio Pompermaier pomperma...@okkam.it
wrote:

 You're definitely right Fabian +1

 On Mon, May 18, 2015 at 2:03 PM, Fabian Hueske fhue...@gmail.com wrote:

  Integration with Apache TinkerPop3 could also be interesting.
  TinkerPop3 is an API for transactional and analytical graph processing
 and
  supported by several Graph engines/databases.
 
  It might be interesting to see if/how Gelly's and TinkerPop's concepts
  match and think about whether it makes sense to integrate the two
 projects
  in some way.
 
  2015-05-18 12:20 GMT+02:00 Gianmarco De Francisci Morales 
 g...@apache.org
  :
 
   Hi,
  
   Great job putting together this roadmap!
  
   Regarding 1) you may find this work relevant:
   http://arxiv.org/abs/1503.00626v1
  
   Regarding 2) the semi-streaming model allows polylog(n) passes.
   If you allow only one pass, it's a streaming model.
   In addition, I think models such as the turnstile/fully dynamic (with
   addition and deletion) would be more appropriate than the incremental
  one.
  
   Regarding 5) (when combined with 2) there are some good recent
 algorithms
   for partitioning that you might want to look at.
   http://dl.acm.org/citation.cfm?id=2556213
   http://dl.acm.org/citation.cfm?id=2339722
  
   Cheers,
  
   --
   Gianmarco
  
   On 18 May 2015 at 12:42, Vasiliki Kalavri vasilikikala...@gmail.com
   wrote:
  
Hello everyone,
   
thank you Andra for putting together this nice document and for
  sharing!
   
Gelly is already in a state that matches the basic functionality of
existing distributed graph libraries. We're very happy for this and
  also
very grateful to everyone who has contributed so far!
   
We are also very excited to keep on extending Gelly and our vision is
  to
turn it into something much more than yet another library for
 running
PageRank-like algorithms. The document that Andra has shared
 contains
ongoing work and our ideas on mid- and long-term plans, in order to
  make
this happen :)
   
We would love to get the community's feedback on this, so please feel
   free
to comment on the doc or discuss individual points here.
   
Cheers,
Vasia.
   
On 18 May 2015 at 11:01, Andra Lungu lungu.an...@gmail.com wrote:
   
 Dear Squirrels,

 The following document is intended to give you an insight on where
Flink's
 Graph API, Gelly, is at the moment and especially on the direction
 in
which
 it's heading in the near future.



   
  
 
 https://docs.google.com/document/d/11ovK1xZZTl3Qinw6Ud1rqpaDhTvRHDv8PWBzcHITFkM/edit?usp=sharing

 We would love to hear about your comments/ suggestions/ additions
 to
   the
 wish-list :)

 Cheers,
 Andra

   
  
 



Re: Gelly Roadmap

2015-05-18 Thread Fabian Hueske
Integration with Apache TinkerPop3 could also be interesting.
TinkerPop3 is an API for transactional and analytical graph processing and
supported by several Graph engines/databases.

It might be interesting to see if/how Gelly's and TinkerPop's concepts
match and think about whether it makes sense to integrate the two projects
in some way.

2015-05-18 12:20 GMT+02:00 Gianmarco De Francisci Morales g...@apache.org:

 Hi,

 Great job putting together this roadmap!

 Regarding 1) you may find this work relevant:
 http://arxiv.org/abs/1503.00626v1

 Regarding 2) the semi-streaming model allows polylog(n) passes.
 If you allow only one pass, it's a streaming model.
 In addition, I think models such as the turnstile/fully dynamic (with
 addition and deletion) would be more appropriate than the incremental one.

 Regarding 5) (when combined with 2) there are some good recent algorithms
 for partitioning that you might want to look at.
 http://dl.acm.org/citation.cfm?id=2556213
 http://dl.acm.org/citation.cfm?id=2339722

 Cheers,

 --
 Gianmarco

 On 18 May 2015 at 12:42, Vasiliki Kalavri vasilikikala...@gmail.com
 wrote:

  Hello everyone,
 
  thank you Andra for putting together this nice document and for sharing!
 
  Gelly is already in a state that matches the basic functionality of
  existing distributed graph libraries. We're very happy for this and also
  very grateful to everyone who has contributed so far!
 
  We are also very excited to keep on extending Gelly and our vision is to
  turn it into something much more than yet another library for running
  PageRank-like algorithms. The document that Andra has shared contains
  ongoing work and our ideas on mid- and long-term plans, in order to make
  this happen :)
 
  We would love to get the community's feedback on this, so please feel
 free
  to comment on the doc or discuss individual points here.
 
  Cheers,
  Vasia.
 
  On 18 May 2015 at 11:01, Andra Lungu lungu.an...@gmail.com wrote:
 
   Dear Squirrels,
  
   The following document is intended to give you an insight on where
  Flink's
   Graph API, Gelly, is at the moment and especially on the direction in
  which
   it's heading in the near future.
  
  
  
 
 https://docs.google.com/document/d/11ovK1xZZTl3Qinw6Ud1rqpaDhTvRHDv8PWBzcHITFkM/edit?usp=sharing
  
   We would love to hear about your comments/ suggestions/ additions to
 the
   wish-list :)
  
   Cheers,
   Andra
  
 



Re: Gelly Roadmap

2015-05-18 Thread Vasiliki Kalavri
Hello everyone,

thank you Andra for putting together this nice document and for sharing!

Gelly is already in a state that matches the basic functionality of
existing distributed graph libraries. We're very happy for this and also
very grateful to everyone who has contributed so far!

We are also very excited to keep on extending Gelly and our vision is to
turn it into something much more than yet another library for running
PageRank-like algorithms. The document that Andra has shared contains
ongoing work and our ideas on mid- and long-term plans, in order to make
this happen :)

We would love to get the community's feedback on this, so please feel free
to comment on the doc or discuss individual points here.

Cheers,
Vasia.

On 18 May 2015 at 11:01, Andra Lungu lungu.an...@gmail.com wrote:

 Dear Squirrels,

 The following document is intended to give you an insight on where Flink's
 Graph API, Gelly, is at the moment and especially on the direction in which
 it's heading in the near future.


 https://docs.google.com/document/d/11ovK1xZZTl3Qinw6Ud1rqpaDhTvRHDv8PWBzcHITFkM/edit?usp=sharing

 We would love to hear about your comments/ suggestions/ additions to the
 wish-list :)

 Cheers,
 Andra



Re: Gelly Roadmap

2015-05-18 Thread Gianmarco De Francisci Morales
Hi,

Great job putting together this roadmap!

Regarding 1) you may find this work relevant:
http://arxiv.org/abs/1503.00626v1

Regarding 2) the semi-streaming model allows polylog(n) passes.
If you allow only one pass, it's a streaming model.
In addition, I think models such as the turnstile/fully dynamic (with
addition and deletion) would be more appropriate than the incremental one.

Regarding 5) (when combined with 2) there are some good recent algorithms
for partitioning that you might want to look at.
http://dl.acm.org/citation.cfm?id=2556213
http://dl.acm.org/citation.cfm?id=2339722

Cheers,

--
Gianmarco

On 18 May 2015 at 12:42, Vasiliki Kalavri vasilikikala...@gmail.com wrote:

 Hello everyone,

 thank you Andra for putting together this nice document and for sharing!

 Gelly is already in a state that matches the basic functionality of
 existing distributed graph libraries. We're very happy for this and also
 very grateful to everyone who has contributed so far!

 We are also very excited to keep on extending Gelly and our vision is to
 turn it into something much more than yet another library for running
 PageRank-like algorithms. The document that Andra has shared contains
 ongoing work and our ideas on mid- and long-term plans, in order to make
 this happen :)

 We would love to get the community's feedback on this, so please feel free
 to comment on the doc or discuss individual points here.

 Cheers,
 Vasia.

 On 18 May 2015 at 11:01, Andra Lungu lungu.an...@gmail.com wrote:

  Dear Squirrels,
 
  The following document is intended to give you an insight on where
 Flink's
  Graph API, Gelly, is at the moment and especially on the direction in
 which
  it's heading in the near future.
 
 
 
 https://docs.google.com/document/d/11ovK1xZZTl3Qinw6Ud1rqpaDhTvRHDv8PWBzcHITFkM/edit?usp=sharing
 
  We would love to hear about your comments/ suggestions/ additions to the
  wish-list :)
 
  Cheers,
  Andra