Github user gallenvara commented on the pull request:

    https://github.com/apache/flink/pull/1956#issuecomment-217070480
  
    Thanks a lot, @greghogan @vasia .
    My limited understand on the tips your have mentioned :
    1) The PR implemented HITS by dividing hub&authority updating into two 
phases. Value updating and normalization limit the two phased can not be 
handled in the same `superstep`. IMO, we can cache the hub updating result and 
send them to next authority iteration and package final authority and hub value 
as `Tuple2` type to return. What's your opinion on this?
    2) GSA does not support for choosing edge direction and each vertex will be 
updated based on the values of its in-neighbors only. In the implementation, 
hub updating use the value of target vertex where the edge direction is out, 
authority updating used the value of the value of source vertex where the edge 
direction is in. IMO, it does not work for hub updating procedure if used GSA.
    3)Yes, vertices have been initialized in the test. It should be better to 
be set into the algortithm before the first iteration using 
`Graph.translateVertexValues`.
    4) Because the edge value not used, the translation is optional and it can 
keep its original value and type.
    5) Yes, adding threshold may reduce iteration time for the case of small 
graph with great `maxIteration`. (Does the scatter-gather or GSA have a default 
threshold to check for no value updating during the iteration?)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

Reply via email to