[ https://issues.apache.org/jira/browse/MADLIB-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16419465#comment-16419465 ]
Himanshu Pandey commented on MADLIB-1084:
-----------------------------------------

[~fmcquillan], [~jingyimei]

Here are my performance test results on GPDB 4.x vs. 5.x:

*GPDB 4.3.22* (CentOS Linux release 7.4.1708 (Core))

Without grouping, 2 special nodes:
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, NULL, '{2,3}');
 pagerank
----------

(1 row)

Time: 1452.957 ms
{code}

With grouping, 2 special nodes:
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, 'user_id', '{2,3}');
 pagerank
----------

(1 row)

Time: 3486.102 ms
{code}

*GPDB 5.6.1* (CentOS Linux release 7.4.1708 (Core))

Without grouping, 2 special nodes, Optimizer = ON (default):
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, NULL, '{1,3}');
 pagerank
----------

(1 row)

Time: 11019.834 ms
{code}

With grouping, 2 special nodes, Optimizer = ON (default):
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, 'user_id', '{1,3}');
 pagerank
----------

(1 row)

Time: 121870.719 ms
{code}

Without grouping, 2 special nodes, Optimizer = OFF:
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, NULL, '{1,3}');
 pagerank
----------

(1 row)

Time: 1262.599 ms
{code}

With grouping, 2 special nodes, Optimizer = OFF:
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, 'user_id', '{1,3}');
 pagerank
----------

(1 row)

Time: 2732.486 ms
{code}

> Graph - Personalized PageRank
> -----------------------------
>
>                 Key: MADLIB-1084
>                 URL: https://issues.apache.org/jira/browse/MADLIB-1084
>             Project: Apache MADlib
>          Issue Type: New Feature
>          Components: Module: Graph
>            Reporter: Frank McQuillan
>            Assignee: Himanshu Pandey
>            Priority: Major
>             Fix For: v1.14
>
>
> Personalized PageRank which is a variant of regular PageRank.
> Please refer to
> [http://madlib.apache.org/docs/latest/group__grp__pagerank.html] as a
> starting point.
> Reference:
> Neighborhood Formation and Anomaly Detection in Bipartite Graphs
> [http://www.cs.cmu.edu/~deepay/mywww/papers/icdm05.pdf]

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)