[
https://issues.apache.org/jira/browse/MADLIB-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16419465#comment-16419465
]
Himanshu Pandey commented on MADLIB-1084:
-----------------------------------------
[~fmcquillan], [~jingyimei]
Here are my performance test results for GPDB 4.x vs. 5.x:
*GPDB 4.3.22* (CentOS Linux release 7.4.1708 (Core))
Without grouping, 2 special nodes:
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, NULL, '{2,3}');
pagerank
----------
(1 row)
Time: 1452.957 ms
{code}
With grouping, 2 special nodes:
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, 'user_id', '{2,3}');
pagerank
----------
(1 row)
Time: 3486.102 ms
{code}
*GPDB 5.6.1* (CentOS Linux release 7.4.1708 (Core))
Without grouping, 2 special nodes, optimizer = ON (default):
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, NULL, '{1,3}');
pagerank
----------
(1 row)
Time: 11019.834 ms
{code}
With grouping, 2 special nodes, optimizer = ON (default):
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, 'user_id', '{1,3}');
pagerank
----------
(1 row)
Time: 121870.719 ms
{code}
Without grouping, 2 special nodes, optimizer = OFF:
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, NULL, '{1,3}');
pagerank
----------
(1 row)
Time: 1262.599 ms
{code}
With grouping, 2 special nodes, optimizer = OFF:
{code}
gpadmin=# SELECT madlib.pagerank( 'vertex', 'id', 'edge', 'src=src, dest=dest', 'pagerank_out', NULL, NULL, NULL, 'user_id', '{1,3}');
pagerank
----------
(1 row)
Time: 2732.486 ms
{code}
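For anyone reproducing the optimizer ON/OFF comparison, here is a minimal psql sketch. It assumes the optimizer was switched per session through the `optimizer` setting (the results above do not say how it was toggled) and that output tables left over from a previous run need to be dropped before each call. Table and column names are the ones used in the runs above.

```sql
-- Print elapsed time after each statement (the "Time: ... ms" lines above).
\timing on

-- Confirm the current planner setting for this session.
SHOW optimizer;

-- Assumption: switch to the legacy planner for this session only.
SET optimizer = off;

-- Assumption: remove output from an earlier run so the call can recreate it.
DROP TABLE IF EXISTS pagerank_out, pagerank_out_summary;

-- Same call as the grouped GPDB 5.6.1 run above.
SELECT madlib.pagerank('vertex', 'id', 'edge', 'src=src, dest=dest',
                       'pagerank_out', NULL, NULL, NULL, 'user_id', '{1,3}');

-- Restore the default planner setting.
RESET optimizer;
```

Going by the timings reported above, on GPDB 5.6.1 the optimizer = ON runs are roughly 8.7x slower without grouping (11019.834 ms vs. 1262.599 ms) and roughly 44.6x slower with grouping (121870.719 ms vs. 2732.486 ms) than the optimizer = OFF runs.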
> Graph - Personalized PageRank
> -----------------------------
>
> Key: MADLIB-1084
> URL: https://issues.apache.org/jira/browse/MADLIB-1084
> Project: Apache MADlib
> Issue Type: New Feature
> Components: Module: Graph
> Reporter: Frank McQuillan
> Assignee: Himanshu Pandey
> Priority: Major
> Fix For: v1.14
>
>
> Personalized PageRank is a variant of regular PageRank.
> Please refer to
> [http://madlib.apache.org/docs/latest/group__grp__pagerank.html] as a
> starting point.
> Reference:
> Neighborhood Formation and Anomaly Detection in Bipartite Graphs
> [http://www.cs.cmu.edu/~deepay/mywww/papers/icdm05.pdf]