[
https://issues.apache.org/jira/browse/SPARK-3980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jarred Li updated SPARK-3980:
-
Description:
I run 4 workes in AWS (c3.xlarge), 4g memory for executor, 85,331,846 edges
from(http://socialcomputing.asu.edu/uploads/1296759055/Twitter-dataset.zip).
For PageRank algorithm, the job can not be completed within 7 hours. For small
dataset with 5,000,000
edges(http://socialcomputing.asu.edu/uploads/1296591553/Last.fm-dataset.zip)
, the job can be completed within 16 seconds.
was:I run 4 workes in AWS (c3.xlarge), 4g memory for executor, 85,331,846
edges
from(http://socialcomputing.asu.edu/uploads/1296759055/Twitter-dataset.zip).
For PageRank algorithm, the job can not be completed within 7 hours.
GraphX Performance Issue
Key: SPARK-3980
URL: https://issues.apache.org/jira/browse/SPARK-3980
Project: Spark
Issue Type: Bug
Components: GraphX
Affects Versions: 1.1.0
Reporter: Jarred Li
I run 4 workes in AWS (c3.xlarge), 4g memory for executor, 85,331,846 edges
from(http://socialcomputing.asu.edu/uploads/1296759055/Twitter-dataset.zip).
For PageRank algorithm, the job can not be completed within 7 hours. For
small dataset with 5,000,000
edges(http://socialcomputing.asu.edu/uploads/1296591553/Last.fm-dataset.zip)
, the job can be completed within 16 seconds.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org