Andrew Or created SPARK-7716:
--------------------------------
Summary: SparkUI stage page hangs with many tasks
Key: SPARK-7716
URL: https://issues.apache.org/jira/browse/SPARK-7716
Project: Spark
Issue Type: Bug
Components: Web UI
Affects Versions: 1.4.0
Reporter: Andrew Or
Assignee: Andrew Or
Priority: Critical
While running the following, I checked the stage page on the SparkUI:
{code}
sc.parallelize(1 to 5000, 100000).count()
{code}
Then I get
{code}
HTTP ERROR 500
Problem accessing /stages/stage/. Reason:
Server Error
Caused by:
java.lang.OutOfMemoryError: Java heap space
at java.util.Arrays.copyOf(Arrays.java:2367)
at
java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.java:130)
at
java.lang.AbstractStringBuilder.ensureCapacityInternal(AbstractStringBuilder.java:114)
at
java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:587)
at java.lang.StringBuilder.append(StringBuilder.java:214)
{code}
This is because end up concatenating all the scala XML nodes into raw strings
and shipping them to the UI through Jetty. The long-term correct fix would be
to add pagination, but even adding a compression layer will fix this for most
cases.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]