[
https://issues.apache.org/jira/browse/HIVE-8373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15769862#comment-15769862
]
Andrew Sears commented on HIVE-8373:
------------------------------------
[~kellyzly] Java 8 removes PermGen memory and replaces with Metaspace. See
above StackOverflow link and some additional ones.
http://karunsubramanian.com/websphere/one-important-change-in-memory-management-in-java-8/
https://blogs.oracle.com/poonam/entry/about_g1_garbage_collector_permanent
http://geekswithblogs.net/JoshReuben/archive/2016/04/11/jvm-tuning.aspx
So in JDK8, it appears that all native memory will be used unless capped.
For Local Spark Master:
HADOOP_OPTS="$HADOOP_OPTS -XX:MaxMetaspaceSize=512m"
For Spark History server:
SPARK_DAEMON_JAVA_OPTS="$SPARK_DAEMON_JAVA_OPTS -XX:MaxMetaspaceSize=512m"
Let me know if you need anything added to Wiki around this.
> OOM for a simple query with spark.master=local [Spark Branch]
> -------------------------------------------------------------
>
> Key: HIVE-8373
> URL: https://issues.apache.org/jira/browse/HIVE-8373
> Project: Hive
> Issue Type: Bug
> Components: Spark
> Reporter: Xuefu Zhang
> Assignee: liyunzhang_intel
>
> I have a straigh forward query to run in Spark local mode, but get an OOM
> even though the data volumn is tiny:
> {code}
> Exception in thread "Spark Context Cleaner"
> Exception: java.lang.OutOfMemoryError thrown from the
> UncaughtExceptionHandler in thread "Spark Context Cleaner"
> Exception in thread "Executor task launch worker-1"
> Exception: java.lang.OutOfMemoryError thrown from the
> UncaughtExceptionHandler in thread "Executor task launch worker-1"
> Exception in thread "Keep-Alive-Timer"
> Exception: java.lang.OutOfMemoryError thrown from the
> UncaughtExceptionHandler in thread "Keep-Alive-Timer"
> Exception in thread "Driver Heartbeater"
> Exception: java.lang.OutOfMemoryError thrown from the
> UncaughtExceptionHandler in thread "Driver Heartbeater"
> {code}
> The query is:
> {code}
> select product_name, avg(item_price) as avg_price from product join item on
> item.product_pk=product.product_pk group by product_name order by avg_price;
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)