Re: spark.sql.autoBroadcastJoinThreshold not taking effect

Lantao Jin Mon, 13 May 2019 21:24:23 -0700

Maybe you could try “--conf spark.sql.statistics.fallBackToHdfs=true"


On 2019/05/11 01:54:27, V0lleyBallJunki3 <[email protected]> wrote: 
> Hello,> 
>    I have set spark.sql.autoBroadcastJoinThreshold=1GB and I am running the> 
> spark job. However, my application is failing with:> 
> 
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)> 
>       at> 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)>
>  
>       at> 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)>
>  
>       at java.lang.reflect.Method.invoke(Method.java:498)> 
>       at> 
> org.apache.spark.deploy.yarn.ApplicationMaster$$anon$2.run(ApplicationMaster.scala:678)>
>  
> Caused by: org.apache.spark.SparkException: Cannot broadcast the table that> 
> is larger than 8GB: 8 GB> 
>       at> 
> org.apache.spark.sql.execution.exchange.BroadcastExchangeExec$$anonfun$relationFuture$1$$anonfun$apply$1.apply(BroadcastExchangeExec.scala:103)>
>  
>       at> 
> org.apache.spark.sql.execution.exchange.BroadcastExchangeExec$$anonfun$relationFuture$1$$anonfun$apply$1.apply(BroadcastExchangeExec.scala:76)>
>  
>       at> 
> org.apache.spark.sql.execution.SQLExecution$$anonfun$withExecutionId$1.apply(SQLExecution.scala:101)>
>  
>       at> 
> org.apache.spark.sql.execution.SQLExecution$.withSQLConfPropagated(SQLExecution.scala:125)>
>  
>       at> 
> org.apache.spark.sql.execution.SQLExecution$.withExecutionId(SQLExecution.scala:98)>
>  
>       at> 
> org.apache.spark.sql.execution.exchange.BroadcastExchangeExec$$anonfun$relationFuture$1.apply(BroadcastExchangeExec.scala:75)>
>  
>       at> 
> org.apache.spark.sql.execution.exchange.BroadcastExchangeExec$$anonfun$relationFuture$1.apply(BroadcastExchangeExec.scala:75)>
>  
>       at> 
> scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)>
>  
>       at> 
> scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)> 
>       at> 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)>
>  
>       at> 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)>
>  
>       at java.lang.Thread.run(Thread.java:748)> 
> 
> When I am running with a limit 1 GB how can I hit the 8 GB limit? I made> 
> sure in the Spark History Server as well by printing out the value of> 
> spark.sql.autoBroadcastJoinThreshold that the value is correctly set and> 
> explain plan also shows that it is trying to do a Broadcast Join. Any ideas? 
> > 
> 
> 
> 
> --> 
> Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/> 
> 
> ---------------------------------------------------------------------> 
> To unsubscribe e-mail: [email protected]> 
> 
> 
---------------------------------------------------------------------
To unsubscribe e-mail: [email protected]

Re: spark.sql.autoBroadcastJoinThreshold not taking effect

Reply via email to