[ 
https://issues.apache.org/jira/browse/SPARK-16357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

zeweichen updated SPARK-16357:
------------------------------
    Description: 
We use TPCx-BB(BigBench) to evaluate the performance of Spark shuffle RPC 
encryption in our local cluster(E5-2699 v3, 256G, 72 vcores, 1 master node + 5 
worker nodes). 
During our performance test of Spark shuffle RPC encryption using 3DES on 
Sparksql, we found that throughput test (run queries in parallel ) of 2 stream 
lasts 2.68X time than power test(run query one by one), which is much larger 
than it should be. Q30 cost 0.5 hour in stream1,  but q30 last 2.5 hours in 
stream 0 which is much longer than it should be.  This caused that queries in 
stream1 go ahead much faster than stream0. So it works like single stream and 
can not fully use CPU resource.

  was:
We use TPCx-BB(BigBench) to evaluate the performance of Spark shuffle RPC 
encryption in our local cluster(E5-2699 v3, 256G, 72 vcores, 1 master node + 5 
worker nodes).
During our performance test of Spark shuffle RPC encryption using 3DES on 
Sparksql, we found that throughput test (run queries in parallel ) of 2 stream 
lasts 2.68X time than power test(run query one by one), which is much larger 
than it should be. Q30 cost 0.5 hour in stream1,  but q30 last 2.5 hours in 
stream 0 which is much longer than it should be.  This caused that queries in 
stream1 go ahead much faster than stream0. So it works like single stream and 
can not fully use CPU resource.


> After enabling Spark shuffle RPC encryption using 3DES, Sparksql query has 
> poor performance when running in parallel.
> ---------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-16357
>                 URL: https://issues.apache.org/jira/browse/SPARK-16357
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 1.6.1
>         Environment: Apache Spark 1.6.1 
> TPCx-BB 1.0.1
> jdk 1.7
>            Reporter: zeweichen
>            Priority: Minor
>
> We use TPCx-BB(BigBench) to evaluate the performance of Spark shuffle RPC 
> encryption in our local cluster(E5-2699 v3, 256G, 72 vcores, 1 master node + 
> 5 worker nodes). 
> During our performance test of Spark shuffle RPC encryption using 3DES on 
> Sparksql, we found that throughput test (run queries in parallel ) of 2 
> stream lasts 2.68X time than power test(run query one by one), which is much 
> larger than it should be. Q30 cost 0.5 hour in stream1,  but q30 last 2.5 
> hours in stream 0 which is much longer than it should be.  This caused that 
> queries in stream1 go ahead much faster than stream0. So it works like single 
> stream and can not fully use CPU resource.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to