[ 
https://issues.apache.org/jira/browse/SPARK-15821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-15821:
------------------------------------

    Assignee:     (was: Apache Spark)

> Should we use mvn -T for multithreaded Spark builds?
> ----------------------------------------------------
>
>                 Key: SPARK-15821
>                 URL: https://issues.apache.org/jira/browse/SPARK-15821
>             Project: Spark
>          Issue Type: Question
>          Components: Build
>            Reporter: Adam Roberts
>            Priority: Minor
>
> With Maven we can build Spark in a multithreaded way and benefit from 
> increased build time performance as a result.
> On a machine with eight cores, I noticed the build time reduced from 20-25 
> minutes to five minutes; this is by building with
> mvn -T 1C -Pyarn -Phadoop-2.7 -Phive -Phive-thriftserver -DskipTests clean 
> package
> -T 1C says that we'll use one extra thread for each core available, I've 
> never experienced a problem with using this option (ranging from a single 
> cored box to one with 192 cores available)
> Should we use this for building Spark quicker or is the Jenkins job 
> deliberately set up such that each "executor" is needed for each pull request 
> and we wouldn't see an improvement anyway? 
> This can be discovered by checking core utilization across the farm and can 
> potentially reduce our build times.
> Here's more information on the feature: 
> https://cwiki.apache.org/confluence/display/MAVEN/Parallel+builds+in+Maven+3
> If this isn't suitable for the current farm then I think we should document 
> it for those building Spark from source



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to