[ 
https://issues.apache.org/jira/browse/CARBONDATA-102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15391395#comment-15391395
 ] 

ASF GitHub Bot commented on CARBONDATA-102:
-------------------------------------------

GitHub user ravipesala opened a pull request:

    https://github.com/apache/incubator-carbondata/pull/53

    [CARBONDATA-102]Reduce the size of the CarbonData jar file.

    Following modifications done with this PR.
    1. Refactored and cleaned POM to remove unnecessary dependency jar files.
    2. Default CarbonData assembly jar does not include Spark, Scala and Hadoop 
dependencies. Example:
    ``` 
    mvn clean -DskipTests package
    ```
    3. User can provide profile `include-all` to include all dependencies. 
Example:
    ```
    mvn clean -DskipTests -Pinclude-all package
    ```
    
    Default size of CarbonData jar is 19MB
    With including all dependencies, CarbonData jar size is 207MB


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ravipesala/incubator-carbondata localstore_bug

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-carbondata/pull/53.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #53
    
----
commit 3cc132a17454afa466e66da6449de71e9ec0c736
Author: ravipesala <[email protected]>
Date:   2016-07-24T12:01:20Z

    Refactored and cleaned up POM

----


> Exclude the Spark and hadoop from CarbonData assembly jar by default and 
> reduce the jar file size
> -------------------------------------------------------------------------------------------------
>
>                 Key: CARBONDATA-102
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-102
>             Project: CarbonData
>          Issue Type: Improvement
>            Reporter: Ravindra Pesala
>            Priority: Minor
>             Fix For: Apache CarbonData 0.1.0-incubating
>
>
> Currently CarbonData assembly jar is huge and it is about 200MB size because 
> it includes Spark, Scala and Hadoop dependency jars.
> So we should not include Hadoop, Scala and Spark dependencies in CarbonData 
> jar by default as it is going to be deployed in Spark cluster.
> If user wish to include them we should give the option in maven build to 
> include all dependencies.
> Like default build like below will have only CarbonData and its dependencies 
> apart from Spark, Scala and Hadoop dependencies. 
> {code}
> mvn clean -DskipTests package
> {code}
> Below build includes all dependencies like Spark, Scala and Hadoop
> {code}
> mvn clean -DskipTests -Pinclude-all package
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to