[ 
https://issues.apache.org/jira/browse/HADOOP-13688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15550134#comment-15550134
 ] 

Andrew Wang commented on HADOOP-13688:
--------------------------------------

The other issue I noticed is that there seems to be tons of extra content in 
the javadoc jars, like other jars and classfiles. It looks like it's sucking 
down the entire target directory:

After unzipping the hadoop-common javadoc jar:

{noformat}
-> % ls
allclasses-frame.html    findbugsXml.xml                                       
hadoop-maven-plugins-protoc-checksums.json  META-INF               src-html
allclasses-noframe.html  generated-sources                                     
help-doc.html                               org                    
stylesheet.css
antrun                   generated-test-sources                                
index-all.html                              overview-frame.html    test
archive-tmp              hadoop-annotations.jar                                
index.html                                  overview-summary.html  test-classes
classes                  hadoop-common-3.0.0-alpha2-SNAPSHOT                   
javadoc-bundle-options                      overview-tree.html     test-dir
constant-values.html     hadoop-common-3.0.0-alpha2-SNAPSHOT.jar               
jdiff.jar                                   package-list           xerces.jar
deprecated-list.html     hadoop-common-3.0.0-alpha2-SNAPSHOT-sources.jar       
log                                         script.js
docs-src                 hadoop-common-3.0.0-alpha2-SNAPSHOT-tests.jar         
maven-archiver                              serialized-form.html
findbugsExcludeFile.xml  hadoop-common-3.0.0-alpha2-SNAPSHOT-test-sources.jar  
maven-shared-archive-resources              site
andrew@zanarkand [03:21:07 PM] [/tmp/javadoc] 
-> % find . -name "*.class" | wc -l 
3455
andrew@zanarkand [03:21:11 PM] [/tmp/javadoc] 
-> % find . -name "*.jar" | wc -l
85
-> % find . -name "*.jar" -exec wc -c {} \; | awk -e '{SUM+=$1} END {print 
SUM}' 
41692556
{noformat}

Since the jarfiles by themselves are another 41MB, there's the potential to 
really streamline the javadoc jars.

> Stop bundling HTML source code in javadoc JARs
> ----------------------------------------------
>
>                 Key: HADOOP-13688
>                 URL: https://issues.apache.org/jira/browse/HADOOP-13688
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 2.6.4
>            Reporter: Andrew Wang
>            Assignee: Andrew Wang
>         Attachments: HADOOP-13688.001.patch
>
>
> We generate source code with line numbers for inclusion in the javadoc JARs. 
> Given that there's github and other online viewers, this doesn't seem so 
> useful these days.
> Disabling the "linkSource" option saves us 40MB for the hadoop-common javadoc 
> jar:
> {noformat}
> -rw-r--r-- 1 andrew andrew 98M Oct  5 14:44 
> hadoop-common-3.0.0-alpha2-SNAPSHOT-javadoc.jar
> -rw-r--r-- 1 andrew andrew 58M Oct  5 15:00 
> ./hadoop-common-project/hadoop-common/target/hadoop-common-3.0.0-alpha2-SNAPSHOT-javadoc.jar
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to