[ 
https://issues.apache.org/jira/browse/SOLR-10298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926938#comment-15926938
 ] 

Cassandra Targett commented on SOLR-10298:
------------------------------------------

After an earlier Confluence upgrade, we had a similar problem (SOLR-5819). At 
that time, the cause was determined to be massive duplication of images inside 
the PDF itself. A tool was developed to reduce the PDF size, and it was used 
for a time on each new release.

The problem in SOLR-5819 is likely *not* the same problem we have here. As a 
test, I ran that tool on the new PDF and the result was a PDF of about 30MB, 
only about 500KB smaller, which is negligible at this size.

> Reduce size of new Ref Guide PDF
> --------------------------------
>
>                 Key: SOLR-10298
>                 URL: https://issues.apache.org/jira/browse/SOLR-10298
>             Project: Solr
>          Issue Type: Sub-task
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: documentation
>            Reporter: Cassandra Targett
>
> The new Ref Guide PDF is ~31MB in size, which is more than 2x the current PDF 
> produced by Confluence (which is 14MB).
> The asciidoctor-pdf project has a script to optimize the PDF, mostly by 
> scaling down images. When I run this tool on the new PDF, the size is reduced 
> to ~18MB. (More info on this script: 
> https://github.com/asciidoctor/asciidoctor-pdf#optional-scripts).
> Some of the current image files are very large, so I believe that by 
> scaling the images down, we can make the PDF smaller without adding a step 
> in the build to run the optimize script programmatically (it also has a 
> dependency on Ghostscript, so it would be nice to not add another dependency 
> if it can be avoided).
> The new PDF is also about 300 pages longer, but this issue is primarily 
> concerned with file size. However, reducing the number of pages will also 
> make it smaller. A few things that could be tried to reduce the # of pages:
> * Reduce font sizes
> * Reduce page margins
> * Review options for when a forced page-break is used and modify if possible
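
As a starting point for deciding which images to scale down, here is a minimal 
sketch that lists image files above a size threshold, largest first. The 
extension list, the 500KB default threshold, and the function name 
find_large_images are assumptions made for illustration. Nothing here comes 
from the Solr build or from the asciidoctor-pdf scripts, and the directory to 
scan is passed in because the location of the Ref Guide images is not stated 
above.

```python
import os
import sys

# Assumed set of image extensions; adjust to match the files actually used.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".gif"}


def find_large_images(root, min_bytes=500 * 1024):
    """Return (size_in_bytes, path) pairs for images of at least min_bytes,
    sorted largest first. The 500KB default is an arbitrary assumption."""
    hits = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            extension = os.path.splitext(name)[1].lower()
            if extension not in IMAGE_EXTENSIONS:
                continue
            path = os.path.join(dirpath, name)
            size = os.path.getsize(path)
            if size >= min_bytes:
                hits.append((size, path))
    return sorted(hits, reverse=True)


if __name__ == "__main__":
    # Scan the directory given on the command line, or the current one.
    root = sys.argv[1] if len(sys.argv) > 1 else "."
    for size, path in find_large_images(root):
        print(f"{size / 1024:8.0f} KB  {path}")
```

Running this against the directory that holds the images would show which 
files are worth resizing by hand before considering an extra build step.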


