On 02/01/2013, at 11:43 PM, Brett Porter <br...@apache.org> wrote:

> 
> On 02/01/2013, at 10:45 AM, Hervé BOUTEMY <herve.bout...@free.fr> wrote:
> 
>> Le lundi 24 décembre 2012 14:17:02 Brett Porter a écrit :
>>> I had the same feeling pushing up Continuum's Maven site recently...
>>> 
>>> On 23/12/2012, at 9:36 AM, Olivier Lamy <ol...@apache.org> wrote:
>>>> 2012/12/22 Kristian Rosenvold <kristian.rosenv...@zenior.no>:
>>>>> Svnsubpub is just ridiculously inefficient and we need to do something
>>>>> about it...
>>>> 
>>>> That remember me old time when pushing maven sites to sourceforge tru
>>>> wagon ftp ... :-)
>>> 
>>> Is the main problem the file-by-file nature, or something else (such as the
>>> size)?
>> I tried to measure and find facts about this: my conclusion is that it is 
>> the 
>> file-by-file nature
>> exactly like webdav
>> notice that we have really a bunch of generated files: did you realize that 
>> maven-scm site has more files than maven core? that it represents 131 MB, 
>> tens 
>> of thousand of files?
> 
> Yes, similar size in Continuum which I've been working on migrating to 
> svnpubsub recently.
> 
>> 
>> IMHO, if we stored zip files of documentations (or tar, the archive format 
>> can 
>> be discussed) and httpd could serve their content on the fly like if they 
>> were 
>> unzipped, this would be awesome
>> Apparently, coding an lua script could do the job: just need to find 
>> somebody 
>> able to do it :)
> 
> I've raised this topic and that thread on #asfinfra and will try and follow 
> up - I know this was flagged by them some time ago.

See my response to your mail over there today - this may already be possible: 
http://www.apache.org/dev/cmsref.html#generated-docs

But aside from that...

> 
>> 
>> any other idea?
> 
> For now I'm just focusing on reducing the number of changes each site 
> deployment to make it a smaller checkin.
> 
> See: http://continuum.apache.org/development/publishing-site.html
> 
> What I'm doing there is deploying to /docs/latest each time, then copying 
> that to the versioned location on the server-side. That way, the next version 
> is just the diff against the last, not a whole new one.
> 
> There are a few things that cause changes across generation runs:
> - the javadocs generate a timestamp, and this is the biggest culprit as it is 
> the largest number of files. I'm going to add -notimestamp tomorrow and see 
> if I can get sequential deploys to remain unmodified
> - most skins incorporated a timestamp that changes each build (I've removed 
> that in Continuum's skin). The publish date is still there, but it doesn't 
> seem to be a big problem.
> - the dependencies report seems to change between runs

By addressing those 3 things, I got down to either no, or very few, changes 
between commits (it seems Javadoc doesn't consistently order some elements on 
the page), e.g.: http://svn.apache.org/r1428378

The dependencies issue was fixed in MPIR-266 (which may need a release). Making 
the skin timestamp configurable is another task.

I added a few tips to the ASF site: 
http://www.staging.apache.org/dev/project-site.html#generated

> 
> For other reports the change more frequently, I've stopped publishing them to 
> the main site. There is Sonar for most of it at http://analysis.apache.org/, 
> or you can publish them to a filesystem staging in CI and view them through 
> the workspace.
> 
> With these sorts of improvements, incremental deployments will actually be 
> better than shipping up a large ZIP that gets unpacked, even though it is 
> mostly unchanged.


Does this help?

Cheers,
Brett

--
Brett Porter
br...@apache.org
http://brettporter.wordpress.com/
http://au.linkedin.com/in/brettporter
http://twitter.com/brettporter






---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@maven.apache.org
For additional commands, e-mail: dev-h...@maven.apache.org

Reply via email to