Re: Trimming the CPAN - Automatic Purging

2010-03-30 Thread David Cantrell
On Sun, Mar 28, 2010 at 07:28:48AM -0700, dhu...@hudes.org wrote:

 The danger in a CPAN::Mini and in removing old versions is that one is
 assuming that the latest and greatest is the one to use. This is false.

And this is why I run cp5.6.2an.barnyard.co.uk etc.  

It wouldn't be difficult for someone to take my code and customise it
further to, eg, also pin a few modules that rely on the particular
versions of third-party libraries that you use.

-- 
David Cantrell | Bourgeois reactionary pig

Eye have a spelling chequer / It came with my pea sea
It planely marques four my revue / Miss Steaks eye kin knot sea.
Eye strike a quay and type a word / And weight for it to say
Weather eye am wrong oar write / It shows me strait a weigh.


Re: Trimming the CPAN - Automatic Purging

2010-03-30 Thread David Cantrell
On Sun, Mar 28, 2010 at 06:04:03PM -0400, David Golden wrote:

 As always with perl, it depends.  They are laid out just as a normal
 CPAN repository, so if you have one in your urllist, something
 specified as author/distribution.tar.gz might well resolve.

Not just might well resolve.  It *will* work.  If you use one of my
cpXXXan mirrors, you're hitting a BackPAN mirror with a custom index.

  *However*,
 they don't necessarily have up-to-date index files.  Compare
 timestamps on 02packages.details.txt

Indeed.  I don't imagine that that would be hard for Andreas to keep in
sync!

-- 
David Cantrell | even more awesome than a panda-fur coat

IMO, the primary historical significance of Unix is that it marks the
time in computer history where CPUs became so cheap that it was possible
to build an operating system without adult supervision.
 -- Russ Holsclaw in a.f.c


Re: Tidy up your PAUSE directories

2010-03-30 Thread Rene Schickbauer

brian d foy wrote:

It's time for Spring cleaning again. If you have ancient versions of
modules sitting around in your PAUSE directory, consider letting them
retire to BackPAN (http://backpan.cpan.org). They don't disappear from
the world, but they don't inflate CPAN either. You don't have to do
anything, but many mirror operators might be happy that you did. :)


Just my one point nine periodic cents: If you got ancient modules 
sitting around that you wont be updating anymore be good and ask around 
if someone wants to take over.


If the module is simple enough (and possibly an Acme module), you could 
also ask in beginners: Maintaining a simple module with an existing user 
base is in my opinion a good learning experience!


LG
Rene

Note: I've taken over three Acme-modules. Among them is Acme-AutoColor. 
If you lack any non-standard colors (i just added octarine), more 
features or something like that, just mail me. The other two, 
Acme::Innuendo and Acme::Mobile::Therbligs need updating too (working on 
it), ideas are welcome as well.


Re: Trimming the CPAN - Automatic Purging

2010-03-30 Thread Arthur Corliss

On Tue, 30 Mar 2010, Matija Grabnar wrote:


Er, not exactly. Read
http://www.cvsup.org/howsofast.html


I had read  http://www.cvsup.org/faq.html#features  item #3.

From what I can see, cvsup uses the rsync algorithm on a file-by-file basis 
(it uses just the differential send part of the rsync algorithm). It doesn't 
rsync the whole tree, which was what I understood to be the original problem 
(wasn't the complaint about the flood of stats?).


Sounds like I may have interpreted the FAQ incorrectly, then.  Thanks for
pointing that out.  I have a few question, though: the explanation says:

   At the same time, the Tree Differ generates a list of the server's
   files.

That seems to infer that it's doing the exact same thing as rsync, so all 
the stats are still present on the server, right?


Nowhere do I see it mentioning that the daemon is maintaining state between
requests.  The primary speed-ups (beyond special file update handling) is
better use of bidirectional bandwidth.

Do you have access to a cvsup server so you can verify its behavior?

So if you want to make a tool that works fine for large mirrors, your 
priority apparently should be to reduce the lots of stats part which is 
used to determine exactly what files need to be considered for checking. 
(Rsync already makes sure all the *other* I/O operations are minimized).


Agreed.

Now the key, as I see it, is that unlike all the other use cases where rsync 
is used, large mirrors are likely to have their directories directly 
transfered from another mirror. So, the client that pulled the tree update 
down could store a list of changed files, and the server could then just use 
that list to determine which files
need to be synced to the downstream mirror. (Sure, the original site has to 
generate the list, but if they use a tool like PAUSE to upload the files, 
that shouldn't be hard to do).


Agreed, but I'm not sure we've gotten past the stat storm on the server,
though.

--Arthur Corliss
  Live Free or Die


Re: Trimming the CPAN - Automatic Purging

2010-03-30 Thread Arthur Corliss

On Tue, 30 Mar 2010, Rene Schickbauer wrote:

snip

This could work like any modern, distributed version control systems. That 
way, the user would also be able to apply local patches and/or deciding which 
changesets to pull in from the main server. Or have a complete, local mirror 
and one for the production systems where he/she pulls in changes after they 
have been reviewed.



NOW its time to kick my butt, if you want to.


:-) No one can accuse you of not being ambitious.  It's a neat idea, but
definitely an involved solution.  While it could solve a lot of problems I
think the human component is going to be your biggest obstacle.  As we've
seen from the reaction to the heretical notion of ditching rsync I have to
imagine getting everyone to ditch their favorite RCS tool would be even
worse.

Basically, we should just all get onboard with git (disclaimer:  I don't use
git myself, so my understanding may be deficient), a decentralized
distributed RCS.  And have developers periodically merge their branches.

Tough sell.  It probably would solve a bunch of issues, but you're treading
into vi versus emacs territory.  ;-)

--Arthur Corliss
  Live Free or Die


Re: Trimming the CPAN - Automatic Purging

2010-03-30 Thread David Nicol
On Sun, Mar 28, 2010 at 2:32 PM, Elaine Ashton eash...@mac.com wrote:

 On Mar 28, 2010, at 12:48 PM, Randy Kobes wrote:


 Has some sort of disk quota system for CPAN author accounts ever been 
 considered?

 Not specifically, no, at least not that I'm aware of. That would have to be 
 implemented on PAUSE and quotas frequently end up not solving the real 
 problem and create a headache both for the sysadmin and the users.

new proposal: Make modules pay rent in order to remain on a mirror.
Rent could be in the form of actual user interest, or good reviews.

Use as a dependency could count as rent.

Or simple downloading.  A mirror server that functioned more as a
cache than a mirror would also work: only the files that are actually
requested need be stored, as long as the mirror server knows how to
get something else if requested.  If the root cause of The Pain turns
out to be full mirroring then do partial mirroring, and automate the
partition with a policy instead of trying to plan the partition.




-- 
question doubt