Re: ASF Repository, closer.cgi and Depot

2004-07-15 Thread Erik Abele
On 14.07.2004, at 19:57, Mark R. Diggory wrote:
But then this becomes a project spanning both the Repository group and 
the various clients out there Depot/Maven/etc. And agreement on the 
GEO_IP request protocol and xml format etc becomes a touchy subject 
don't they?
Maybe but as I understood it, that's exactly the task assigned to the 
repository@ list, no? Setting up a basic repository framework on which 
the other projects can build their tools.

Well, spec work gets always a bit touchy ;-)
Cheers,
Erik
-Mark


smime.p7s
Description: S/MIME cryptographic signature


Re: ASF Repository, closer.cgi and Depot

2004-07-14 Thread Adam R. B. Jack

- Original Message - 
From: Mark R. Diggory [EMAIL PROTECTED]
To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
Sent: Wednesday, July 14, 2004 8:48 AM
Subject: ASF Repository, closer.cgi and Depot


 Sorry for the cross post but this seems relevant to both these groups.

 I was thinking about the subject of mirroring and redirection for the
 ASF Repository. Currently, there was some discussion on the Depot list
 concerning this. I feel we could address this subject again for both
 groups interest.

 www.apache.org/dyn/closer cgi provides a simple resolution strategy to
 attempt to determine the closest mirror available to the client browser.
 It then generates an html page via a template that lists the selected
 mirror as well as other available mirrors. With Depot, we have a
 customized download client that could be extended to manage downloading
 from a list of mirrors as well.

 Here are my thoughts on this subject:

 A.) This script is really not that big (90% of it is just parsing the
 mirrors file), and the database (a flat text file called mirrors.list)
 as well is not very big. While closer.cgi is a neat service for
 browsers. Its not exactly helpful for automated clients. Yet,
 mirrors.list is an excellent example of metadata that is exposed in a
 effective manner such that automated clients can access it.

 http://www.apache.org/mirrors/mirrors.list

 I'm somewhat convinced that a it would be simple to create a client
 implementation which accomplished the same functionality as closer.cgi
 programatically so that it could be used in terms of resolving a
 location to download from when mirrors are available.

 This would be beneficial to the Apache Bandwidth issue in that if a
 client such as Depot/DownloadManager managed the same capability as
 closer.cgi then:

Hmm, it seems to me that infra@ or mirrors@ (is that a list) probably have
views on this. (But then, we probably don't want 4 lists on here. :-) I
suspect their views would include what you suggest, that distribution might
save some nomimal (c.f. artifact sizes) bandwidth savings  give some CPU
saving, but it'd be at significant loss of 'control' (of well behaved
clients). Central control over this seems the most appealing.

Since I doubt the CPU cycles are worth saving (or the script would've been
optimised), could we not just change the script to check for some header
from the client, and return XML or some structured text, for non-human
browsers. [BTW: viewcvs seems to do this nicely, returning the file if
non-human and the presentation is human (as browser identifies).

regards,

Adam



Re: ASF Repository, closer.cgi and Depot

2004-07-14 Thread Mark R. Diggory
Erik Abele wrote:
I suspect their views would include what you suggest, that 
distribution might
save some nomimal (c.f. artifact sizes) bandwidth savings  give some 
CPU
saving, but it'd be at significant loss of 'control' (of well behaved
clients). Central control over this seems the most appealing.

Agreed.
Since I doubt the CPU cycles are worth saving (or the script would've 
been
optimised), could we not just change the script to check for some header
from the client, and return XML or some structured text, for non-human
browsers. [BTW: viewcvs seems to do this nicely, returning the file if
non-human and the presentation is human (as browser identifies).

This sounds promising. You have central control, you get the 
geoip-mapping stuff for free and the CPU cycles as well as the 
bandwidth for (XML-ized) responses are a no-brainer in this case.

But then this becomes a project spanning both the Repository group and 
the various clients out there Depot/Maven/etc. And agreement on the 
GEO_IP request protocol and xml format etc becomes a touchy subject 
don't they?

-Mark
begin:vcard
fn:Mark Diggory
n:Diggory;Mark
org:Harvard University;Harvard MIT Data Center
adr:Harvard University;;G-6 Littauer Center (North Yard);Cambridge;Ma;02138-2901;United States
email;internet:[EMAIL PROTECTED]
title:Software Engineer
tel;work:617 496 7246
tel;fax:617 495 0438
tel;home:617 718 2033 
tel;cell:617 285 4106
url:http://www.hmdc.harvard.edu
version:2.1
end:vcard