Send Devl mailing list submissions to
        devl at freenetproject.org

To subscribe or unsubscribe via the World Wide Web, visit
        http://www.uprizer.com/mailman/listinfo/devl
or, via email, send a message with subject or body 'help' to
        devl-request at freenetproject.org

You can reach the person managing the list at
        devl-admin at freenetproject.org

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Devl digest..."


Today's Topics:

   1. Re: Proposal: algorithm for forgetting documents in datastore (Scott G. 
Miller)
   2. Re: Proposal: algorithm for forgetting documents in datastore (Scott G. 
Miller)
   3. Protozilla (Adam Langley)
   4. Seems quite relevant (Scott G. Miller)
   5. Re: Protozilla (Scott G. Miller)
   6. Re: Seems quite relevant (Sebastian Spaeth)
   7. Re: Proposal: algorithm for forgetting documents in datastore (Theodore 
Hong)
   8. Re: Proposal: algorithm for forgetting documents in datastore (Theodore 
Hong)
   9. Re: Protozilla (Adam Langley)

--__--__--

Message: 1
Date: Sun, 28 Jan 2001 12:05:33 -0500
To: devl at freenetproject.org
Subject: Re: [freenet-devl] Proposal: algorithm for forgetting documents in 
datastore
From: "Scott G. Miller" <scgmi...@indiana.edu>
Reply-To: devl at freenetproject.org


--hYooF8G/hrfVAmum
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Sun, Jan 28, 2001 at 02:28:26AM -0500, Tavin Cole wrote:
> On Sat, Jan 27, 2001 at 05:14:55PM -0500, Benjamin Coates wrote:
> > >It seems totally reasonble for a node to reject caching documents grea=
ter
> > >in size than a certain fraction of the total datastore size (1% or less
> > >I should think).  That prevents your worst-case example, but saves us =
from
> > >making relative value judgments using file size as the metric..
> >=20
> > 1% is pretty tiny, that's 1MB on the default datastore size, and also l=
eads to=20
> > the bizarre situation where all the insert nodes pass on the request an=
d=20
> > return success, even though none of them store it.  Maybe nodes should =
return=20
> > failure instead of passing through an insert they can't cache...
>=20
> OTOH, it's probably not good if a datastore can only hold a handful of ke=
ys
> b/c they are all a considerable fraction of its max size.  1% may not be
> right, tho.
1% is definitely too small.  I don't want any assumptions made unless they
include datastores that are on average 100-200mb in size (a size I
consider optimal in a large freenet consisting of 'normal', i.e., not
dedicated nodes).

I think its probably reasonable to store 5mb files in a datastore, as
well.  This all comes down to the file splitting issue, where there are a
number of questions;

Max file size
Minimum file size (under which we pad)
Continuous or discrete file sizes (ie only powers of 2, or any size).
Number of parts into which a file can be split but still be retrieved
Redundancy? (I'm with Ian, this is not really a good idea in Freenet)


--hYooF8G/hrfVAmum
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.4 (GNU/Linux)
Comment: For info see http://www.gnupg.org

iD8DBQE6dFFdr9IW4v3mHtQRAmA7AJ4tv/y7q/b416bkDj+X8rVPeZ2D7QCfQ1CT
QJU7dqh3xrJoPuA9EJvaiX8=
=hiq7
-----END PGP SIGNATURE-----

--hYooF8G/hrfVAmum--


--__--__--

Message: 2
Date: Sun, 28 Jan 2001 12:08:04 -0500
To: devl at freenetproject.org
Subject: Re: [freenet-devl] Proposal: algorithm for forgetting documents in 
datastore
From: "Scott G. Miller" <scgmi...@indiana.edu>
Reply-To: devl at freenetproject.org


--g7w8+K/95kPelPD2
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Sun, Jan 28, 2001 at 04:38:49PM +0000, Theodore Hong wrote:
> Tavin Cole <tavin at mailandnews.com> wrote:
> > It's clear to me now that the only sane mechanism for deciding what fil=
es
> > to keep in the document store is one that consistently preserves the
> > files that receive the greatest number of requests.  So let's define P =
as
> > the measure of a file's request history.  Simplistically,
> >
> >  P =3D total no. of requests for this file / time since file was stored
> >
> > In practice, though, I think we'd want to limit the past time interval =
to
> > what is "recent".  Also, the units in the denominator should not be too
> > granular, to keep P from blowing out of proportion for brand new files.
> > Something in the neighborhood of hours or minutes, probably.
>=20
> Limiting the past time interval would require picking an arbitrary cutoff.
> The best way is to apply a decay factor to old requests, something like:
>=20
> P =3D #requests in the past hour
>       + (#requests in the previous 2 hrs)/2
>       + (#requests in the previous 4 hrs)/4
>       + (#requests in the previous 8 hrs)/8
>       + ...

Every two hours you halve P.  When a request comes in you add a constant.


--g7w8+K/95kPelPD2
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.4 (GNU/Linux)
Comment: For info see http://www.gnupg.org

iD8DBQE6dFH0r9IW4v3mHtQRApN8AJ9d0fkUJWcC/1R8/8iAs/OUvDs2ZACfe/lY
05LzAtj+xs5aDi8OEuxGRIc=
=cAj/
-----END PGP SIGNATURE-----

--g7w8+K/95kPelPD2--


--__--__--

Message: 3
Date: Sun, 28 Jan 2001 17:47:40 +0000
From: Adam Langley <a...@linuxpower.org>
To: devl at freenetproject.org
Subject: [freenet-devl] Protozilla
Reply-To: devl at freenetproject.org


--0F1p//8PRICkK4MW
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

http://protozilla.mozdev.org/white-paper.html

Interresting bit:
Peer-to-peer (P2P) URLs: Seamless access to P2P protocols through=20
the familiar browser interface should make them much more accessible
to the end-user. Not only will the the user be able to access the
protocols by typing a P2P URL in the URL bar, but the clickable P2P
URLs may also be embedded in e-mail messages and other documents=20
and exchanged just like HTTP or FTP URLs. Protozilla enables=20
clients for P2P protocols to be implemented trivially in Mozilla.

AGL

--=20
Join in the new game that's sweeping the country.  It's called "Bureaucracy=
". Everybody stands in a circle.  The first person to do anything loses.

--0F1p//8PRICkK4MW
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.4 (GNU/Linux)
Comment: For info see http://www.gnupg.org

iEYEARECAAYFAjp0WzsACgkQzaVS3yy2PWCR7gCgpzd0ptByLFGnHjfOcM7oM46Z
uqYAnjJCO1LoJW903Nm53Ct3JCvK06//
=lJsk
-----END PGP SIGNATURE-----

--0F1p//8PRICkK4MW--


--__--__--

Message: 4
Date: Sun, 28 Jan 2001 12:39:44 -0500
To: devl at freenetproject.org
From: "Scott G. Miller" <scgmi...@indiana.edu>
Subject: [freenet-devl] Seems quite relevant
Reply-To: devl at freenetproject.org


--45Z9DzgjV8m4Oswq
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline

http://protozilla.mozdev.org/white-paper.html

Looks like the last to bullet points aim directly at us.


--45Z9DzgjV8m4Oswq
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.4 (GNU/Linux)
Comment: For info see http://www.gnupg.org

iD8DBQE6dFlgr9IW4v3mHtQRAl2OAJ4lWyAAC/Q1SXwIRQhajBwOYz4uGACdGpRG
6uAY/S2ZhcxZXxsuIuT7tQI=
=AWY+
-----END PGP SIGNATURE-----

--45Z9DzgjV8m4Oswq--


--__--__--

Message: 5
Date: Sun, 28 Jan 2001 12:58:03 -0500
To: devl at freenetproject.org
Subject: Re: [freenet-devl] Protozilla
From: "Scott G. Miller" <scgmi...@indiana.edu>
Reply-To: devl at freenetproject.org


--T4sUOijqQbZv57TR
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

Damn you!  FIRST POST!

On Sun, Jan 28, 2001 at 05:47:40PM +0000, Adam Langley wrote:
> http://protozilla.mozdev.org/white-paper.html
>=20
> Interresting bit:
> Peer-to-peer (P2P) URLs: Seamless access to P2P protocols through=20
> the familiar browser interface should make them much more accessible
> to the end-user. Not only will the the user be able to access the
> protocols by typing a P2P URL in the URL bar, but the clickable P2P
> URLs may also be embedded in e-mail messages and other documents=20
> and exchanged just like HTTP or FTP URLs. Protozilla enables=20
> clients for P2P protocols to be implemented trivially in Mozilla.
>=20
> AGL
>=20
> --=20
> Join in the new game that's sweeping the country.  It's called "Bureaucra=
cy". Everybody stands in a circle.  The first person to do anything loses.



--=20
cog gil sorter myrtle

--T4sUOijqQbZv57TR
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.4 (GNU/Linux)
Comment: For info see http://www.gnupg.org

iD8DBQE6dF2rr9IW4v3mHtQRAmozAJ0XW6jarQImWKMNTXOq4vVvks7WRACfYrlm
PhujwlDvBErgkmyrMIO1o0A=
=k61W
-----END PGP SIGNATURE-----

--T4sUOijqQbZv57TR--


--__--__--

Message: 6
Date: Sun, 28 Jan 2001 19:16:12 +0100
From: Sebastian Spaeth <sebast...@sspaeth.de>
Organization: University of =?iso-8859-1?Q?Link=F6ping?=
To: devl at freenetproject.org
Subject: Re: [freenet-devl] Seems quite relevant
Reply-To: devl at freenetproject.org

"Scott G. Miller" wrote:
> 
> http://protozilla.mozdev.org/white-paper.html
> 
> Looks like the last to bullet points aim directly at us.

It's already the official browser plugin for Mozilla (speaking for Win
at least) once we decide to include it...

Sebastian


--__--__--

Message: 7
From: Theodore Hong <t...@doc.ic.ac.uk>
Subject: Re: [freenet-devl] Proposal: algorithm for forgetting documents in 
datastore
To: devl at freenetproject.org
Date: Sun, 28 Jan 2001 18:45:00 +0000 (GMT)
Reply-To: devl at freenetproject.org

Oskar Sandberg <md98-osa at nada.kth.se> wrote:
> On Sun, Jan 28, 2001 at 02:16:21PM +0000, Theodore Hong wrote:
> > "Mark J. Roberts" <mjr at statesmean.com> wrote:
> < > 
> > > I tend to favor Ian's model because I'm a proponent of segmentation (even
> > > slightly redundant segmentation), and Ian's model encourages such
> > > segmentation. However, I certainly wouldn't want to see it pushed to
> > > extremes, where data must be inserted as tiny segments to survive. But I
> > > think we should discourage large (>1M) files. Looks like a compromise is
> > > necessary.
> > 
> > Historically, this tension between large and small resulted in the Great
> > Compromise of 1787 in the United States.  The rights of small states were
> > preserved by equal representation by state in the Senate, while those of
> > large states were preserved by proportional representation by size in the
> > House.  Perhaps we should adopt something similar.
> 
> Because the whole American democracy thing worked out sooo well....

As a matter of fact, it did (the Great Compromise part, anyway).  If you
notice, there are no arguments between large and small American states
anymore.  It is a complete non-issue, whereas it once threatened to
undermine the entire concept of "united" states.  See "qualified majority
voting" in the EU and tell me that getting states not to argue wasn't a
great triumph...

theo



--__--__--

Message: 8
From: Theodore Hong <t...@doc.ic.ac.uk>
Subject: Re: [freenet-devl] Proposal: algorithm for forgetting documents in 
datastore
To: devl at freenetproject.org
Date: Sun, 28 Jan 2001 18:47:56 +0000 (GMT)
Reply-To: devl at freenetproject.org

Theodore Hong <twh1 at doc.ic.ac.uk> wrote:
> Tavin Cole <tavin at mailandnews.com> wrote:
> > It's clear to me now that the only sane mechanism for deciding what files
> > to keep in the document store is one that consistently preserves the
> > files that receive the greatest number of requests.  So let's define P as
> > the measure of a file's request history.  Simplistically,
> >
> >  P = total no. of requests for this file / time since file was stored
> >
> > In practice, though, I think we'd want to limit the past time interval to
> > what is "recent".  Also, the units in the denominator should not be too
> > granular, to keep P from blowing out of proportion for brand new files.
> > Something in the neighborhood of hours or minutes, probably.
> 
> Limiting the past time interval would require picking an arbitrary cutoff.
> The best way is to apply a decay factor to old requests, something like:
> 
> P = #requests in the past hour
>       + (#requests in the previous 2 hrs)/2
>       + (#requests in the previous 4 hrs)/4
>       + (#requests in the previous 8 hrs)/8
>       + ...

obviously I mean "frequency of requests", not #requests...

> (I'm sure there's a nicer way to express this as a continuous function 
> using an integral, but I can't think of it at the moment.  Anyway, you
> get the idea.)
> 
> > When a new file is added to the store, as a result of either a request or
> > an insert, it could be allocated 1 request in the above equation.  When
> > the store is full, you basically want to discard the entries with lowest
> > P.
> >
> > This is trivial if all files are the same size.  The question is, if a
> > file is going to displace >1 smaller files, do we compare its P to the
> > summed P of the smaller files?  Do we compare its P to some function of
> > the P of the smaller files, where f(P1..n) < P1 + .. + Pn ?  Or do we
> > ignore the size difference and just look at the highest P of the smaller
> > files?
> 
> I think the most straightforward way is simply to compare the one file's P
> to the sum of all the P's it would displace.  This maximizes the number of
> successful requests.  For example, one 100k file getting 11 requests/day
> should be ranked higher than ten 10k files getting 1 request/day each.
> 
> The difficulty is deciding how to assess P for a new file.  If it's too
> low, new files may never make it in.  If it's too high, we just have the
> current situation (new files always have highest priority).
> 
> theo



--__--__--

Message: 9
Date: Sun, 28 Jan 2001 19:08:34 +0000
From: Adam Langley <a...@linuxpower.org>
To: devl at freenetproject.org
Subject: Re: [freenet-devl] Protozilla
Reply-To: devl at freenetproject.org


--G4iJoqBmSsgzjUCe
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Sun, Jan 28, 2001 at 12:58:03PM -0500, Scott G. Miller wrote:
> Damn you!  FIRST POST!

Yea, ok. Your post hadn't come thru when I sent mine. You win ;)

AGL

--=20
Join in the new game that's sweeping the country.  It's called "Bureaucracy=
". Everybody stands in a circle.  The first person to do anything loses.

--G4iJoqBmSsgzjUCe
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.4 (GNU/Linux)
Comment: For info see http://www.gnupg.org

iEYEARECAAYFAjp0bjIACgkQzaVS3yy2PWAaHQCfZDU8g2WzqQk4GNoQvRZz/ysW
aXkAoICaykDujwNpPiVDRgAwSOGUt1AI
=bgr9
-----END PGP SIGNATURE-----

--G4iJoqBmSsgzjUCe--



--__--__--

_______________________________________________
Devl mailing list
Devl at freenetproject.org
http://www.uprizer.com/mailman/listinfo/devl


End of Devl Digest

Reply via email to