Unfortunately, Andrej's concerns are well-founded. sizelimit= was in
1.2.0, but I removed it from 1.3.0, and reserved_space= does not offer
exactly the same functionality. This ought to be mentioned in the NEWS
file (and sizelimit= should be listed as removed from
docs/configuration.txt; if it isn't, please file a docs bug ticket).
I removed sizelimit= because it turned out to be too expensive to use
on large storage servers, such as the ones we run here at
allmydata.com, which hold several million shares each. Each time the
node started, it had to do the python equivalent of /bin/du: walk
through all shares, measure their sizes, add them together, then
compare the total against the sizelimit= value. This caused node
startup to block for a long time (upwards of 15 minutes), impacting
server availability and discouraging us from upgrading servers in a
timely fashion.
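To give a sense of the cost, the startup scan amounted to something
like the following (a simplified sketch; the function name and the
flat directory layout are my illustration, not the node's actual
code). With millions of shares, that's one stat() call per share
before the node can even start enforcing the limit:

```python
import os

def total_share_size(storage_dir):
    """Sum the byte sizes of every share file under storage_dir.

    Roughly what the old sizelimit= code had to do at startup: one
    stat() per file, so a server holding millions of shares blocks
    here for many minutes before it can serve anything.
    """
    total = 0
    for dirpath, dirnames, filenames in os.walk(storage_dir):
        for name in filenames:
            total += os.path.getsize(os.path.join(dirpath, name))
    return total
```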
In addition, that du-style accounting of space consumed was
inaccurate, since it didn't always take the filesystem's minimum
block size into account. Consequently the server could easily use
much more space than you wanted it to.
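The gap comes from summing st_size (logical file length) rather than
the space the filesystem actually allocated. On POSIX systems,
st_blocks reports allocation in 512-byte units, so a tiny share on a
4 KiB-block filesystem consumes 4096 bytes on disk while st_size says
only a few. A sketch of the allocation-aware measurement (my
illustration; not code from the node):

```python
import os

def allocated_size(path):
    """Bytes actually consumed on disk, not the logical file length.

    POSIX st_blocks counts 512-byte units regardless of the
    filesystem's block size, so a 1-byte file on a 4 KiB-block
    filesystem reports 4096 here but 1 via st_size -- the gap that
    st_size-based accounting misses. (st_blocks is unavailable on
    Windows.)
    """
    return os.stat(path).st_blocks * 512
```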
The new reserved_space= control, added in 1.3.0, uses the python
equivalent of /bin/df (specifically os.statvfs), which is practically
instantaneous (a single syscall), because the filesystem keeps track
of partition-wide space usage continually. However, it doesn't enable
the kind of limits that Andrej would like to enforce, and we're still
looking for an os.statvfs equivalent for Windows (currently
reserved_space= is not honored on Windows).
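For reference, the statvfs check can be sketched in a couple of lines
(the function name and the f_bavail-based arithmetic are my
illustration, not the node's actual code). The key point is that the
cost is constant no matter how many shares the server holds:

```python
import os

def remaining_space(storage_dir, reserved_space):
    """Bytes the server may still accept before eating into
    reserved_space.

    os.statvfs is a single syscall; the filesystem already tracks
    partition-wide usage, so this is effectively instantaneous.
    f_bavail is the block count available to non-root users.
    (os.statvfs does not exist on Windows, which is why
    reserved_space= is not honored there.)
    """
    s = os.statvfs(storage_dir)
    free_bytes = s.f_frsize * s.f_bavail
    return max(0, free_bytes - reserved_space)
```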
With the new share-crawling framework I just added last week, we could
conceivably bring back sizelimit=. It would probably be applied
slowly: when first enabled, the node would do a slow (hours or days)
crawl of all shares, adding up their sizes in a non-blocking,
CPU-yielding manner, then start enforcing the limit once it had found
out how much space was actually in use. We'd persist the results of
the crawl to let us get moving faster on subsequent restarts.
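A crawl along those lines could look something like this (entirely
hypothetical sketch: the function name, JSON state file, and batch
scheme are mine, not the actual crawler framework). Each call
processes a bounded batch of shares and persists its progress, so the
node's event loop stays responsive and a restart resumes instead of
starting over:

```python
import json
import os

def crawl_step(state_path, storage_dir, batch=1000):
    """One bounded slice of a non-blocking share-size crawl.

    The node would schedule repeated calls from its reactor, yielding
    the CPU between slices. Progress is persisted to state_path so
    subsequent restarts pick up where the crawl left off.
    """
    if os.path.exists(state_path):
        with open(state_path) as f:
            state = json.load(f)
    else:
        state = {"pending": [storage_dir], "total": 0, "done": False}

    processed = 0
    while state["pending"] and processed < batch:
        path = state["pending"].pop()
        if os.path.isdir(path):
            # Queue children; directories don't count against the batch.
            state["pending"].extend(
                os.path.join(path, name) for name in os.listdir(path))
        else:
            state["total"] += os.path.getsize(path)
            processed += 1

    state["done"] = not state["pending"]
    with open(state_path, "w") as f:
        json.dump(state, f)
    return state
```

Once state["done"] is true, state["total"] is the figure the node
would compare against sizelimit=.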
We could also bring sizelimit= back as-is, with a warning that it may
take a long time to restart the node when there are a lot of shares.
Or, the Accounting work I'm slowly accomplishing in the background
would provide a more fine-grained limit, and would include a coarse
server-wide limit as a side-effect.
Hope that helps,
-Brian
On Mar 1, 2009, at 1:44 AM, Rogério Schneider <[email protected]>
wrote:
Andrej, what you want is 'sizelimit'. This configuration limits the
utilization of storage for a given client.
To share only 2GB, for example:
[storage]
enabled = true
sizelimit = 2000000000
http://allmydata.org/source/tahoe/trunk/docs/configuration.txt
Regards,
Rogério Schneider
On Sun, Mar 1, 2009 at 3:09 AM, Andrej Falout <[email protected]>
wrote:
Thanks Rogério,
That 'reserved_space' is, as documented, the minimum space tahoe
will try to keep free on your disk. Say, when your disk goes down to
only 2GB (in my config) it will stop storing new chunks. This is
exactly what you want.
Unless I'm missing something, not exactly.
On my 3T mount, this would consume all the space between the
currently used 1T and (3T-2G).
I'd like to allocate only a portion of (3T-2G) to Tahoe storage, as
I will need the rest in the future.
Therefore the need for something like a storage_use_max=xxxx
parameter.
IIUC, reserved_space= has the meaning of "don't store any more data
if partition free space is less than this".
I would like to be able to declare "use a maximum of x MB, but only
if doing so will not reduce available space under y MB".
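The two-sided rule Andrej describes could be expressed as a single
admission check (a hypothetical sketch: storage_use_max is not a real
Tahoe option, it just names the requested feature, and the function
and its parameters are mine):

```python
def may_accept(share_size, used_by_tahoe, disk_free,
               storage_use_max, reserved_space):
    """Accept a new share only if both limits hold: Tahoe's own
    usage stays under storage_use_max, AND the partition keeps at
    least reserved_space free after the write. This combines the
    removed sizelimit= semantics with the 1.3.0 reserved_space=
    semantics.
    """
    return (used_by_tahoe + share_size <= storage_use_max
            and disk_free - share_size >= reserved_space)
```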
--
Andrej Falout
_______________________________________________
tahoe-dev mailing list
[email protected]
http://allmydata.org/cgi-bin/mailman/listinfo/tahoe-dev
--
Rogério Schneider
MSN: [email protected]
GTalk: [email protected]
TerraVoip: stockrt
Skype: stockrt