[Users] ploop, simfs, and disk space

Kir Kolyshkin Fri, 06 Apr 2012 14:38:34 -0700

I want to summarise some facts about simfs, ploop, and disk quota.


== simfs case ==

If container is on simfs, it is using host file system, usually /vz.Because many containers share one file system, per-container limits areneeded for both disk space used and disk i-nodes (roughly number offiles/directories) used. These two limits are called vzquota, and arecontrolled by --diskspace and --diskinodes parameters for vzctl set command.

For both diskspace and diskinodes, there are two values -- soft limitand hard limit, you can specify those using sss:hhh syntax. For example,"vzctl set 333 --diskspace 10G:11G --save" command sets the soft diskspace limit to 10 GB and the hard disk space limit to 11G. Thedifference between soft and hard limit is that the soft limit can betemporary exceeded, while soft limit can not be exceeded. Here"temporary" is defined by the third parameter, --quotatime, which setsthe time (in seconds) during which soft limit can be exceeded. Thisvalue is otherwise known as the grace period. Once the grace periodhas expired, the soft limit is enforced as a hard limit.


Example: admin sets disk limits in the following way:

vzctl set 333 --diskspace 10G:11G --diskinodes 1M:1.1M --quotatime3600 --save

Now, a container root can use 10G of disk space, and have about 1million files inside his CT. He can have 11G of disk space and about 1.1million files, but for no longer than 1 hour. If he uses more than 10Gof disk space, during the first hour (and only during the first hour) hewill still be able to use 11th gigabyte.

This dimensional system of space, inodes, soft limits, hard limits andgrace period is nothing new, it's the same as traditional UNIX per-userand per-group disk quotas. The only major difference is in this casequotas are per-CT (per simfs mount point).

There is a --diskquota parameter (and DISK_QUOTA config file parameter)which is used to enable/disable per-CT disk quotas. If you setDISK_QUOTA=no in /etc/vz/vz.conf, no per-CT disk quotas will beinitialized. If you set DISK_QUOTA=no in CT configuration file (e.g./etc/vz/conf/333.conf), no disk quotas for this CT will be initialized.

NOTE that as with any other disk quota, if you will write to the filesystem bypassing the quota (such as directly to VE_PRIVATE, e.g./vz/private/333), current quota usage values will be incorrect. In thatcase, you need to stop the CT and run vzctl quotainit, to recalculatequota usage. In some cases (such as after incorrect system shutdowncaused by power outage) quota files are marked dirty, and suchrecalculation is happening automatically during CT start.

For the sake of completeness, there is vzctl quotaon and vzctl quotaoffcommands, but usually you don't have to use those two, because quotaonis performed during vzctl mount (and vzctl start), and quotaoff isperformed during vzctl stop (and vzctl umount).

From inside the CT, utilities such as df are showing those quota limitsinstead of actual available disk space and inodes (this is implementedin the kernel by having a special version of statfs() syscall for simfswhich looks into vzquota). Sometimes it gets complicated, so if you seesomething strange in df output, it is either incorrect quota values(and you need to recalculate quota usage, see above), or perhaps thefilesystem disk space available is less than quota limits. For lots ofgory details on this stuff, please seehttp://wiki.openvz.org/Disk_quota,_df_and_stat_weird_behaviour

Also, you can check /proc/vz/vzquota to see for which containers quotais on, as well as its current limits and usage values.

I am leaving more advanced topics such as using vzquota utility directlyas a (highly optional) exercise for (highly) advanced users.


== ploop case ==

In ploop case, there is an image file and the underlying file system, sothere is no shared file system and vzquota is naturally not required.Therefore, options --diskquota (and DISK_QUOTA parameter), --diskinodesand --quotatime are silently ignored.

Option --diskspace is not ignored, but instead of changing vzquota diskspace limit, it initiates the resize of the CT ploop image file and thefilesystem which resides on top of that image.

NOTE that image and file system resize, especially in case when the CTis running (so-called online resize) is quite tricky, and in worst casescenario can lead to image or filesystem damage that is beyond repair.So exercise it a lot in testing environment, but do not abuse it inproduction*.

NOTE that specifying two values for --diskspace in case of ploop makesno sense. Only one value (hard limit) is used (and as with otherparameters, if you only specify one value, second one becomes equal tothe first one). So using --diskspace 1G:1.1G is the same as --diskspace1.1G (or --diskspace 0:1.1G). Easy rule: do not use two numbers fordiskspace, just one.

* NOTE ploop is not yet ready for production, and will not be for atleast a few more months.

_______________________________________________
Users mailing list
[email protected]
https://openvz.org/mailman/listinfo/users

[Users] ploop, simfs, and disk space

Reply via email to