[zfs-discuss] one time passwords - apache infrastructure incident report 8/28/2009

2009-09-04 Thread russell aspinwall
Hi,

Just been reading the apache.org incident report for 8/28/2009 
( https://blogs.apache.org/infra/entry/apache_org_downtime_report )

The use of Solaris and ZFS on the European server was interesting, including the 
recovery.

However, what I found more interesting was the use of one-time passwords, which 
are supported by FreeBSD ( 
http://www.freebsd.org/doc/en/books/handbook/one-time-passwords.html ). 
Could or should this technology be incorporated into OpenSolaris?
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Read about ZFS backup - Still confused

2009-09-04 Thread Thomas Burgess
Let me explain what I have and you can decide if it's what you're looking for.
I run a home NAS based on ZFS (due to hardware issues I am using FreeBSD 7.2
as my OS, but all the data is on ZFS).
This system has multiple uses.  I have about 10 users and 4 HTPCs connected
via gigabit.  I have ZFS filesystems for Video, Audio and Data.

I have no problem using it for my main iTunes library or storing downloaded
and recorded video.  Each user also has their own share to store data and
backups.

The system itself is made up of 3 raidz vdevs right now, each with 4 1TB
hard drives, so I have about 9 TB of usable space right now. Having a setup like
this sort of changes how you do things.  I have several computers, but all
the stuff I care about is on the NAS.  I am very happy with ZFS for this
purpose.  I originally used a Linux backend with mdadm and XFS, but I am very
much in love with my new system.  I love the ability to clone and snapshot
and I use it often.  It has already saved me from human error on 2 occasions.
It's also very fast.  I'm using cheap parts and have seen speeds over 250
MB/s, although I get around 30 MB/s per client on average with Samba.  For
streaming music and video it has never shuddered or skipped.  I have mostly
720p video but a large amount of 1080p as well.  It's not uncommon to have 3
HTPCs streaming at the same time and 2 people using the network for other
stuff.  I'm very happy with it.


I'm SURE you can find a method to back up and restore your data with ZFS.  Just
think of it more as a backend solution.  You'll still probably use whatever
method you're used to for transferring data, although I use a combination of
samba/nfs and even FTP.  If you're used to tar, no need to stop using it.
You might also look at rsync.
You could set up a ZFS filesystem on the NAS and set up rsync on your
client, then set up automatic snapshots on the ZFS machine.  This way you'd
have multiple methods of restoring (you could just dump back the latest
rsync or you could clone one of the older snapshots and dump THAT back).
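A minimal sketch of that workflow (the hostnames, dataset names and paths here
are made up, not anything from this thread):

  # on the client: push the data to its share on the NAS
  rsync -a --delete /home/me/ nas:/tank/backup/me/

  # on the NAS: take a dated snapshot after each run
  zfs snapshot tank/backup/me@`date +%Y-%m-%d`

Restoring is then either another rsync back from the share, or cloning one of
the older snapshots and copying from the clone.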




On Thu, Sep 3, 2009 at 4:58 PM, Cork Smith corkb...@sbcglobal.net wrote:

 Let me try rephrasing this. I would like the ability to restore so my
 system mirrors its state at the time when I backed it up, given that the old hard
 drive is now a doorstop.

 Cork
 --
 This message posted from opensolaris.org
 ___
 zfs-discuss mailing list
 zfs-discuss@opensolaris.org
 http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] How to find poor performing disks

2009-09-04 Thread Roch

Scott Lawson writes:
  Also you may wish to look at the output of 'iostat -xnce 1' as well.
  
  You can post those to the list if you have a specific problem.
  
  You want to be looking for error counts increasing and specifically 'asvc_t'
  for the service times on the disks. A higher number for asvc_t may help to
   isolate poorly performing individual disks.
  
  

I blast the pool with dd, and look for drives that are
*always* active, while others in the same group have
completed their transaction group and get no more activity.
Within a group, drives should be getting the same amount of
data per 5 seconds (zfs_txg_synctime), and the ones that are
always active are the ones slowing you down.

If whole groups are unbalanced, that's a sign that they have
different amounts of free space, and the expectation is that
you will be gated by the speed of the group that needs to
catch up. 
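Something like the following is the rough recipe (pool, file path and interval
are placeholders, not a definitive procedure):

  # generate a sustained streaming write load on the pool
  dd if=/dev/zero of=/tank/test/bigfile bs=1024k count=100000 &

  # watch per-disk activity; a disk whose asvc_t stays high, or which is
  # still busy while its vdev peers have gone idle, is the one holding
  # you back
  iostat -xn 5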

-r

  
  Scott Meilicke wrote:
   You can try:
  
   zpool iostat pool_name -v 1
  
   This will show you IO on each vdev at one second intervals. Perhaps you 
   will see different IO behavior on any suspect drive.
  
   -Scott
 
  
  
  ___
  zfs-discuss mailing list
  zfs-discuss@opensolaris.org
  http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] ARC limits not obeyed in OSol 2009.06

2009-09-04 Thread Roch

Do you have the zfs primarycache property on this release?
If so, you could set it to 'metadata' or 'none'.

 primarycache=all | none | metadata

 Controls what is cached in the primary cache  (ARC).  If
 this  property  is set to all, then both user data and
 metadata is cached. If this property is set  to  none,
 then  neither  user data nor metadata is cached. If this
 property is set to metadata,  then  only  metadata  is
 cached. The default value is all.
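If the property is there, checking and setting it is just (the dataset name is
a placeholder):

  zfs get primarycache tank/data           # confirm the property exists on this build
  zfs set primarycache=metadata tank/data  # or =none to keep user data out of the ARC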


-r


Udo Grabowski writes:
  Hi,
  we've capped the ARC size to 512 MB via 'set zfs:zfs_arc_max = 0x20000000' in
  /etc/system, since the ARC still does not release memory when applications
  need it (this is another bug). But this hard limit is not obeyed; instead,
  when traversing all files in a large and deep directory, we see the values
  below (the ARC started with 300 MB). After a while the machine (Ultra 20 M2
  with 6GB) swaps and then, hours later, freezes completely (no reaction even
  to a quick push of the power button, no ping, no mouse; we have to hard
  reset). arc_summary clearly shows that the limits are not what they are
  supposed to be. If this is working as intended, then the intention must be
  changed. As poorly as the ARC is working now, it's absolutely necessary that
  a hard limit is indeed a hard limit for the ARC. Please fix this. Is there
  anything I can do to really limit or switch off the ARC completely? It has
  been breaking our production work often since we installed OSol (we came
  from SXDE 1.08, which worked better); we must find a way to stop this
  problem as fast as possible!
  
  arcstat:
  Time  read  miss  miss%  dmis  dm%  pmis  pm%  mmis  mm%  arcsz c  
  13:22:16   95M   23M 24   10M   14   12M   64   22M   24   963M  536M  
  13:22:17    2K   256 10796   177   15   2229   965M  536M  
  13:22:18    2K   490 22   119   10   371   38   482   22   970M  536M  
  13:22:19    4K   214  4   1506643   1403   971M  536M  
  13:22:20    2K   427 19574   370   37   419   19   971M  536M  
  13:22:21    1K   208 19   103   17   105   21   202   19   971M  536M  
  
  13:23:16    1K   481 27808   401   47   478   27 1G 536M  
  13:23:17    2K   255 11   125   10   130   13   218   10 1G 536M  
  and counting...
  arc_summary:
  System Memory:
   Physical RAM:  6134 MB
   Free Memory :  1739 MB
   LotsFree:  95 MB
  
  ZFS Tunables (/etc/system):
   set zfs:zfs_arc_max = 0x20000000
  
  ARC Size:
   Current Size: 1357 MB (arcsize)
   Target Size (Adaptive):   512 MB (c)
   Min Size (Hard Limit):191 MB (zfs_arc_min)
   Max Size (Hard Limit):512 MB (zfs_arc_max)
  
  ARC Size Breakdown:
   Most Recently Used Cache Size:  93%479 MB (p)
   Most Frequently Used Cache Size: 6%32 MB (c-p)
  
  ARC Efficency:
   Cache Access Total: 97131108
   Cache Hit Ratio:  75%   7321   [Defined State for 
  buffer]
   Cache Miss Ratio: 24%   23886667   [Undefined State for 
  Buffer]
   REAL Hit Ratio:   67%   65874421   [MRU/MFU Hits Only]
  
   Data Demand   Efficiency:66%
   Data Prefetch Efficiency: 8%
  
  CACHE HITS BY CACHE LIST:
Anon:   --%Counter Rolled.
Most Recently Used: 15%11463028 (mru) [ 
  Return Customer ]
Most Frequently Used:   74%54411393 (mfu) [ 
  Frequent Customer ]
Most Recently Used Ghost:   10%7537123 (mru_ghost)[ 
  Return Customer Evicted, Now Back ]
Most Frequently Used Ghost: 19%14619417 (mfu_ghost)   [ 
  Frequent Customer Evicted, Now Back ]
  CACHE HITS BY DATA TYPE:
Demand Data: 3%2716192 
Prefetch Data:   0%3506 
Demand Metadata:86%63089419 
Prefetch Metadata:  10%7435324 
  CACHE MISSES BY DATA TYPE:
Demand Data: 5%1365132 
Prefetch Data:   0%36544 
Demand Metadata:40%9664064 
Prefetch Metadata:  53%12820927
  -- 
  This message posted from opensolaris.org
  ___
  zfs-discuss mailing list
  zfs-discuss@opensolaris.org
  http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Petabytes on a budget - blog

2009-09-04 Thread Marc Bevand
Bill Moore Bill.Moore at sun.com writes:
 
 Moving on, modern high-capacity SATA drives are in the 100-120MB/s
 range.  Let's call it 125MB/s for easier math.  A 5-port port multiplier
 (PM) has 5 links to the drives, and 1 uplink.  SATA-II speed is 3Gb/s,
 which after all the framing overhead, can get you 300MB/s on a good day.
 So 3 drives can more than saturate a PM.  45 disks (9 backplanes at 5
 disks + PM each) in the box won't get you more than about 21 drives
 worth of performance, tops.  So you leave at least half the available
 drive bandwidth on the table, in the best of circumstances.  That also
 assumes that the SiI controllers can push 100% of the bandwidth coming
 into them, which would be 300MB/s * 2 ports = 600MB/s, which is getting
 close to a 4x PCIe-gen2 slot.

Wrong. The theoretical bandwidth of an x4 PCI-E v2.0 slot is 2GB/s per
direction (5Gbit/s before 8b-10b encoding per lane, times 0.8, times 4),
amply sufficient to deal with 600MB/s.

However, they don't have this kind of slot; they have x2 PCI-E v1.0
slots (500MB/s per direction). Moreover, the SiI3132 defaults to a
MAX_PAYLOAD_SIZE of 128 bytes, therefore my guess is that each 2-port
SATA card is only able to provide 60% of the theoretical throughput[1],
or about 300MB/s.

Then they have 3 such cards: total throughput of 900MB/s.

Finally the 4th SATA card (with 4 ports) is in a 32-bit 33MHz PCI slot
(not PCI-E). In practice such a bus can only provide a usable throughput
of about 100MB/s (out of 133MB/s theoretical).

All the bottlenecks are obviously the PCI-E links and the PCI bus.
So in conclusion, my SBNSWAG (scientific but not so wild-ass guess)
is that the max I/O throughput when reading from all the disks on
one of their storage pods is about 1000MB/s. This is poor compared to
a Thumper for example, but the most important factor for them was
GB/$, not GB/sec. And they did a terrific job at that!

 And I'd re-iterate what myself and others have observed about SiI and
 silent data corruption over the years.

Irrelevant, because it seems they have built fault-tolerance higher in
the stack, à la Google. Commodity hardware + reliable software = great
combo.

[1] 
http://blog.backblaze.com/2009/09/01/petabytes-on-a-budget-how-to-build-cheap-cloud-storage/

-mrb

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Petabytes on a budget - blog

2009-09-04 Thread Marc Bevand
Marc Bevand m.bevand at gmail.com writes:
 
 So in conclusion, my SBNSWAG (scientific but not so wild-ass guess)
 is that the max I/O throughput when reading from all the disks on
 1 of their storage pod is about 1000MB/s.

Correction: the SiI3132 are on x1 (not x2) links, so my guess as to
the aggregate throughput when reading from all the disks is:
3*150+100 = 550MB/s.
(150MB/s is 60% of the max theoretical 250MB/s bandwidth of an x1 link)

And if they tuned MAX_PAYLOAD_SIZE to allow the 3 PCI-E SATA cards
to exploit closer to the max theoretical bandwidth of an x1 PCI-E
link, it would be:
3*250+100 = 850MB/s.

-mrb

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Roch

100% random writes produce around 200 IOPS with a 4-6 second pause
around every 10 seconds. 

This indicates that the bandwidth you're able to transfer
through the protocol is about 50% greater than the bandwidth
the pool can offer to ZFS. Since this is not sustainable, what you
see here is ZFS trying to balance the two numbers.

-r

David Bond writes:
  Hi,
  
  I was directed here after posting in CIFS discuss (as I first thought that 
  it could be a CIFS problem).
  
  I posted the following in CIFS:
  
  When using iometer from Windows against the file share on OpenSolaris
  snv_101 and snv_111, I get pauses of around 5 seconds (maybe a little less)
  every 5 seconds where no data is transferred. When data is transferred it
  is at a fair speed and gets around 1000-2000 IOPS with 1 thread (depending
  on the work type). The maximum read response time is 200ms and the maximum
  write response time is 9824ms, which is very bad: an almost 10 second delay
  in being able to send data to the server.
  This has been experienced on 2 test servers; the same servers have also
  been tested with Windows Server 2008 and they haven't shown this problem
  (the share performance was slightly lower than CIFS, but it was consistent,
  and the average access time and maximums were very close).
  
  
  I just noticed that if the server hasn't hit its target ARC size, the
  pauses are for maybe .5 seconds, but as soon as it hits its ARC target,
  the IOPS drop to around 50% of what they were and then there are the
  longer pauses of around 4-5 seconds, and after every pause the
  performance slows even more. So it appears it is definitely server side.

  This is with 100% random IO with a spread of 33% write 66% read, 2KB
  blocks, over a 50GB file, no compression, and a 5.5GB target ARC size.
  
  
  
  Also, I have just run some tests with different IO patterns and 100%
  sequential writes produce a consistent 2100 IOPS, except when it pauses
  for maybe .5 seconds every 10-15 seconds.

  100% random writes produce around 200 IOPS with a 4-6 second pause
  around every 10 seconds.

  100% sequential reads produce around 3700 IOPS with no pauses, just
  random peaks in response time (only 16ms) after about 1 minute of
  running, so nothing to complain about.

  100% random reads produce around 200 IOPS, with no pauses.

  So it appears that writes cause a problem; what is causing these very
  long write delays?

  A network capture shows that the server doesn't respond to the write
  from the client when these pauses occur.
  
  Also, when using iometer, the initial file creation doesn't have any
  pauses, so it might only happen when modifying files.

  Any help on finding a solution to this would be really appreciated.
  
  David
  -- 
  This message posted from opensolaris.org
  ___
  zfs-discuss mailing list
  zfs-discuss@opensolaris.org
  http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Change the volblocksize of a ZFS volume

2009-09-04 Thread Roch

stuart anderson writes:
 Question:

 Is there a way to change the volume blocksize, say via
 'zfs snapshot send/receive'?

 As I see things, this isn't possible as the target volume
 (including property values) gets overwritten by 'zfs receive'.

By default, properties are not received.  To pass properties,
you need to use the -R flag.

   I have tried that, and while it works for properties
   like compression, I have not found a way to preserve
   a non-default volblocksize across zfs send | zfs
   receive. The zvol created on the receive side is
   always defaulting to 8k. Is there a way to do this?
   
  
  I spoke too soon. More particularly, during the zfs send/recv
  process the receiving side reports 8k, but once the receive is done
  the volblocksize reports the expected value as sent with -R. 
  
  Hopefully, this is just a reporting bug during an active receive.
  
  Note, this was observed with s10u7 (x86).
  

Sounds like so.

I would be very surprised if one would be able to change the
volblocksize of a zvol through send/receive (with or without
-R). It's an immutable property of the zvol.
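A quick way to verify what the receiving side actually ends up with once the
receive completes (the pool, volume and snapshot names below are made up):

  zfs get -H -o value volblocksize tank/srcvol
  zfs send -R tank/srcvol@snap1 | zfs receive -d backup
  zfs get -H -o value volblocksize backup/srcvol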

-r


  Thanks.
  -- 
  This message posted from opensolaris.org
  ___
  zfs-discuss mailing list
  zfs-discuss@opensolaris.org
  http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] check a zfs rcvd file

2009-09-04 Thread dick hoogendijk

Lori Alt wrote:
The -n option does some verification.  It verifies that the record 
headers distributed throughout the stream are syntactically valid.  
Since each record header contains a length field which allows the next 
header to be found, one bad header will cause the processing of the 
stream to abort.  But it doesn't verify the content of the data 
associated with each record.


So, storing the stream in a zfs received filesystem is the better 
option. Alas, it also is the most difficult one. Storing to a file with 
zfs send -Rv is easy. The result is just a file and if you reboot the 
system all is OK. However, if I zfs receive -Fdu into a zfs filesystem 
I'm in trouble when I reboot the system. I get confusion on mountpoints! 
Let me explain:


Some time ago I backed up my rpool and my /export ; /export/home to 
/backup/snaps (with zfs receive -Fdu). All's OK because the newly 
created zfs FS's stay unmounted 'till the next reboot(!). When I 
rebooted my system (due to a kernel upgrade) the system would not boot, 
because it had mounted the zfs FS backup/snaps/export on /export and 
backup/snaps/export/home on /export/home. The system itself had those 
FS's too, of course. So, there was a mix-up. It would be nice if the 
backup FS's would not be mounted (canmount=noauto), but I cannot give 
this option when I create the zfs send | receive, can I? And giving this 
option later on is very difficult, because canmount is NOT recursive! 
And I don't want to set it manually on all those backed-up FS's.


I wonder how other people overcome this mountpoint issue.

--
Dick Hoogendijk -- PGP/GnuPG key: 01D2433D
+ http://nagual.nl/ | SunOS 10u7 5/09 | OpenSolaris 2010.02 b122
+ All that's really worth doing is what we do for others (Lewis Carrol)
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Scott Meilicke
Roch Bourbonnais Wrote:
100% random writes produce around 200 IOPS with a 4-6 second pause
around every 10 seconds. 

This indicates that the bandwidth you're able to transfer
through the protocol is about 50% greater than the bandwidth
the pool can offer to ZFS. Since this is not sustainable, what you
see here is ZFS trying to balance the two numbers.

When I have tested using 50% reads, 60% random using iometer over NFS, I can 
see the data going straight to disk due to the sync nature of NFS. But I also 
see writes coming to a standstill every 10 seconds or so, which I have 
attributed to the ZIL dumping to disk. Therefore I conclude that it is the 
process of dumping the ZIL to disk that (mostly?) blocks writes during the 
dumping. I do agree with Bob and others who suggest that making the size of the 
dump smaller will mask this behavior, and that seems like a good idea, although 
I have not yet tried and tested it myself.

-Scott
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] check a zfs rcvd file

2009-09-04 Thread Lori Alt

On 09/04/09 09:41, dick hoogendijk wrote:

Lori Alt wrote:
The -n option does some verification.  It verifies that the record 
headers distributed throughout the stream are syntactically valid.  
Since each record header contains a length field which allows the 
next header to be found, one bad header will cause the processing of 
the stream to abort.  But it doesn't verify the content of the data 
associated with each record.


So, storing the stream in a zfs received filesystem is the better 
option. Alas, it also is the most difficult one. Storing to a file 
with zfs send -Rv is easy. The result is just a file and if you 
reboot the system all is OK. However, if I zfs receive -Fdu into a 
zfs filesystem I'm in trouble when I reboot the system. I get 
confusion on mountpoints! Let me explain:


Some time ago I backed up my rpool and my /export ; /export/home to 
/backup/snaps (with zfs receive -Fdu). All's OK because the newly 
created zfs FS's stay unmounted 'till the next reboot(!). When I 
rebooted my system (due to a kernel upgrade) the system would not 
boot, because it had mounted the zfs FS backup/snaps/export on 
/export and backup/snaps/export/home on /export/home. The system 
itself had those FS's too, of course. So, there was a mix-up. It would 
be nice if the backup FS's would not be mounted (canmount=noauto), but 
I cannot give this option when I create the zfs send | receive, can I? 
And giving this option later on is very difficult, because canmount 
is NOT recursive! And I don't want to set it manually on all those 
backed-up FS's.


I wonder how other people overcome this mountpoint issue.

The -u option to zfs recv (which was just added to support flash archive 
installs, but it's useful for other reasons too) suppresses all mounts 
of the received file systems.  So you can mount them yourself afterward 
in whatever order is appropriate, or do a 'zfs mount -a'.
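For example (the snapshot name is a placeholder; backup/snaps is the target
used elsewhere in this thread):

  zfs send -Rv tank@backup1 | zfs receive -Fdu backup/snaps
  zfs list -r -o name,mountpoint,canmount backup/snaps   # nothing is mounted yet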


lori



___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


[zfs-discuss] question about my hardware choice

2009-09-04 Thread Eugen Leitl

Hi zfs cognoscenti,

a few quick question about my hardware choice (a bit late, since the
box is up already):

A 3U supermicro chassis with 16x SATA/SAS hotplug
Supermicro X8DDAi (2x Xeon QC 1.26 GHz S1366, 24 GByte RAM, IPMI)
2x LSI SAS3081E-R
16x WD2002FYPS

Right now I'm running Solaris 10 5/09 (Oracle doesn't support
OpenSolaris, unfortunately).

I would like to run Oracle in a zone/container, and use the rest for
random storage and network servage.

My questions:

* does the hardware choice make sense? Particularly the LSI host adapters;
  should I change anything hardware-side?

* what kind of zfs layout would you recommend if I want to run Oracle in a 
container?

* should I put some SSD (e.g. Intel 80 GByte 2nd gen) into the system if I can,
  or doesn't Solaris 10 5/09 ZFS support it?

* is there any reason that speaks against running Oracle in containers?

* how many hot spares would you suggest?

Thanks.

-- 
Eugen* Leitl http://leitl.org
__
ICBM: 48.07100, 11.36820 http://www.ativel.com http://postbiota.org
8B29F6BE: 099D 78BA 2FD3 B014 B08A  7779 75B0 2443 8B29 F6BE
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Archiving and Restoring Snapshots

2009-09-04 Thread Richard Elling

On Sep 3, 2009, at 10:32 PM, Tim Cook wrote:


On Fri, Sep 4, 2009 at 12:17 AM, Ross myxi...@googlemail.com wrote:
Hi Richard,

Actually, reading your reply has made me realise I was overlooking  
something when I talked about tar, star, etc...  How do you back up a  
ZFS volume?  That's something traditional tools can't do.  Are  
snapshots the only way to create a backup or archive of those?


Below the application, dd would do it.  But if you want incrementals,  
then either use the application's backup scheme or zfs send.

Personally I'm quite happy with snapshots - we have a ZFS system at  
work that's replicating all of its data to an offsite ZFS store  
using snapshots.  Using ZFS as a backup store is something I'm quite  
happy with; it's storing just a snapshot file that makes me  
nervous.


The correct answer is ndmp.  Whether Sun will ever add it to  
opensolaris is another subject entirely though.


Available since b78, with source integrated in b102.
http://www.opensolaris.org/os/project/ndmp/

But NDMP is just part of an overall data management architecture...
 -- richard

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] check a zfs rcvd file

2009-09-04 Thread dick hoogendijk

Lori Alt wrote:
The -u option to zfs recv (which was just added to support flash 
archive installs, but it's useful for other reasons too) suppresses 
all mounts of the received file systems.  So you can mount them 
yourself afterward in whatever order is appropriate, or do a 'zfs 
mount -a'.
You misunderstood my problem. It is very convenient that the filesystems 
are not mounted. I only wish they could stay that way! Alas, they ARE 
mounted (even if I don't want them to be) when I *reboot* the system. And 
THAT's when things get ugly. I then have different zfs filesystems using 
the same mountpoints! The backed-up ones have the same mountpoints as 
their origin :-/  - The only way to stop it is to *export* the backup 
zpool OR to change *manually* the zfs prop canmount=noauto on all 
backed-up snapshots/filesystems.


As I understand it, I cannot pass canmount=noauto to the zfs receive 
command.

# zfs send -Rv rp...@0909 | zfs receive -Fdu backup/snaps

--
Dick Hoogendijk -- PGP/GnuPG key: 01D2433D
+ http://nagual.nl/ | SunOS 10u7 5/09 | OpenSolaris 2010.02 B121
+ All that's really worth doing is what we do for others (Lewis Carrol)

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Neil Perrin



On 09/04/09 09:54, Scott Meilicke wrote:

Roch Bourbonnais Wrote:
100% random writes produce around 200 IOPS with a 4-6 second pause
around every 10 seconds. 

This indicates that the bandwidth you're able to transfer
through the protocol is about 50% greater than the bandwidth
the pool can offer to ZFS. Since this is not sustainable, what you
see here is ZFS trying to balance the two numbers.


When I have tested using 50% reads, 60% random using iometer over NFS,
I can see the data going straight to disk due to the sync nature of NFS.
But I also see writes coming to a standstill every 10 seconds or so,
which I have attributed to the ZIL dumping to disk. Therefore I conclude
that it is the process of dumping the ZIL to disk that (mostly?) blocks
writes during the dumping.


The ZIL does not work like that. It is not a journal.

Under a typical write load write transactions are batched and
written out in a group transaction (txg). This txg sync occurs
every 30s under light load but more frequently or continuously
under heavy load.

When writing synchronous data (e.g. NFS) the transactions get written immediately
to the intent log and are made stable. When the txg later commits, the
intent log blocks containing those committed transactions can be
freed. So as you can see there is no periodic dumping of
the ZIL to disk. What you are probably observing is the periodic txg
commit.
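One way to see this for yourself is to watch the pool while the synchronous
load is running (the pool name is a placeholder):

  zpool iostat tank 1

The steady stream of small writes is the intent log traffic; the periodic
bursts on top of it are the txg commits being written out.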

Hope that helps: Neil. 
___

zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Understanding when (and how) ZFS will use spare disks

2009-09-04 Thread Scott Meilicke
This sounds like the same behavior as opensolaris 2009.06. I had several disks 
recently go UNAVAIL, and the spares did not take over. But as soon as I 
physically removed a disk, the spare started replacing the removed disk. It 
seems UNAVAIL is not the same as the disk not being there. I wish the spare 
*would* take over in these cases, since the pool is degraded.

-Scott
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Eric Sproul
Scott Meilicke wrote:
 So what happens during the txg commit?
 
 For example, if the ZIL is a separate device, SSD for this example, does it 
 not work like:
 
 1. A sync operation commits the data to the SSD
 2. A txg commit happens, and the data from the SSD are written to the 
 spinning disk

#1 is correct.  #2 is incorrect.  The TXG commit goes from memory into the main
pool.  The SSD data is simply left there in case something bad happens before
the TXG commit succeeds.  Once it succeeds, then the SSD data can be 
overwritten.

The only time you need to read from a ZIL device is if a crash occurs and you
need those blocks to repair the pool.

Eric
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Scott Meilicke
So what happens during the txg commit?

For example, if the ZIL is a separate device, SSD for this example, does it not 
work like:

1. A sync operation commits the data to the SSD
2. A txg commit happens, and the data from the SSD are written to the spinning 
disk

So this is two writes, correct?

-Scott
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


[zfs-discuss] Understanding when (and how) ZFS will use spare disks

2009-09-04 Thread Chris Siebenmann
 We have a number of shared spares configured in our ZFS pools, and
we're seeing weird issues where spares don't get used under some
circumstances.  We're running Solaris 10 U6 using pools made up of
mirrored vdevs, and what I've seen is:

* if ZFS detects enough checksum errors on an active disk, it will
  automatically pull in a spare.
* if the system reboots without some of the disks available (so that
  half of the mirrored pairs drop out, for example), spares will *not*
  get used. ZFS recognizes that the disks are not there; they are marked
  as UNAVAIL and the vdevs (and pools) as DEGRADED, but it doesn't try to
  use spares.

(This is in a SAN environment where half of all of the mirrors come
from one controller and half come from another one.)

 All of this makes me think that I don't understand how ZFS spares
really work, and under what circumstances they'll get used. Does
anyone know if there's a writeup of this somewhere?

(What I've gathered so far from reading zfs-discuss archives is that
ZFS spares are not handled automatically in the kernel code but are
instead deployed to pools by a fmd ZFS management module[*], doing more
or less 'zpool replace pool failing-dev spare' (presumably through
an internal code path, since 'zpool history' doesn't seem to show spare
deployment). Is this correct?)
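If you do need to bring a spare in by hand for an UNAVAIL disk, the manual
form of the same operation is (device names here are made up):

  zpool status tank                   # note the UNAVAIL device and the AVAIL spares
  zpool replace tank c2t5d0 c4t9d0    # attach spare c4t9d0 in place of c2t5d0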

 Also, searching turns up some old zfs-discuss messages suggesting that
not bringing in spares in response to UNAVAIL disks was a bug that's now
fixed in at least OpenSolaris. If so, does anyone know if the fix has
made it into S10 U7 (or is planned or available as a patch)?

 Thanks in advance.

- cks
[*: http://blogs.sun.com/eschrock/entry/zfs_hot_spares suggests that
it is 'zfs-retire', which is separate from 'zfs-diagnosis'.]
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Scott Meilicke
Doh! I knew that, but then forgot...

So, for the case of no separate device for the ZIL, the ZIL lives on the disk 
pool. In which case, the data are written to the pool twice during a sync:

1. To the ZIL (on disk) 
2. From RAM to disk during txg

If this is correct (and my history in this thread is not so good, so...), would 
that then explain some sort of pulsing write behavior for sync write operations?
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Bob Friesenhahn

On Fri, 4 Sep 2009, Scott Meilicke wrote:


So what happens during the txg commit?

For example, if the ZIL is a separate device, SSD for this example, does it not 
work like:

1. A sync operation commits the data to the SSD
2. A txg commit happens, and the data from the SSD are written to the spinning 
disk

So this is two writes, correct?


From past descriptions, the slog is basically a list of pending write 
system calls.  The only time the slog is read is after a reboot. 
Otherwise, the slog is simply updated as write operations proceed.


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer,http://www.GraphicsMagick.org/
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Scott Meilicke
So, I just re-read the thread, and you can forget my last post. I had thought 
the argument was that the data were not being written to disk twice (assuming 
no separate device for the ZIL), but the point was that the data are not read 
from the ZIL and then written to disk, but rather written from memory to disk. 
I need more coffee...
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Petabytes on a budget - blog

2009-09-04 Thread Tim Cook
On Fri, Sep 4, 2009 at 5:36 AM, Marc Bevand m.bev...@gmail.com wrote:

 Marc Bevand m.bevand at gmail.com writes:
 
  So in conclusion, my SBNSWAG (scientific but not so wild-ass guess)
  is that the max I/O throughput when reading from all the disks on
  1 of their storage pod is about 1000MB/s.

 Correction: the SiI3132 are on x1 (not x2) links, so my guess as to
 the aggregate throughput when reading from all the disks is:
 3*150+100 = 550MB/s.
 (150MB/s is 60% of the max theoretical 250MB/s bandwidth of an x1 link)

 And if they tuned MAX_PAYLOAD_SIZE to allow the 3 PCI-E SATA cards
 to exploit closer to the max theoretical bandwidth of an x1 PCI-E
 link, it would be:
 3*250+100 = 850MB/s.

 -mrb



What's the point of arguing about what the back-end can do anyway?  This is bulk
data storage.  Their MAX input is ~100MB/sec.  The backend can more than
satisfy that.  Who cares at that point whether it can push 500MB/s or
5000MB/s?  It's not a database processing transactions.  It only needs to be
able to push as fast as the front-end can go.

--Tim
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Kyle McDonald

Scott Meilicke wrote:

I am still not buying it :) I need to research this to satisfy myself.

I can understand that the writes come from memory to disk during a txg write 
for async, and that is the behavior I see in testing.

But for sync, data must be committed, and an SSD/ZIL makes that faster because 
you are writing to the SSD/ZIL, and not to spinning disk. Eventually that data 
on the SSD must get to spinning disk.

  
But the txg (which may contain more data than just the sync data that 
was written to the ZIL) is still written from memory. Just because the 
sync data was written to the ZIL, doesn't mean it's not still in memory.


 -Kyle


To the books I go!

-Scott
  


___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


[zfs-discuss] zfs compression algorithm : jpeg ??

2009-09-04 Thread Len Zaifman
We have groups generating terabytes a day of image data  from lab instruments 
and saving them to an X4500.

We have tried lzjb:   compressratio = 1.13 in 11 hours, 1.3 TB -> 1.1 TB
              gzip-9: compressratio = 1.68 in 37 hours, 1.3 TB -> 0.75 TB

The filesystem performance was noticeably laggy (i.e. ls took > 10 seconds) while 
gzip -9 compression was used.

Do you have any idea if lossless jpeg compression is being planned for ZFS? We 
envisage that of the 1.3 TB, > 0.8 TB will be images, and if we could get better 
or equivalent compression on jpeg lossless compression with less impact on the 
filesystem than gzip -9 compression, that would be worthwhile, if it worked.
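For reference, the per-dataset settings behind a comparison like the above are
just (dataset names made up):

  zfs create -o compression=lzjb tank/images-lzjb
  zfs create -o compression=gzip-9 tank/images-gzip9
  zfs get compressratio tank/images-lzjb tank/images-gzip9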
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


[zfs-discuss] PMP support in Opensolaris

2009-09-04 Thread Brandon High
On Wed, Sep 2, 2009 at 4:56 PM, David Magda dma...@ee.ryerson.ca wrote:
 Said support was committed only two to three weeks ago:

 PSARC/2009/394 SATA Framework Port Multiplier Support
 6422924 sata framework has to support port multipliers
 6691950 ahci driver needs to support SIL3726/4726 SATA port multiplier

When is this going to show up in the repo at
http://pkg.opensolaris.org/dev/ ? Is it already there?

Sorry if it's a dumb question, but I'm not sure where to look so the
release process is a bit opaque to me.

-B

-- 
Brandon High : bh...@freaks.com
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Ross Walker
On Sep 4, 2009, at 2:22 PM, Scott Meilicke scott.meili...@craneaerospace.com 
 wrote:


So, I just re-read the thread, and you can forget my last post. I  
had thought the argument was that the data were not being written to  
disk twice (assuming no separate device for the ZIL), but the point  
was that the data are not read from the ZIL and then written to  
disk, but rather written from memory to disk. I need more coffee...


I think you're confusing ARC write-back with the ZIL, and it isn't the sync  
writes that are blocking IO, it's the async writes that have been  
cached and are now being flushed.


Just tell the ARC to cache less IO for your hardware with the kernel  
config Bob mentioned way back.


-Ross

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Scott Meilicke
Yes, I was getting confused. Thanks to you (and everyone else) for clarifying.

Sync or async, I see the txg flushing to disk starve read IO.

Scott
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] zfs compression algorithm : jpeg ??

2009-09-04 Thread Richard Elling


On Sep 4, 2009, at 12:23 PM, Len Zaifman wrote:

We have groups generating terabytes a day of image data  from lab  
instruments and saving them to an X4500.


Wouldn't it be easier to compress at the application, or between the
application and the archiving file system?

We have tried lzbj : compressratio = 1.13 in 11 hours , 1.3 TB -  
1.1 TB
   gzip -9 : compress ratio = 1.68 in  37 hours,  
1.3 TB - .75 TB


The filesystem performance was noticably laggy (ie ls took  10  
seconds) while gzip -9 compression was used


do you have any idea if lossless jpeg compression is being planned  
for ZFS? We can envisage of 1.3 TB,  .8 TB will be images and if we  
could get better or equivalent compression on jpeg lossless  
compression with less impact on the filesystem than gzip -9  
compression, that would be worthwhile, if it worked.


I don't know of anyone working on that specific compression scheme,
but I've put together some thoughts on the subject of adding a new
compressor to ZFS.  Perhaps others could comment?

http://richardelling.blogspot.com/2009/08/justifying-new-compression-algorithms.html
 -- richard

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Ross Walker
On Sep 4, 2009, at 4:33 PM, Scott Meilicke scott.meili...@craneaerospace.com 
 wrote:


Yes, I was getting confused. Thanks to you (and everyone else) for  
clarifying.


Sync or async, I see the txg flushing to disk starve read IO.


Well try the kernel setting and see how it helps.

Honestly though if you can say it's all sync writes with certainty and  
IO is still blocking, you need a better storage sub-system, or an  
additional pool.


-Ross
 
___

zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] zfs compression algorithm : jpeg ??

2009-09-04 Thread Louis-Frédéric Feuillette
On Fri, 2009-09-04 at 13:41 -0700, Richard Elling wrote:
 On Sep 4, 2009, at 12:23 PM, Len Zaifman wrote:
 
  We have groups generating terabytes a day of image data  from lab  
  instruments and saving them to an X4500.
 
 Wouldn't it be easier to compress at the application, or between the
 application and the archiving file system?

Preamble:  I am actively doing research into image set compression,
specifically jpeg2000, so this is my point of reference.


I think it would be easier to compress at the application level. I would
suggest getting the image from the source, then using lossless jpeg2000
compression on it and saving the result to an uncompressed ZFS pool.

JPEG2000 uses arithmetic encoding to do the final compression step.
Arithmetic encoding has a higher compression rate (in general) than
gzip-9, lzjb or others.  There is an open-source implementation of
jpeg2000 called Jasper[1].  Jasper is the reference implementation for
jpeg2000, meaning that all other jpeg2000 programs must verify their
output against that of Jasper (kinda).

Saving the jpeg2000 image to an uncompressed ZFS partition will be the
fastest thing.  Since jpeg2000 is already compressed, trying to compress
it will not yield any storage space reduction; in fact it may _increase_
the size of the data stored on disk.  Since good compression algorithms
produce essentially random data, you can see why running on a compressed
pool would be bad for performance.

[1] http://www.ece.uvic.ca/~mdadams/jasper

On a side note, if you want to know how Arithmetic encoding works,
Wikipedia[2] has a really nice explanation.  Suffice it to say, in theory
(without considering implementation details) arithmetic encoding can
encode _any_ data at the rate of data_entropy*num_of_symbols +
data_symbol_table. In practice this doesn't happen due to floating point
overflows and some other issues.
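As an illustrative example (numbers invented purely for the arithmetic): an
image of 10^6 8-bit samples whose measured entropy is 4.2 bits/symbol would
bound out at roughly 4.2 x 10^6 bits, about 525 KB plus the symbol table,
versus 1 MB raw.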

[2] http://en.wikipedia.org/wiki/Arithmetic_coding

-- 
Louis-Frédéric Feuillette jeb...@gmail.com

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Scott Meilicke
I only see the blocking while load testing, not during regular usage, so I am 
not so worried. I will try the kernel settings to see if that helps if/when I 
see the issue in production. 

For what it is worth, here is the pattern I see when load testing NFS (iometer, 
60% random, 65% read, 8k chunks, 32 outstanding I/Os):

data01  59.6G  20.4T 46 24   757K  3.09M
data01  59.6G  20.4T 39 24   593K  3.09M
data01  59.6G  20.4T 45 25   687K  3.22M
data01  59.6G  20.4T 45 23   683K  2.97M
data01  59.6G  20.4T 33 23   492K  2.97M
data01  59.6G  20.4T 16 41   214K  1.71M
data01  59.6G  20.4T  3  2.36K  53.4K  30.4M
data01  59.6G  20.4T  1  2.23K  20.3K  29.2M
data01  59.6G  20.4T  0  2.24K  30.2K  28.9M
data01  59.6G  20.4T  0  1.93K  30.2K  25.1M
data01  59.6G  20.4T  0  2.22K  0  28.4M
data01  59.7G  20.4T     21    295   317K  4.48M
data01  59.7G  20.4T 32 12   495K  1.61M
data01  59.7G  20.4T 35 25   515K  3.22M
data01  59.7G  20.4T 36 11   522K  1.49M
data01  59.7G  20.4T 33 24   508K  3.09M

LSI SAS HBA, 3 x 5 disk raidz, Dell 2950, 16GB RAM.

-Scott
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] zfs compression algorithm : jpeg ??

2009-09-04 Thread Nicolas Williams
On Fri, Sep 04, 2009 at 01:41:15PM -0700, Richard Elling wrote:
 On Sep 4, 2009, at 12:23 PM, Len Zaifman wrote:
 We have groups generating terabytes a day of image data  from lab  
 instruments and saving them to an X4500.
 
 Wouldn't it be easier to compress at the application, or between the
 application and the archiving file system?

Especially when it comes to reading the images back!

ZFS compression is transparent.  You can't write uncompressed data then
read back compressed data.  And compression is at the block level, not
for the whole file, so even if you could read it back compressed, it
wouldn't be in a useful format.

Most people want to transfer data compressed, particularly images.  So
compressing at the application level in this case seems best to me.

Nico
-- 
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] check a zfs rcvd file

2009-09-04 Thread Lori Alt

On 09/04/09 10:17, dick hoogendijk wrote:

Lori Alt wrote:
The -u option to zfs recv (which was just added to support flash 
archive installs, but it's useful for other reasons too) suppresses 
all mounts of the received file systems.  So you can mount them 
yourself afterward in whatever order is appropriate, or do a 'zfs 
mount -a'.
You misunderstood my problem. It is very convenient that the 
filesystems are not mounted. I only wish they could stay that way!. 
Alas, they ARE mounted (even if I don't want them to) when I  *reboot* 
the system. And THAT's when thing get ugly. I then have different zfs 
filesystems using the same mountpoints! The backed up ones have the 
same mountpoints as their origin :-/  - The only way to stop it is to 
*export* the backup zpool OR to change *manualy* the zfs prop 
canmount=noauto in all backed up snapshots/filesystems.


As I understand I cannot give this canmount=noauto to the zfs 
receive command.

# zfs send -Rv rp...@0909 | zfs receive -Fdu backup/snaps
There is an RFE to allow zfs recv to assign properties, but I'm not sure 
whether it would help in your case.  I would have thought that 
canmount=noauto would have already been set on the sending side, 
however.  In that case, the property should be preserved when the stream 
is received.  But if for some reason you're not setting that property 
on the sending side, but want it set on the receiving side, you might 
have to write a script to set the properties for all those datasets 
after they are received.
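A minimal sketch of such a script, using the backup/snaps target from this
thread (adjust the dataset name to whatever you actually received into):

  #!/bin/sh
  # set canmount=noauto on every file system received under backup/snaps
  for fs in `zfs list -H -o name -t filesystem -r backup/snaps`; do
      zfs set canmount=noauto $fs
  done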


lori

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] zfs compression algorithm : jpeg ??

2009-09-04 Thread Bob Friesenhahn

On Fri, 4 Sep 2009, Louis-Frédéric Feuillette wrote:


JPEG2000 uses arithmetic encoding to do the final compression step.
Arithmetic encoding has a higher compression rate (in general) than
gzip-9, lzjb or others.  There is an open-source implementation of
jpeg2000 called Jasper[1].  Jasper is the reference implementation for
jpeg2000, meaning that all other jpeg2000 programs must verify their
output against that of Jasper (kinda).


Jasper is incredibly slow and consumes a large amount of memory.  Other 
JPEG2000 programs are validated by how many times faster they are than 
Jasper. :-)


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer,http://www.GraphicsMagick.org/
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Bob Friesenhahn

On Fri, 4 Sep 2009, Scott Meilicke wrote:

I only see the blocking while load testing, not during regular 
usage, so I am not so worried. I will try the kernel settings to see 
if that helps if/when I see the issue in production.


The flipside of the pulsing is that the deferred writes diminish 
contention for precious read IOPs, and quite a few programs have a 
habit of updating/rewriting a file over and over again.  If the file 
is completely asynchronously rewritten once per second and zfs writes 
a transaction group every 30 seconds, then 29 of those updates avoid 
consuming write IOPs.  Another benefit is that if zfs has more data in 
hand to write, then it can do a much better job of avoiding 
fragmentation, avoid unnecessary COW by diminishing short tail writes, 
and achieve more optimal write patterns.


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer,http://www.GraphicsMagick.org/
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Ross Walker
On Sep 4, 2009, at 6:33 PM, Bob Friesenhahn bfrie...@simple.dallas.tx.us 
 wrote:



On Fri, 4 Sep 2009, Scott Meilicke wrote:

I only see the blocking while load testing, not during regular  
usage, so I am not so worried. I will try the kernel settings to  
see if that helps if/when I see the issue in production.


The flipside of the pulsing is that the deferred writes diminish  
contention for precious read IOPs and quite a few programs have a  
habit of updating/rewriting a file over and over again.  If the file  
is completely asynchronously rewritten once per second and zfs  
writes a transaction group every 30 seconds, then 29 of those  
updates avoid consuming write IOPs.  Another benefit is that if  
zfs has more data in hand to write, then it can do a much better job  
of avoiding fragmentation, avoid unnecessary COW by diminishing  
short tail writes, and achieve more optimum write patterns.


I guess one can find a silver lining in any grey cloud, but for myself  
I'd just rather see a more linear approach to writes. Anyway I have  
never seen any reads happen during these write flushes.


-Ross

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Ross Walker
On Sep 4, 2009, at 5:25 PM, Scott Meilicke scott.meili...@craneaerospace.com 
 wrote:


I only see the blocking while load testing, not during regular  
usage, so I am not so worried. I will try the kernel settings to see  
if that helps if/when I see the issue in production.


For what it is worth, here is the pattern I see when load testing  
NFS (iometer, 60% random, 65% read, 8k chunks, 32 outstanding I/Os):


data01  59.6G  20.4T 46 24   757K  3.09M
data01  59.6G  20.4T 39 24   593K  3.09M
data01  59.6G  20.4T 45 25   687K  3.22M
data01  59.6G  20.4T 45 23   683K  2.97M
data01  59.6G  20.4T 33 23   492K  2.97M
data01  59.6G  20.4T 16 41   214K  1.71M
data01  59.6G  20.4T  3  2.36K  53.4K  30.4M
data01  59.6G  20.4T  1  2.23K  20.3K  29.2M
data01  59.6G  20.4T  0  2.24K  30.2K  28.9M
data01  59.6G  20.4T  0  1.93K  30.2K  25.1M
data01  59.6G  20.4T  0  2.22K  0  28.4M
data01  59.7G  20.4T     21    295   317K  4.48M
data01  59.7G  20.4T 32 12   495K  1.61M
data01  59.7G  20.4T 35 25   515K  3.22M
data01  59.7G  20.4T 36 11   522K  1.49M
data01  59.7G  20.4T 33 24   508K  3.09M

LSI SAS HBA, 3 x 5 disk raidz, Dell 2950, 16GB RAM.


With that setup you'll see at most 3x the IOPS of the individual disks, which  
is not really the kind of setup for a 60% random workload. Assuming 2TB SATA  
drives, the max would be around 240 IOPS.


Now if it were mirror vdevs you'd get 7x or 560 IOPS.
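(Rough arithmetic, assuming about 80 random IOPS per 7200 RPM SATA drive:
3 raidz vdevs x ~80 is roughly 240 IOPS, whereas 7 two-way mirrors built from
the same 15 bays x ~80 is roughly 560 IOPS.)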

Is this for VMware or data warehousing?

You'll also need an SSD drive in the mix if you're not using a  
controller with NVRAM write-back. Especially when sharing over NFS.


I guess since it's 15 drives it's an MD1000. I might have gone with  
the newer 2.5" drive enclosure, as it holds 24 drives instead of 15 and  
most SSDs come in 2.5".


Since you got it already, invest in a PERC 6/E with 512MB of cache and  
stick it in the other PCIe 8x slot.


-Ross

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] PMP support in Opensolaris

2009-09-04 Thread Brandon High
On Fri, Sep 4, 2009 at 1:12 PM, Nigel
Smithnwsm...@wilusa.freeserve.co.uk wrote:
 Let us know if you can get the port multipliers working..

 But remember, there is a problem with ZFS raidz in that release, so be 
 careful:

I saw that, so I think I'll be waiting until snv_124 to update. The
system that I'm thinking of using currently only has mirrored vdevs
however, so it shouldn't be any risk.

Something like one of the following seems reasonable to add a few
drives to an existing system, although eSATA just seems like a bad
idea for a number of reasons:
http://www.newegg.com/Product/Product.aspx?Item=N82E16816132016
http://www.newegg.com/Product/Product.aspx?Item=N82E16816111057

A good use that I can see is combining an Intel D945GCLF2 board with a
case that has more than 2 drive bays, using an internal PMP. One of
the systems I have is an Atom board in a small Chenbro 2-bay case,
which gives surprisingly good performance. There is a 4-bay
version available, but the lack of SATA ports on the motherboard kept me
from using it.

http://www.cooldrives.com/siseata5pomu.html
http://www.newegg.com/Product/Product.aspx?Item=N82E16811123122

-B

-- 
Brandon High : bh...@freaks.com
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Bob Friesenhahn

On Fri, 4 Sep 2009, Ross Walker wrote:


I guess one can find a silver lining in any grey cloud, but for myself I'd 
just rather see a more linear approach to writes. Anyway I have never seen 
any reads happen during these write flushes.


I have yet to see a read happen during the write flush either.  That 
impacts my application since it needs to read in order to proceed, and 
it does a similar amount of writes as it does reads.


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer,http://www.GraphicsMagick.org/
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Ross Walker
On Sep 4, 2009, at 8:59 PM, Bob Friesenhahn bfrie...@simple.dallas.tx.us 
 wrote:



On Fri, 4 Sep 2009, Ross Walker wrote:


I guess one can find a silver lining in any grey cloud, but for  
myself I'd just rather see a more linear approach to writes. Anyway  
I have never seen any reads happen during these write flushes.


I have yet to see a read happen during the write flush either.  That  
impacts my application since it needs to read in order to proceed,  
and it does a similar amount of writes as it does reads.


The ARC makes it hard to tell if they are satisfied from cache or  
blocked due to writes.


I suppose if you have the hardware to go sync that might be the best  
bet. That and limiting the write cache.


Though I have only heard good comments from my ESX admins since moving  
the VMs off iSCSI and on to ZFS over NFS, so it can't be that bad.


-Ross

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Bob Friesenhahn

On Fri, 4 Sep 2009, Ross Walker wrote:


I have yet to see a read happen during the write flush either.  That 
impacts my application since it needs to read in order to proceed, and it 
does a similar amount of writes as it does reads.


The ARC makes it hard to tell if they are satisfied from cache or blocked due 
to writes.


The existing prefetch bug makes it doubly hard. :-)

First I complained about the blocking reads, and then I complained 
about the blocking writes (presumed responsible for the blocking 
reads) and now I am waiting for working prefetch in order to feed my 
hungry application.


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer,http://www.GraphicsMagick.org/
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread David Magda

On Sep 4, 2009, at 21:44, Ross Walker wrote:

Though I have only heard good comments from my ESX admins since  
moving the VMs off iSCSI and on to ZFS over NFS, so it can't be that  
bad.


What's your pool configuration? Striped mirrors? RAID-Z with SSDs?  
Other?




Re: [zfs-discuss] Pulsing write performance

2009-09-04 Thread Ross Walker

On Sep 4, 2009, at 10:02 PM, David Magda dma...@ee.ryerson.ca wrote:


On Sep 4, 2009, at 21:44, Ross Walker wrote:

Though I have only heard good comments from my ESX admins since  
moving the VMs off iSCSI and on to ZFS over NFS, so it can't be  
that bad.


What's your pool configuration? Striped mirrors? RAID-Z with SSDs?  
Other?


Striped mirrors off an NVRAM-backed controller (Dell PERC 6/E).

RAID-Z isn't the best for many VMs, as the whole vdev acts as a single
disk for random I/O.
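
(For anyone wanting to reproduce that kind of layout, a minimal sketch
with made-up device names -- not our exact config:)

zpool create vmpool \
    mirror c1t0d0 c1t1d0 \
    mirror c1t2d0 c1t3d0 \
    mirror c1t4d0 c1t5d0

# each mirror adds its own spindles' worth of random IOPS, unlike a
# single raidz vdev, which behaves like one disk for random I/O
zfs create vmpool/esx
zfs set sharenfs=on vmpool/esx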


-Ross



Re: [zfs-discuss] Motherboard for home zfs/solaris file server

2009-09-04 Thread Tim Cook
On Thu, Sep 3, 2009 at 4:57 AM, Karel Gardas karel.gar...@centrum.cz wrote:

 Hello,
 the idea that ECC support was dropped from (Open)Solaris 2009.06 is a
 misunderstanding. OS 2009.06 supports ECC just as 2005 did. Just install
 it and use my updated ecccheck.pl script to get informed about errors.
 You can also verify that Solaris' memory scrubber is really running, if
 you are that curious:
 http://developmentonsolaris.wordpress.com/2009/03/06/how-to-make-sure-memory-scrubber-is-running/
 Karel
 --



Is there something that needs to be done on the Solaris side for memscrub
scans to occur?  I'm running snv_118 with a Supermicro board, ECC memory,
and AMD Opteron CPUs.  It would appear it's doing a lot of nothing.

Aug  8 03:56:23 fserv unix: [ID 950921 kern.info] cpu0: x86 (chipid 0x0
AuthenticAMD 40F13 family 15 model 65 step 3 clock 2010 MHz)
Aug  8 03:56:23 fserv unix: [ID 950921 kern.info] cpu0: Dual-Core AMD
Opteron(tm) Processor 2212

r...@fserv:~# isainfo -v
64-bit amd64 applications
tscp ahf cx16 sse3 sse2 sse fxsr amd_3dnowx amd_3dnow amd_mmx mmx
cmov
amd_sysc cx8 tsc fpu
32-bit i386 applications
tscp ahf cx16 sse3 sse2 sse fxsr amd_3dnowx amd_3dnow amd_mmx mmx
cmov
amd_sysc cx8 tsc fpu




r...@fserv:~# echo memscrub_scans_done/U | mdb -k
memscrub_scans_done:
memscrub_scans_done:0
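
(A couple of related checks, as a sketch -- these assume the x86 memscrub
code uses the usual disable_memscrub and memscrub_period_sec variables,
which may differ between builds:)

echo disable_memscrub/D | mdb -k       # non-zero means the scrubber is disabled
echo memscrub_period_sec/D | mdb -k    # seconds between scrub passes

# to force it on at boot, something like this in /etc/system:
set disable_memscrub = 0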


[zfs-discuss] incremental send/recv larger than sum of snapshots?

2009-09-04 Thread Peter Baumgartner
I've been sending daily incrementals off-site for a while now, but
recently they failed so I had to send an incremental covering a number
of snapshots. I expected the incremental to be approximately the sum
of the snapshots, but it seems to be considerably larger and still
going. The source machine is nv72 and the destination is nv99. I
send/recv with this command:

/usr/sbin/zfs send -i tank/v...@2009-08-15 tank/v...@2009-08-26 | bzip2 -c
| ssh offsite-computer bzcat | /usr/sbin/zfs recv -F tank/vm

The sum of the 11 days of snapshots is about 100G, but I see the
remote computer registering over 130G. I'm pushing this over a single
T1, so the process has been running for about a week. Is this
expected? If so, is there any way I can calculate how much data will
need to be transferred?

Here is a snippet of zfs list on the source:

tank/v...@2009-08-14   8.46G  -   440G  -
tank/v...@2009-08-15   7.49G  -   440G  -
tank/v...@2009-08-16   7.42G  -   440G  -
tank/v...@2009-08-17   7.45G  -   441G  -
tank/v...@2009-08-18   11.0G  -   538G  -
tank/v...@2009-08-19   11.1G  -   479G  -
tank/v...@2009-08-20   11.1G  -   479G  -
tank/v...@2009-08-21   7.61G  -   480G  -
tank/v...@2009-08-22   6.45G  -   481G  -
tank/v...@2009-08-23   7.31G  -   481G  -
tank/v...@2009-08-24   9.66G  -   481G  -
tank/v...@2009-08-25   10.1G  -   481G  -
tank/v...@2009-08-26   12.5G  -   481G  -


And the remote:

tank/v...@2009-08-14   8.46G  -   440G  -
tank/v...@2009-08-15   9.38G  -   440G  -
tank/vm/%2009-08-26    136G   867G   475G   /tank/vm/%2009-08-26
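
(One crude way to answer the sizing question locally, assuming the dataset
really is tank/vm as in the recv target: generate the stream and count its
bytes without sending it anywhere. Newer builds also have a dry-run
estimate, though that option may not exist on nv72.)

# actual (uncompressed) stream size; this reads all the changed data, so it takes a while
zfs send -i tank/vm@2009-08-15 tank/vm@2009-08-26 | wc -c

# on builds whose zfs send supports -n/-v, a quicker estimate:
zfs send -nv -i tank/vm@2009-08-15 tank/vm@2009-08-26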


Re: [zfs-discuss] Petabytes on a budget - blog

2009-09-04 Thread Marc Bevand
Tim Cook tim at cook.ms writes:
 
 Whats the point of arguing what the back-end can do anyways?  This is bulk 
data storage.  Their MAX input is ~100MB/sec.  The backend can more than 
satisfy that.  Who cares at that point whether it can push 500MB/s or 
5000MB/s?  It's not a database processing transactions.  It only needs to be 
able to push as fast as the front-end can go.  --Tim

True, what they have is sufficient to match GbE speed. But internal I/O
throughput matters for resilvering RAID arrays, scrubbing, local data
analysis/processing, etc. In their case they have 3 15-drive RAID6 arrays per
pod. If their layout is optimal they put 5 drives on the PCI bus (to minimize
this number) and 10 drives behind PCI-E links per array, so this means the PCI
bus's ~100MB/s practical bandwidth is shared by 5 drives, or 20MB/s per
(1.5TB) drive, so it is going to take a minimum of 20.8 hours to resilver one
of their arrays.
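
(For reference, the 20.8-hour figure is just the per-drive rate applied to
the full drive capacity:)

    1.5 TB / (20 MB/s) = 1,500,000 MB / 20 MB/s = 75,000 s ~= 20.8 hours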

-mrb



Re: [zfs-discuss] Petabytes on a budget - blog

2009-09-04 Thread Tim Cook
On Sat, Sep 5, 2009 at 12:30 AM, Marc Bevand m.bev...@gmail.com wrote:

 Tim Cook tim at cook.ms writes:
 
  Whats the point of arguing what the back-end can do anyways?  This is bulk
 data storage.  Their MAX input is ~100MB/sec.  The backend can more than
 satisfy that.  Who cares at that point whether it can push 500MB/s or
 5000MB/s?  It's not a database processing transactions.  It only needs to be
 able to push as fast as the front-end can go.  --Tim

 True, what they have is sufficient to match GbE speed. But internal I/O
 throughput matters for resilvering RAID arrays, scrubbing, local data
 analysis/processing, etc. In their case they have 3 15-drive RAID6 arrays per
 pod. If their layout is optimal they put 5 drives on the PCI bus (to minimize
 this number) and 10 drives behind PCI-E links per array, so this means the
 PCI bus's ~100MB/s practical bandwidth is shared by 5 drives, or 20MB/s per
 (1.5TB) drive, so it is going to take a minimum of 20.8 hours to resilver one
 of their arrays.

 -mrb


But none of that matters.  The data is replicated at a higher layer,
combined with RAID-6.  They'd have to see a triple disk failure across
multiple arrays at the same time...  They aren't concerned with performance;
the home users they're backing up aren't ever going to get anything remotely
close to gigE speeds.  The absolute BEST case scenario *MIGHT* push 20mbit if
the end-user is lucky enough to have FiOS or DOCSIS 3.0 in their area and
has large files with a clean link.

Even rebuilding two failed disks, that setup will push 2MB/sec all day long.

--Tim