I store about ~4-6 months of data on a 400GB volume. You’ll have about 4-6x my
amount of data, so I would say you should be able to get away with a 5 TB RAID
10 volume on 7.2k drives. I’m not sure how caching would help still though, It
might help if you are continually accessing the same data frequently in the
same week. RAID card looks good.
Greg Williams, ME
Director of Networks and Infrastructure
Information Technology
University of Colorado Colorado Springs
1420 Austin Bluffs Parkway, (EPC 136A)
Colorado Springs, CO 80918
www.uccs.edu<http://www.uccs.edu/>
From: Roderick Mooi [mailto:roder...@sanren.ac.za]
Sent: Friday, November 4, 2016 3:24 AM
To: Greg Williams <gwill...@uccs.edu>; nfdump-disc...@lists.sourceforge.net;
nfsen-discuss@lists.sourceforge.net
Cc: joh...@sanren.ac.za; Ishmael Tsoaela <ishm...@sanren.ac.za>; Schalk Peach
<spe...@csir.co.za>
Subject: Re: [Nfsen-discuss] Netflow collector hardware advice
Hi Greg
Thanks for the advice. We want to keep the flows for at least 3 months (ideally
6). Most of our queries will run on the last week's data - will the cache
drives help or not really? 10k SAS drives will effectively cost 4x as much as
the 7.2k - is the speed improvement worth it? RAID 10 seems to make sense
although is less efficient (uses more drives) than the initial RAID 6 but seems
to be the logical choice for pure performance - in RAID 6 we had 16x 4TB 7.2k
drives. For RAID10 we'll need 28x 7.2k drives or 62x1.8TB 10k drives (ouch).
Isn't the read performance the same though for both RAID levels and the
important one in this scenario? Ideally I want to go with RAID 10 10k SAS
drives but may have to settle with just RAID 10... PS reducing the RAM gets me
very few extra drives.
Oh yes, the included RAID chip is the SAS3108
(http://www.avagotech.com/products/storage/raid-on-chip/sas-3108). A CacheVault
kit has been included for protection. This seems good enough - what do you
think?
Regards,
Roderick
On 2016/11/01 9:56 PM, Greg Williams wrote:
Hi Roderick, While we don’t process that much, only 4 Gbps and 500k pps peak, I
have it running on an old core 2 duo with 6G of RAM. Yes, it is an old desktop
system (8 years old). Nfsen uses barely any resources. I think your server is
fine with the exception of storage. How long do you want to keep the flows?
Your cache is only used for those queries where you are frequently accessing
the data. Therefore, you will not see any speed increase with the SSD caching
when searching for random data. I would highly recommend either 10/15k SAS or
better yet, SSDs. Also look at RAID 10 for speed. If you are doing RAID 6,
might as well do RAID 10. If cost is an issue, I would decrease the RAM and
increase your storage. Also with data that important, make sure you get a good
RAID card
http://www.avagotech.com/products/storage/raid-controllers/megaraid-sas-9361-16i.
If you just use a simple RAID card without a battery and large cache, you may
corrupt your data in the event of a power failure.
Greg Williams, ME
Director of Networks and Infrastructure
Information Technology
University of Colorado Colorado Springs
1420 Austin Bluffs Parkway, (EPC 136A)
Colorado Springs, CO 80918
www.uccs.edu<http://www.uccs.edu/>
From: Roderick Mooi [mailto:roder...@sanren.ac.za]
Sent: Tuesday, November 1, 2016 7:17 AM
To:
nfdump-disc...@lists.sourceforge.net<mailto:nfdump-disc...@lists.sourceforge.net>;
nfsen-discuss@lists.sourceforge.net<mailto:nfsen-discuss@lists.sourceforge.net>
Cc: joh...@sanren.ac.za<mailto:joh...@sanren.ac.za>; Ishmael Tsoaela
<ishm...@sanren.ac.za><mailto:ishm...@sanren.ac.za>
Subject: [Nfsen-discuss] Netflow collector hardware advice
Good day
We would like to procure a server for netflow data collection and processing
(nfdump + nfsen) and are looking for some hardware advice if you could please
assist?
Requirement:
Import flow data from 4 x 10 Gbps router interfaces
- currently we have 3 active interfaces with 5-6 Gbps of peak traffic each and
between 500k and 1mil pps each
- Juniper MX480 and Cisco ASR 9010
- unsampled (full)
Execute queries in a reasonable amount of time (ideally no more than a few
minutes but I know it depends on the complexity and time frame which will vary
depending on our use case at the time)
Spec so far:
Supermicro SSG-6028R-E1CR24N chassis
Intel CPU: 4C E5-2623V4 2.6G 10M 8GT QPI
128GB DDR4 RAM [expandable]
60TB storage (10x6TB SAS 12Gb/s) RAID6 [expandable]
2 x 200GB SSD caching drives with LSI Cachecade Pro 2.0
Questions:
CPU
- which is better for nfdump+nfsen - more cores or more speed? Or pros and cons
of each? i.e. would a 10C E5-2630V4 2.2G 25M 8GT QP be a better or worse choice?
Memory
- how much is needed? (I figure it depends on the queries, so let's say we're
looking for any connections containing a certain destination IP address within
a 1 month period)
Storage
- we'd like to keep at least 3 months data for 4x10Gbps interfaces unsampled.
How much do we need?
- 4TB or 6TB drives (Seagate Makara)
- read cache only?
Thanks very much,
Roderick
--
Engineer
South African National Research Network (SANReN)
------------------------------------------------------------------------------
Developer Access Program for Intel Xeon Phi Processors
Access to Intel Xeon Phi processor-based developer platforms.
With one year of Intel Parallel Studio XE.
Training and support from Colfax.
Order your platform today. http://sdm.link/xeonphi
_______________________________________________
Nfsen-discuss mailing list
Nfsen-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfsen-discuss