Re: [squid-users] Squid on DualxQuad Core 8GB Rams - Optimization - Performance - Large Scale - IP Spoofing

Haytham KHOUJA (devnull) Sun, 14 Oct 2007 09:35:16 -0700

Hello Marcus,

I'll send all this info within the week, as I'll be running many tests on a live large-scale system. I'll submit the results soon.

Please specify the types of tests you'd like me to run.


Marcus Kool wrote:

Hi Haytham,
you stated that the current Squid server is faster than the NetCache boxes.
Just to do a fair and clear comparison (I and possibly others would like to
have a clearer picture of how NetCache and Squid compare to each other): can you give the NetCache spec (model, version, memory size, etc.), and do you have
numbers like reqs/sec or delays?

thanks
Marcus

Haytham KHOUJA (devnull) wrote:
Dear Amos,
Thank you for your reply, check my comments:

Amos Jeffries wrote:
Haytham KHOUJA (devnull) wrote:
Hello,
The purpose of this thread is to join forces to arrive at the best Squid configuration for generic, affordable Intel machines available from major vendors (Dell/HP...), specifically for ISPs and corporations that want a basic setup with optimal response and throughput and maximum bandwidth savings.
I work for an important ISP and recently replaced 2 NetApp NetCache boxes with 3 Dell 2950s hooked up to a Foundry switch for load balancing.
I used tproxy to enable IP spoofing of the outgoing address, together with some configuration on the Cisco core router. I had to compile iptables and tproxy against a Debian kernel source (2.6.18).
I've read almost every single thread on optimizing Squid and Linux and want to share my setup with you.
I do have some questions, clarifications and bugs, but overall the performance is pretty impressive. (Yes, much better than the NetApps.)
What I want to do, since I have 8 GB of RAM, is store more hot objects in RAM to maximize the memory hit ratio, but with my setup Squid doesn't go above 2~3 GB of usage. (Remember that there are no other heavy processes on the machine.)
You will need a 64-bit enabled squid to go higher than 2GB.
Yeah, I hope I'll be able to replace the CPUs.
If I had known beforehand that Squid doesn't make use of SMP, I wouldn't have bought dual quad-core CPUs and would have invested in Intel CPUs with 8 MB of cache, but what's done is done :)
Previously, Squid went down because of file descriptor limits (maximum open files) and ip_conntrack filling up; I fixed both with some iptables and sysctl configuration.
Now I'm hitting an "Oct 14 01:17:06 proxy4 squid[8883]: assertion failed: diskd/store_io_diskd.c:384: "!diskdstate->flags.close_request" error, so Squid dies and restarts (which flushes the memory cache).
I'm looking forward to contributions, idea sharing and corrections, to make this a standard setup for a large-scale, well-optimized, high-performance Squid and a basis for future tweaking. I hope this configuration can then be uploaded to the Squid wiki.
Post your squid.conf to
  http://squid.treenet.co.nz/cf.check/
and review the results. I've pointed out the biggest worries below.
Here's my setup:
Dell 2950
Dual Quad Core 2.4 GHz / 8 GB RAM / 4x 136 GB 15000 RPM drives
I have 3 cache_dir entries on separate drives, and I formatted the 3 disks with ReiserFS:
   /dev/sdb1       /CACHE1 reiserfs notail,noatime         0 0
   /dev/sdc1       /CACHE2 reiserfs notail,noatime         0 0
   /dev/sdd1       /CACHE3 reiserfs notail,noatime         0 0

I run Debian GNU/Linux Etch and compiled Squid with the following:
Squid Cache: Version 2.6.STABLE16
configure options: '--bindir=/usr/bin' '--sbindir=/usr/sbin/' '--sysconfdir=/etc' '--enable-icmp' '--enable-snmp' '--enable-async-io' '--enable-linux-netfilter' '--enable-linux-tproxy' '--with-dl' '--with-large-files' '--enable-large-cache-files' '--with-maxfd=1000000' '--enable-storeio=diskd,ufs' '--with-aio' '--enable-epoll' '--disable-ident-lookups' '--enable-removal-policies=heap' 'CFLAGS=-DNUMTHREADS=120'
As you can see, I have the following modules enabled: linux-tproxy, diskd, epoll, and removal policies.
/dev/epoll improves network I/O performance, diskd moves disk I/O to separate processes (which reduces the time Squid is blocked writing to disk), and I read benchmarks for the memory and disk removal policies.
aufs does a better job, particularly where threads are available, and is not quite so broken as diskd.
I will recompile, use aufs and do more testing.
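As a rough sketch of what the switch to aufs might look like for the three cache directories in this thread: this assumes Squid has been rebuilt with aufs listed in --enable-storeio, and it drops the Q1/Q2 options, which are specific to diskd. The sizes and directory levels are simply carried over from the diskd lines further down.

```conf
# Assumes a rebuild with e.g. --enable-storeio=aufs,ufs
cache_dir aufs /CACHE1 61440 16 256
cache_dir aufs /CACHE2 61440 16 256
cache_dir aufs /CACHE3 61440 16 256
```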
My /etc/squid.conf is composed of the following:

http_port 80 transparent tproxy
tcp_outgoing_address IP of the Machine
:: Those are for IP spoofing and transparency

via off
forwarded_for off
:: Those are for total transparency; remote hosts will never guess that the request came from a proxy
IIRC, there's more than this needed for complete silence. They just replace the Via and Forwarded-For with the text 'unknown', still leaving the headers in place for anon-proxy identification.
True, but this is used with tproxy for IP spoofing.
cache_mem 600 MB
:: A bit confused about this. When I go higher than 2 GB, Squid dies with an "out of memory" error. I have 8 GB and want to make maximum use of it.
cache_effective_user nobody
cache_effective_group nogroup
:: Security and bla bla
So I can leave it at 2 GB maximum? The OS will have the rest of the RAM for its own purposes.
This is the default UID. If this is going to be a standard config, these MUST NOT be explicitly set.
Also, when the GID is configured as above, it will in fact cause a squid-specific deviation from the configured OS-level security policy.
They are no longer to be used, unless the machine-specific setup requires it AND the admin knows how to set them up properly.
cache_replacement_policy heap LFUDA
memory_replacement_policy heap GDSF
:: Very objective, you can google about them

cache_dir diskd /CACHE1 61440 16 256 Q1=144 Q2=128
cache_dir diskd /CACHE2 61440 16 256 Q1=144 Q2=128
cache_dir diskd /CACHE3 61440 16 256 Q1=144 Q2=128
:: DISKD configuration; I'm only using 60 GB of each disk

cache_access_log /var/log/squid/access.log
Obsolete option. Use access_log with same parameters instead.
Which is obsolete?
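The obsolete directive here is cache_access_log. A minimal sketch of the replacement described above, keeping the same log path as the only parameter:

```conf
# Replaces: cache_access_log /var/log/squid/access.log
access_log /var/log/squid/access.log
```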
cache_log /var/log/squid/cache.log
cache_store_log none
:: No need to log cache_store, so minimizing the Disk I/O

fqdncache_size 51200
ipcache_size 51200
:: Caching IPs/Domain Name and whatnot

pipeline_prefetch on
:: Performance enhancement

shutdown_lifetime 1 second
:: Tired of waiting whenever I restart my Squids (testing only)

read_ahead_gap 60 KB
maximum_object_size 2 GB
minimum_object_size 0 KB
maximum_object_size_in_memory 128 KB
cache_swap_high 80%
cache_swap_low 70%
half_closed_clients off
memory_pools on
positive_dns_ttl 24 hours
negative_dns_ttl 30 seconds
request_timeout 60 seconds
connect_timeout 30 seconds
pconn_timeout 30 seconds
ie_refresh on
dns_nameservers DNS1 DNS2
emulate_httpd_log off
log_ip_on_direct on
debug_options ALL, 9
Performance enhancements above to minimize disk I/O, yet you log everything at full debug? This *,9 could cause extremely high disk usage under load. Try *,1 (minimal) or *,5 (detailed overview) instead.
Will do, thanks
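A sketch of the quieter setting suggested above, written with the ALL keyword used in the original line and the minimal level 1:

```conf
# Log all debug sections at the minimal level
debug_options ALL,1
```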
pid_filename /var/run/squid.pid

My IPtables/sysctl and startup file:
#!/bin/sh
iptables -t tproxy -A PREROUTING -i eth0 -p tcp -m tcp --dport 80 -j TPROXY --on-port 80
:: I run Squids on port 80 so that I can forward all incoming requests on port 80 to the Squids at the Cisco router level
echo 1 > /proc/sys/net/ipv4/ip_forward
echo 1 > /proc/sys/net/ipv4/ip_nonlocal_bind
echo 0 > /proc/sys/net/ipv4/conf/all/rp_filter
echo 1024 65535 > /proc/sys/net/ipv4/ip_local_port_range
echo 102400  > /proc/sys/net/ipv4/tcp_max_syn_backlog
echo 1000000 > /proc/sys/net/ipv4/ip_conntrack_max
echo 1000000 > /proc/sys/fs/file-max
echo 60 > /proc/sys/kernel/msgmni
echo 32768 > /proc/sys/kernel/msgmax
echo 65536 > /proc/sys/kernel/msgmnb
:: Maximizing Kernel configuration

ulimit -HSn 1000000
/etc/init.d/squid stop
/etc/init.d/squid start
:: Re-enforcing ulimit parameters for the Squid process.

Thank you
No, thank you.

Amos
