On 11/21/2017 05:36 AM, Kevin A. McGrail wrote:
On 11/21/2017 12:33 AM, Matthias Leisi wrote:
In addition to server-side blocking, would it make sense for
sa-update to rate-limit itself?
— Matthias
Yes, good idea. Added to my notes.
Based on my logs, it appears the 3.4.x sa-update's check the DNS TXT
record, compare that to it's last update, and don't pull anything if it
was the same. From the user agents in the web logs, it just looks like
very old 3.3.x sa-updates aren't behaving so they need the mod_evasive.
Then there are curls that are pounding away with the vast majority (>
80%) of hits that also need mod_evasive/.htaccess.
Did some versions of sa-update ever use curl to do the fetching or are
these home-grown scripts trying to pull down for their own private mirror?
Von meinem iPhone gesendet
Am 21.11.2017 um 03:53 schrieb Kevin A. McGrail
<kevin.mcgr...@mcgrail.com>:
On 11/20/2017 7:17 PM, Dave Jones wrote:
Could we use something like mod_evasive to limit any IP connecting
more than 3 times (one batch of ruleset files) an hour? SA
instances behind NAT'd IPs could cause a legitimate reason for more
than 2x hits per day.
I'd like to keep it simpler for now. The abuse hasn't been too bad.
I've put them on notice on the users@ list and I'm going to look at
adding more information such as a unique id to sa-update's call for
wget/curl so we can identify NAT'ing.
There may be some abusers in the future that we would want to
permanently block with a centralized .htaccess file that gets
distributed with the normal rsync pulls by each mirror.
Agreed. Let's keep an eye on things.
So from the last 3.8mm GETs Top 14 IPs
(grep GET sa-update.pccc.com-access_log | awk -F" " '{ print $1 }' |
sort | uniq -c | sort -n -r | head -n 14)
964649 52.169.9.191 (Machine we already had taken care of)
71273 176.61.138.136
40397 41.76.211.56
22535 108.163.197.66
21100 108.61.28.10
21037 79.137.36.178
20270 149.56.17.151
19826 91.204.24.253
18141 178.32.88.139
18003 207.210.201.60
14037 158.69.200.153
12539 78.229.96.116
12525 37.221.192.173
11568 45.77.52.43
Looks like some overlaps in the top IPs. For the most part people are
being good with sa-update. Seems to be a small percentage that we need
to address with mod_evasive and/or .htaccess. My 2 mirrors for
sa-update.ena.com hit about 63 GB yesterday. The hourly download volume
seem to match the rule promotions at 2:30 AM UTC and 9 AM UTC. The
majority of the daily total 63 GB was 56 GB from 2:30 AM UTC to about 6
AM UTC. This is the middle of the night for US Central TZ which works
out pretty well for us.
Here are the top 10 IPs that seem to be running sa-update or a
curl script most frequently:
41.76.211.56 (sa-update/svn917659/3.3.2 every 5 minutes)
108.61.28.10 (sa-update/svn917659/3.3.2 every 15 minutes)
202.191.60.145 (curl/7.19.7 every minute rotating mirrors)
202.191.60.146 (curl/7.19.7 every minute rotating mirrors)
108.163.197.66 (sa-update/svn917659/3.3.2 every 5 minutes)
208.74.121.106 (NAT'd IP? curl/7.29.0 & curl/7.19.7)
91.204.24.253 (NAT'd IP? various user agents)
207.210.201.60
78.110.96.3
190.0.150.3
--