[web2py] Re: How can Web2py prevent web scraping?

Richard Penman Tue, 26 Aug 2014 05:38:43 -0700

I created a simpler system just counting requests and then blocking when 
exceeded maximum in a time frame:
http://www.web2pyslices.com/slice/show/1991/block-fast-bots



On Friday, May 10, 2013 1:36:22 AM UTC+2, Derek wrote:
>
> I've read an idea about using a 'ticket' system... each session gets X # 
> of tickets. Tickets regenerate at a fixed rate. Normal users would never 
> run out of tickets.
> Each query operation would have a fixed cost of tickets. Inserts would 
> cost double selects... 
> You don't have to calculate regeneration - just when an operation is about 
> to be performed, you check the last time a request was made, and the last 
> number of tickets. calculate regeneration and then if they have enough 
> tickets, you do the request. If not, you return a 503 error, or perhaps a 
> friendly message saying "swiper no swiping" (to quote Dora)
>
> On Thursday, May 9, 2013 11:58:43 AM UTC-7, Alex Glaros wrote:
>>
>> What techniques can be used in a Web2py site to prevent data mining by 
>> harvester bots?
>>
>> In my day job, if the Oracle database slows down, I go to the Unix OS, 
>> see if the same IP address is doing a-lot-faster-than-a-human-could-type 
>> queries, and then block that IP address in the firewall.
>>
>> Are there any ideas that that I could use with a Web2py website?
>>
>> Thanks,
>>
>> Alex Glaros
>>
>>

-- 
Resources:
- http://web2py.com
- http://web2py.com/book (Documentation)
- http://github.com/web2py/web2py (Source code)
- https://code.google.com/p/web2py/issues/list (Report Issues)
--- 
You received this message because you are subscribed to the Google Groups 
"web2py-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

[web2py] Re: How can Web2py prevent web scraping?

Reply via email to