Smart input filtering

Simon Willison Sun, 20 Nov 2005 16:11:26 -0800

I had the chance to chat with Rasmus Lerdorf a few weeks ago, and oneof the topics that came up was input filtering for web applicationsecurity. Rasmus knows a heck of a lot about this stuff (don't letPHP's past mistakes let you think otherwise) and described thefollowing scheme which I think Django could make good use of.

Basically, instead of accessing data from GET and POST directly,applications use utility functions that filter depending on what theapplication is asking for. Say you want to get an integer thatsomeone has entered. In current Django, you might do this:


a = int(request.GET['a'])

With smart input filtering, you would do something like this instead:

a = request.GET.as_int('a')

Functions like this can be created for all kinds of data. Here are afew examples off the top of my head:


a = request.GET.as_email('email')
a = request.GET.as_float('f')
a = request.GET.as_safe_html('body')

This is great for people who use them, but what about developers wholack the discipline to do so? The proposed solution is to strip /anything/ that is potentially harmful from all input unless expresslytold otherwise. Consider the following input data:

This has <script>alert('scary')</script> looking code in it as wellas some \00 null bytes and other weird escape characters \\'''';DELETE FROM pages;

Accessed through the regular method it would automatically havepotential nasties stripped:


>>> text = request.GET['body']
>>> text

This has alert(scary) looking code in it as well as some null bytesand other weird escape characters DELETE from pages

(I don't know if stripping should be this aggressive; this is just anexample)


If you want the raw data without stripping applied, you do this:

text = request.GET.as_raw('body')

The idea is to make developers have to go out of their way to avoidinput filtering. This is certainly something that would greatlybenefit the world of PHP. Developers who use Django may like to thinkthemselves above such mistakes, but mistakes are easy to make.Functionality like this, even if limited to the as_email etc. inputfilters rather than filtering everything by default, would make it alot harder to mistakenly create insecure apps.


Cheers,

Simon

Smart input filtering

Reply via email to