Re: proposal related to allow-html defaults

David E Jones Mon, 22 Jun 2009 11:57:41 -0700

Thank you for writing this up, Harmeet. Before getting into the details it is important that you understand that you are proposing to change something, or in fact basically get rid of something, that was discussed in a lot of detail and that has seen effort from quite a few people. In general when recommending to change something it's a good idea to understand where that came from and why things are done the way they are. Based on what you've written in this and other messages it does not seem that you have done this research. I might be wrong, and if so I apologize, but in order to continue with a discussion like this it is important that you know the background and understand the problem and the reasons for the solutions.

There is a good thread that discusses most of this, and it is available here:


http://www.nabble.com/Security-Issues-td21622188.html


On Jun 22, 2009, at 11:31 AM, Harmeet Bedi wrote:

An issue (https://issues.apache.org/jira/browse/OFBIZ-2645) is that the default value of allow-html is very restrictive.
This follows the security best practice of starting with the most constrained setting.

Default is allow-html="none"
It not only does not allow HTML but it also does not allow simple text like "Tom's age is likely > Paul's age". The '>' breaks it.

I agree this could be improved, but the trick is how do we tell the difference? One interesting page that shows just how difficult it is to recognize HTML is this one, which is oriented around finding hacks related to this very problem in various browsers:


http://ha.ckers.org/xss.html

What you have mentioned is definitely a weakness in the current code, which is admittedly fairly simple, and an opportunity to improve that code. I don't think it would be a good idea to change the overall approach and throw away the attempt to filter incoming text, especially text coming in from untrusted sources.

Around the time this was written I asked for recommendations from anyone on the mailing list who might be more familiar with this topic, especially as it relates to what a browser will or will not interpret as markup. My limited understanding is that you pretty much have to have something that looks like a tag, in other words something that starts with a less-than sign (open angle bracket) and has some sort of word after it, in order for the browser to consider it markup and try to interpret it.

One thing that might work is to allow a > if there is no <. I'm not sure about the other way around because if there is otherwise valid HTML a browser might not care about a missing > to close a tag. It may also be valid to allow a < if there is a space after it, but I'm not sure about that either.
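
As a rough illustration of the heuristic described above (allow a > when there is no <, and allow a < that is not followed by something tag-like), here is a minimal Python sketch. The function name looks_like_markup and the regular expression are made up for this example and are not the OFBiz code. It also assumes a browser only starts a tag when < is followed by a letter, "/", "!" or "?", which may not hold for every browser or encoding trick on the page linked above, so treat it as a starting point for discussion and not as a sanitizer.

```python
import re

# Assumption (not from OFBiz): a browser only starts parsing a tag when "<"
# is immediately followed by a letter, "/", "!" or "?".
TAG_START = re.compile(r"<[A-Za-z/!?]")


def looks_like_markup(text: str) -> bool:
    """Return True if text contains something a browser might parse as a tag."""
    if "<" not in text:
        # A ">" with no "<" anywhere has no tag to close, so allow it.
        return False
    # A "<" followed by a space, a digit, or the end of the text is allowed;
    # only a tag-like start is flagged. A missing closing ">" does not matter.
    return TAG_START.search(text) is not None


if __name__ == "__main__":
    for sample in ["Tom's age is likely > Paul's age", "a < b", "<img src=x"]:
        print(repr(sample), looks_like_markup(sample))
```

With this check the "Tom's age is likely > Paul's age" example would be accepted, while an unclosed tag such as "<img src=x" would still be flagged.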

Please keep in mind that the reason for this is to not allow abuse of a production system. If that is not important to you or to your clients then you can certainly disable it globally. However, I'm guessing that's not what you would want to do and the best approach for all of this is to simply improve the code so that it allows as much as possible while still protecting against the security threat it is meant to.

allow-html="safe" also does not seem to work well. It does not allow well-formed HTML.

Could you be more specific? Depending on what you're seeing there may very well be a bug with the safe HTML code, or just a need to change the antisamy-esapi.xml configuration file for it. In any case, it would be good to find out more about what you mean by this because it sounds like a bug.

Here is a proposal:
HTML and descriptive text with characters like '>' should be allowable whenever there is a description text input by the user. Change services that deal with description/comments/reason/noteInfo fields to have allow-html="any".
An example of this is the 'updateWorkEffortNote' service. It could change from
   <service name="updateWorkEffortNote" engine="simple"
           location="component://workeffort/script/org/ofbiz/workeffort/workeffort/WorkEffortSimpleServices.xml"
           invoke="updateWorkEffortNote" auth="true">
       <description>Update a WorkEffort Note</description>
       <attribute name="workEffortId" type="String" mode="IN" optional="false"/>
       <attribute name="noteId" type="String" mode="IN" optional="false"/>
       <attribute name="internalNote" type="String" mode="IN" optional="false"/>
       <attribute name="noteInfo" type="String" mode="IN" optional="true"/>
   </service>


to

   <service name="updateWorkEffortNote" engine="simple"
           location="component://workeffort/script/org/ofbiz/workeffort/workeffort/WorkEffortSimpleServices.xml"
           invoke="updateWorkEffortNote" auth="true">
       <description>Update a WorkEffort Note</description>
       <attribute name="workEffortId" type="String" mode="IN" optional="false"/>
       <attribute name="noteId" type="String" mode="IN" optional="false"/>
       <attribute name="internalNote" type="String" mode="IN" optional="false"/>
       <attribute name="noteInfo" type="String" mode="IN" optional="true" allow-html="any"/>
   </service>
If this seems acceptable, I can send patches with services for review and commit.

I mentioned some things related to this in my comments above, but in general no, my opinion is that we should not open up this security hole in so many places by default. I would be very interested to hear what others have to say and how others would like to see this working, but it may be that they are not talking so much because this has already been discussed.

In general it is not safe to assume that output filtering will always take care of this problem, and so whenever text comes from an untrusted source, or a potentially untrusted source, we should filter the input to avoid the problem right there.
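
To make the layering concrete, here is a small Python sketch of filtering on input while still escaping on output. Both function names are hypothetical, and accept_input is a deliberately strict stand-in written for this example, not the OFBiz implementation of allow-html; html.escape is the standard library call.

```python
import html


def accept_input(text: str) -> str:
    """Hypothetical input filter: reject anything containing "<"."""
    if "<" in text:
        raise ValueError("markup-like input rejected")
    return text


def render(text: str) -> str:
    """Escape on output as a second layer, not as the only protection."""
    return html.escape(text)


if __name__ == "__main__":
    stored = accept_input("a > b")  # passes the input filter
    print(render(stored))           # escaped again when written to a page
```

The point of the sketch is only that the two steps are independent: if some screen or export forgets to call the output step, the stored text has still been through the input check.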

I guess this goes back to what I said at the beginning of this message. It would be very valuable to put this in the context of what has already been researched and discussed. What you have written here would be a lot more meaningful and a lot easier to discuss if you referred back to the original discussion and decisions that led to the functionality you are looking at. What I mean by that is that anything that has been discussed and decided on can certainly be changed, but not without referring back to the original discussion and decisions.


-David
