Re: proposal related to allow-html defaults

David E Jones Sun, 28 Jun 2009 23:58:53 -0700


On Jun 28, 2009, at 2:53 PM, Harmeet Bedi wrote:

Hi David,

Read the threads on the topic. thanks for pointing them out.

For safe allow-html="safe" it does not seem to work well with these 2
examples
Bold a word: Hi <span style="font-weight: bold;">There</span><br>
The converted safe html is Hi There<br>
Color a word: Hi <span style="color: rgb(255, 0, 0);">There</span><br>
The converted safe html is Hi There<br>

Color, bold are relatively common operations. I don't know how toimprovethe allow-html options to make them better, but not being able toput simple

text and rich text features is a pain.

The original configuration I put in SVN is the one from the ESAPIproject based on what is allowed on slashdot. We can certainly changeit...

The configuration is in the "antisamy-esapi.xml" file. I just did asmall update to allow the span tag, though I think it doesn't allowthe style attribute. We may want to allow that too... any thoughtsanyone?

On a side note, bold is easier to do with the <b> tag, or with a <h?>tag. I don't know if there is a good alternative for color though...

That said, we should see what the WYSIWIG HTML textarea editor inOFBiz now generates and make sure that is supported, and of course ifthere are any that others are using that generate HTML that OFBiz isnot accepting as "safe" we should change it for those too.

In short, I don't think the antisamy-esapi.xml file has been touchedat all since I put the original slashdot based one in there, and itcertainly should be touched for these things!

I realize that selectively turning off html is not a good idea.Security isas good as the weakest link in chain.. So an opening anywhere wouldbe bad.There are a few places in ofbiz where allow-html="any" is specified.You may
want to remove those as well to keep security barrier high.

Do you have any specific instances of this you have noticed? I'vetried to keep an eye on commits to look for service attributes thatare set this way but that may not come from a reliable/safe source...but I know I miss a lot of stuff and honestly don't read every line ofevery commit.


-David

On Mon, Jun 22, 2009 at 2:56 PM, David E Jones <[email protected]> wrote:
Thank you for writing this up Harmeet. Before getting into thedetails itis important that you understand that you are proposing to changesomething,or in fact basically get rid of something, that was discussed in alot ofdetail and that has seen effort from quite a few people. In generalwhenrecommending to change something it's a good idea to understandwhere thatcame from and why things are done the way they are. Based on whatyou'vewritten in this and other messages it does not seem that you havedone thisresearch. I might be wrong, and if so I apologize, but in order tocontinuewith a discussion like this it is important that you know thebackground and
understand the problem and the reasons for the solutions.
There is a good thread that discusses most of this, and it isavailable
here:

http://www.nabble.com/Security-Issues-td21622188.html


On Jun 22, 2009, at 11:31 AM, Harmeet Bedi wrote:
A issue(https://issues.apache.org/jira/browse/OFBIZ-2645) is thatdefault
value of allow-html is very restrictive.
This is as per start with most constrained best practice in security

Default is allow-html="none"
It not only does not allow html but it also does not allow simpletext
like "Tom's age is likely > Paul's age". '>' breaks.
I agree this could be improved, but the trick is how do we tell the
difference? One interesting page that shows just how difficult itis torecognize HTML is this one which is oriented around finding hacksrelated to
this very problem in various browsers:

http://ha.ckers.org/xss.html
What you have mentioned is definitely a weakness in the currentcode, whichis admittedly fairly simple, and an opportunity to improve thatcode. Idon't think it would be a good idea to change the overall approachand throwaway the attempt to filter incoming text, especially text coming infrom
untrusted sources.
Around the time this was written I asked for recommendations fromanyone onthe mailing list that might be more familiar with this topic,especially as
it relates to what a browser will or will not interpret as markup. My
limited knowledge on the topic is that you pretty much have to have
something that looks like a tag, in other words starts with a lessthan oropen angle bracket and has some sort of word after it, in order forthe
browser to consider it markup and try to interpret it.
One thing that might work is to allow a > if there is no <. I'm notsure
about the other way around because if there is otherwise valid HTML a
browser might not care about a missing > to close a tag. It mayalso bevalid to allow a < if there is a space after it, but I'm not sureabout that
either.
Please keep in mind that the reason for this is to not allow abuseof aproduction system. If that is not important to you or to yourclients thenyou can certainly disable it globally. However, I'm guessing that'snot whatyou would want to do and the best approach for all of this is tosimply
improve the code so that it allows as much as possible while still
protecting against the security threat it is meant to.
allow-html="safe" also does not seem to work well. It does notallow well
formed html.
Could you be more specific? Depending on what you're seeing theremay very
well be a bug with the safe HTML code, or just a need to change the
antisamy-esapi.xml configuration file for it. In any case, it wouldbe goodto find out more about what you mean by this because it sounds likea bug.
Here is a proposal:
HTML and descriptive text with characters like '>' should beallowable
whenever there is a description text input by user.
Change services that deal with description/comments/reason/noteInfo fields
to have allow-html="any"
An example of this is 'updateWorkEffortNote' service. it couldchange from
 <service name="updateWorkEffortNote" engine="simple"
location="component://workeffort/script/org/ofbiz/workeffort/workeffort/WorkEffortSimpleServices.xml"
invoke="updateWorkEffortNote" auth="true">
     <description>Update a WorkEffort Note</description>
     <attribute name="workEffortId" type="String" mode="IN"
optional="false"/>
<attribute name="noteId" type="String" mode="IN"optional="false"/>
     <attribute name="internalNote" type="String" mode="IN"
optional="false"/>
<attribute name="noteInfo" type="String" mode="IN"optional="true"/>
 </service>


to

 <service name="updateWorkEffortNote" engine="simple"
location="component://workeffort/script/org/ofbiz/workeffort/workeffort/WorkEffortSimpleServices.xml"
invoke="updateWorkEffortNote" auth="true">
     <description>Update a WorkEffort Note</description>
     <attribute name="workEffortId" type="String" mode="IN"
optional="false"/>
<attribute name="noteId" type="String" mode="IN"optional="false"/>
     <attribute name="internalNote" type="String" mode="IN"
optional="false"/>
<attribute name="noteInfo" type="String" mode="IN"optional="true"
allow-html="any"/>
 </service>
if this seems acceptable, i can send patches with services forreview and
commits.
I mentioned some things related to this in my comments above, but in
general no, my opinion is that we should not open up this securityhole inso many places by default. I would be very interested to hear whatothershave to say and how others would like to see this working, but itmay bethat they are not talking so much because this has already beendiscussed.
In general it is not safe to assume that output filtering willalways takecare of this problem, and so when ever text comes from an un-trusted source,or a potentially un-trusted source, we should filter the input toavoid the
problem right there.
I guess this goes back to what I said at the beginning of thismessage. Itwould be very valuable to put this in the context of what hasalready beenresearched and discussed. What you have written here would be a lotmoremeaningful and a lot easier to discuss if you referred back to theoriginaldiscussion and decisions that lead to the functionality you arelooking at.What I mean by that is that anything that has been discussed anddecided oncan certainly be changed, but not without referring back to theoriginal
discussion and decisions.

-David

Re: proposal related to allow-html defaults

Reply via email to