Re: comment formatting and escaping is a bit kludgy

Allen Gilliland Wed, 11 Jul 2007 10:01:17 -0700

Okay, this work is basically complete, but after doing a fair amount of
testing I am a little uncertain whether our handling of HTML in comments
is working how we expected. My understanding from looking at the UI is
that we are supposed to have these options ...

1. HTML disabled and not allowed in comments. Triggered by checking
"escape comment html" on the global config page. This is also the
property which toggles the "HTML Syntax: Enabled/Disabled" text on the
comment form.

2. HTML subset available. Only certain HTML tags work. I'm not sure I
fully understand how this is supposed to be enforced; it looks like the
code tries to escape all HTML into &lt; &gt; syntax and then unescapes
certain tags.


3. HTML enabled.  Basically, anything goes.
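
For reference, the escape-then-unescape approach described in option 2 could look roughly like the Java sketch below. The class name, method names, and the tag list are hypothetical and are not taken from the existing code; it only restores bare tags without attributes.

```java
final class HtmlSubsetSketch {

    // Hypothetical subset of tags that survive escaping (an assumption).
    private static final String[] ALLOWED_TAGS = {"b", "i", "em", "strong"};

    // Escape every HTML-significant character so nothing renders as markup.
    static String escapeAll(String text) {
        return text.replace("&", "&amp;")
                   .replace("<", "&lt;")
                   .replace(">", "&gt;");
    }

    // Escape everything, then turn the allowed tags back into real markup.
    static String escapeThenRestoreAllowed(String comment) {
        String result = escapeAll(comment);
        for (String tag : ALLOWED_TAGS) {
            result = result
                .replace("&lt;" + tag + "&gt;", "<" + tag + ">")
                .replace("&lt;/" + tag + "&gt;", "</" + tag + ">");
        }
        return result;
    }
}
```

The second step only has an effect if the first one actually ran, which is why a subset option does nothing when the comment is never escaped.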

Now, from looking at the code it doesn't look like we are properly
providing these options. From my testing, when HTML is supposed to be
disabled it's not, and I could still put HTML in comments. And to a
larger degree, it looks like the escapeHTML() method isn't really doing
much. This basically means that the HTML subset option wasn't working
either because the comment was never being escaped in the first place,
so that option was just doing nothing.

So can someone verify that my assumptions above are correct and that's
the actual functionality we are shooting for?

I think that fixing #1 should be relatively easy and we can just display
a comment error message to users if they put HTML in a comment when they
aren't supposed to. To handle the conditions for #1 and #3 I'd like to
just migrate the old "escape comment html" property to a new one called
"enable html in comments". If it's enabled then we don't do anything
to check HTML in comments; if it's disabled then we reject any comments
with HTML in them and display an error message.
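
A minimal Java sketch of that check, under stated assumptions: the class, the method names, the error text, and the regex-based tag detection are all hypothetical, and the boolean stands in for the proposed "enable html in comments" property.

```java
import java.util.regex.Pattern;

final class CommentHtmlCheckSketch {

    // Crude tag detector (an assumption): "<" or "</" followed by a letter,
    // then anything up to the next ">". Plain "1 < 2" does not match.
    private static final Pattern TAG = Pattern.compile("</?[a-zA-Z][^>]*>");

    static boolean containsHtml(String comment) {
        return TAG.matcher(comment).find();
    }

    // Returns null when the comment is accepted, otherwise an error message.
    static String validate(String comment, boolean htmlEnabled) {
        if (htmlEnabled) {
            return null; // HTML enabled: no checking at all
        }
        return containsHtml(comment)
            ? "HTML is not allowed in comments"
            : null;
    }
}
```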

#2 is a bit more tricky since you then need to be parsing the comment
and looking for specific HTML. At the end of the day this one feels
more like a comment validator than a built-in option to me because
effectively all you are trying to do is check if an incoming comment
uses any HTML that you are not allowing, which is the purpose of our
comment validators. So I think that one should be rewritten as a
comment validator.
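
The core of such a validator might look like the Java sketch below. It deliberately does not implement the real comment validator interface; the class name, the allowed-tag set, and the regex are assumptions, and it inspects tag names only, not attributes.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;
import java.util.TreeSet;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class HtmlSubsetValidatorSketch {

    // Hypothetical set of permitted tag names (an assumption).
    private static final Set<String> ALLOWED =
        new HashSet<>(Arrays.asList("a", "b", "i", "em", "strong", "p", "br"));

    // Captures the tag name of any opening or closing tag.
    private static final Pattern TAG_NAME =
        Pattern.compile("</?([a-zA-Z][a-zA-Z0-9]*)[^>]*>");

    // Returns the names of tags used in the comment that are not allowed.
    // An empty result means the comment passes.
    static Set<String> disallowedTags(String comment) {
        Set<String> found = new TreeSet<>();
        Matcher m = TAG_NAME.matcher(comment);
        while (m.find()) {
            String name = m.group(1).toLowerCase(Locale.ROOT);
            if (!ALLOWED.contains(name)) {
                found.add(name);
            }
        }
        return found;
    }
}
```

A validator built on this would reject the comment whenever the returned set is non-empty and could list the offending tags in its error message.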


Does that all make sense?

-- Allen


Allen Gilliland wrote:



Dave wrote:

On 7/10/07, Allen Gilliland <[EMAIL PROTECTED]> wrote

One thing that I didn't mention but is probably worth saying ... it's
also possible to design things a bit differently such that instead of
saving comments unmodified and then applying reformatting plugins as
they get displayed, we could apply the plugins on the incoming comment
and save the transformed version.

Each approach has its own benefits.  Applying plugins on the incoming
comment simplifies things quite a bit because we no longer need to add
the 'plugins' attribute to the db or pojo and it would improve
performance a fair amount since we wouldn't need to apply XX plugins per
comment during rendering.  The downside is that since you modify the
comment before saving it you have no way of getting back to the original
comment if you wanted to, although that's not likely to be necessary.

I kind of like the idea of just applying the plugins once as the comment
is being submitted and then not having to worry about it after that, but
the solution I detailed above is slightly more flexible.
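
To make the trade-off concrete, here is a small Java sketch of the two timings. The two plugins and all names are hypothetical stand-ins for the real comment plugins; the point is only what ends up stored in each case.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.UnaryOperator;

final class CommentPluginTimingSketch {

    // Hypothetical plugins: escape HTML, then convert newlines to <br/>.
    static final List<UnaryOperator<String>> PLUGINS = Arrays.asList(
        s -> s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;"),
        s -> s.replace("\n", "<br/>")
    );

    // Runs every plugin in order over the text.
    static String applyPlugins(String text) {
        String out = text;
        for (UnaryOperator<String> plugin : PLUGINS) {
            out = plugin.apply(out);
        }
        return out;
    }

    // Approach A: store the raw comment; plugins run on every render.
    static String storedWhenSavingRaw(String submitted) {
        return submitted;
    }

    // Approach B: run plugins once on submit; the original text is lost.
    static String storedWhenTransformingOnSubmit(String submitted) {
        return applyPlugins(submitted);
    }

    // What the reader sees under approach A.
    static String displayFromRaw(String stored) {
        return applyPlugins(stored);
    }
}
```

Both approaches display the same text; they differ in how often the plugins run and in whether the original comment can be recovered.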


I don't have a strong feeling about this, but I guess I prefer to
store the raw comment data.

Actually, I remember why I abandoned this option as I was prototyping.
The major difficulty here is the upgrade process: currently we are
doing transformations of comment content as it gets displayed, so if
we wanted to change that it would mean going through all comments in the
db, applying the transformations, and then re-saving. Basically, a major
PITA.


-- Allen


- Dave
