Simon Groenewolt <[EMAIL PROTECTED]> wrote:
We have a submit form in a site to let visitors submit their own story. If you don't specify a contentType in the form page "strange" characters like a e with a trema will give problems (everything appearing after that character won't arrive in the mmbase object).
Specifying a contentType works, but should we use utf-8 or iso-8859-1?
-- I thought the default for webpages (HTML) was iso-8859-1, but I'm nogt sure and maybe mmbase has it's own preferences for one of these encodings. Java is more utf-8 minded isn't it? I looked at the JSP-editors and I think they are using utf-8 as encoding.
JSP-editors indeed do use UTF-8. Not specifying an encoding in HTTP-headers might indeed be interpreted as ISO-8859-1, but I consider this legacy.
If there's no character encoding specified the servlet spec says it must be interpreted as iso-8859-1.
Gerard
