Re: Multi-byte character support.

2001-03-27 Thread Thom Park


Kazuhiro,

thank you for your reply,
I will investigate the possibility of using tomcat 3.2.b2 -
however
it is likely I will need to await 3.2.2 milestone before
considering it
as a release candidate.
-Thom

Thom,

From: "T. Park" [EMAIL PROTECTED]
Subject: Multi-byte character support.
Date: Mon, 26 Mar 2001 10:32:31 -0800
Message-ID: [EMAIL PROTECTED]
 Tomcat 3.2[.1] doesn't seem to like static html pages (or
servlets for
 that matter) that offer multi-byte (non-latin) characters.

Tomcat 3.2.2b2 has many i18n improvements. Would you use and
check
3.2.2b3 if possible?

Of cource, a few i18n problem (for example,
ServletRequest#getProperty
method don't return value in Unicode) isn't resolved in 3.2.2.
But
they will resolve them in 3.3  4.0.

Kazuhiro Kazama ([EMAIL PROTECTED])NTT Network Innovation
Laboratories






Re: Multi-byte character support.

2001-03-27 Thread Kazuhiro Kazama

From: "Java Poop" [EMAIL PROTECTED]
Subject: Re: Multi-byte character support.
Date: Mon, 26 Mar 2001 22:33:36 -0800
Message-ID: [EMAIL PROTECTED]
 What are the il8n limitations in 3.2.1?

For example, there are the following limitations:

1, The getParameter method in ServletRequest class returns non-Unicode
strings.

If a HTTP GET or POST request is done by a HTML form, servlet can't
know its charset because the request don't have its charset
information.  The HTML 4.0 specification recommends to encode UTF-8
strings but most of systems don't do it.

Tomcat 3.2 treats parameters like their charset is ISO-8859-1. Other
servlet container may return Unicode strings by its own encoding
detection method but it is tricky and don't work under some situation.

For avoiding this problem, Servlet 2.3 API has setCharacterEncoding
method and Tomcat 4.0 based on it.

Because Tomcat 3.2 based on Servlet 2.2 API, we can't use this method.
But Costin will provide an extension module for Tomcat 3.3 (for
example, specify the default encoding per context).

2, A JSP content's encoding may not be recognized correctly.

This cause is a lack of a function to specify JSP content's encoding
(JSP output encoding is specified by contentType attribute of Page
directive).

JSP 1.2 has new pageEncoding attribute of Page directive. And a file
localization method, it is proposed by Arieh, may provide another
solution.
 
Kazuhiro Kazama ([EMAIL PROTECTED]) NTT Network Innovation Laboratories



Re: Multi-Byte character support.

2001-03-26 Thread tttye


I have helped work on a fix for this on 4.0 (which supports the
javax.servlet.ServletRequest.setCharacterEncoding() method).  Tomcat 3.x
does not support the new servlet 2.3 methods, so I did not look at fixing
it in Tomcat.
However, I have an easy to use object (included) which is able to process
the mangled string and convert it to the encoding that you specify.
This method will work for all Strings parsed into Tomcat (or other web
servers). 
Tim Tye


T. Park writes:

 
 Greetings,
 
 
 Tomcat 3.2[.1] doesn't seem to like static html pages (or servlets for
 that matter) that offer multi-byte (non-latin) characters.
 
 Has anyone coded a patch to fix this and/or have a workaround for the
 issue.
 
 I'd rather avoid having to re-invent the wheel.
 
 Does anyone have any insights as to how big a problem this is to fix?
 
 
 Cheers,
 
 Thom
 
 --
 http://www.borland.com/newsgroups
 http://www.borland.com/devsupport/disclaim.html
 
 




 ChangeStringEncoding.java


Re: Multi-byte character support.

2001-03-26 Thread Kazuhiro Kazama

Thom,

From: "T. Park" [EMAIL PROTECTED]
Subject: Multi-byte character support.
Date: Mon, 26 Mar 2001 10:32:31 -0800
Message-ID: [EMAIL PROTECTED]
 Tomcat 3.2[.1] doesn't seem to like static html pages (or servlets for
 that matter) that offer multi-byte (non-latin) characters.

Tomcat 3.2.2b2 has many i18n improvements. Would you use and check
3.2.2b3 if possible?

Of cource, a few i18n problem (for example, ServletRequest#getProperty
method don't return value in Unicode) isn't resolved in 3.2.2. But
they will resolve them in 3.3  4.0.

Kazuhiro Kazama ([EMAIL PROTECTED]) NTT Network Innovation Laboratories