Bugs item #887491, was opened at 2004-01-30 11:02
Message generated for change (Comment added) made by loubyansky
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=376685&aid=887491&group_id=22866

Category: JBossCMP
Group: v3.2
>Status: Closed
>Resolution: Invalid
Priority: 5
Submitted By: Milen Dyankov (azzazzel)
>Assigned to: Alexey Loubyansky (loubyansky)
Summary: Problem with non-English characters in UTF-8 encoded queries

Initial Comment:
It seems that all characters coded with more than one
byte (2+ bytes) in UTF-8 encoded queries are
incorrectly parsed by [EJB/JBoss]QLParser as seen in
this log fragment:

...
2004-01-20 09:21:44,507 DEBUG
[plugins.cmp.jdbc.JDBCDynamicQLQuery.Company#ejbSelectGeneric]
DYNAMIC-QL: SELECT OBJECT(c) FROM Company c WHERE
((c.name LIKE '%ąężźćśł�ń%'))
2004-01-20 09:21:44,510 DEBUG
[plugins.cmp.jdbc.JDBCDynamicQLQuery.Company#ejbSelectGeneric]
 SQL: SELECT t0_c.symbol_ FROM COMPANY_ t0_c WHERE
(((t0_c.name_ LIKE '%|z[B�D%')))
2004-01-20 09:21:44,510 DEBUG
[plugins.cmp.jdbc.JDBCDynamicQLQuery.Company#ejbSelectGeneric]
Executing SQL: SELECT t0_c.symbol_ FROM COMPANY_ t0_c
WHERE (((t0_c.name_ LIKE '%|z[B�D%'))) 
...

I don't know much for JJTree but I feel these two
parameters:
UNICODE_INPUT = false;
JAVA_UNICODE_ESCAPE = true;
in "JBossQLParser.jjt" and "EJBQLParser.jjt" files are
responsible for this behaviour! I guess due to the way
the are set [EJB/JBoss]QLParser expects queries to be
non-Unicode encoded (1 character - 1 byte) with
characters not defined in current character set in
Unicode-escaped format (\uXXXX).
If I pass parameter like '\u0105' instead of 'ą' then
it works. 

Simply setting   JAVA_UNICODE_ESCAPE = false;
reverses the situation. I beleive this is supposed to
be the default. I have not checked if queries in other
encodings (like ISO-8859-X) work with this
configuration, but I don't see a reason why they should
not!

Tested on:
JBoss 3.2.1, 3.2.2 and 3.2.3
JVM 1.4.1_02 and 1.4.2_03
OS: Linux  2.4.22 and Widows 2000/XP

JBoss was started with "-Dfile.encoding=UTF-8"

 


----------------------------------------------------------------------

>Comment By: Alexey Loubyansky (loubyansky)
Date: 2004-02-05 01:19

Message:
Logged In: YES 
user_id=543482

Why is '&#261' supposed to be understood?
Either you provide unicode content as is (not the '&#261'
form) or you use unicode escapes as defined in the Java
spec, i.e. '\u'.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=376685&aid=887491&group_id=22866


-------------------------------------------------------
The SF.Net email is sponsored by EclipseCon 2004
Premiere Conference on Open Tools Development and Integration
See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
http://www.eclipsecon.org/osdn
_______________________________________________
JBoss-Development mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/jboss-development

Reply via email to