Tucker, Wanli,
I've done some testing with OL 4.0.15 and Webtop running 1.5.2 running
on OL 4.0.15. While the fix works for OL trunk, it doesn't work for
4.0.15. Here's the debug output for test file "test/css/encoding/
utf8_with_BOM_no_charset_attr.lzx". The UTF-8 encoded data is read
correctly.
http://localhost:8080/4.0.15/test/css/encoding/utf8_with_BOM_no_charset_attr.lzx
09 Jun 2009 01:03:06 (127.0.0.1 15) INFO
compiler.StyleSheetCompiler - StyleSheetCompiler.compile called!
09 Jun 2009 01:03:06 (127.0.0.1 15) INFO
compiler.StyleSheetCompiler - @charset=utf-8 found on stylesheet tag
09 Jun 2009 01:03:06 (127.0.0.1 15) INFO
compiler.StyleSheetCompiler - reading in stylesheet from
src="utf8_with_BOM_no_charset_attr.css"
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG compiler.FileResolver -
Resolving pathname: utf8_with_BOM_no_charset_attr.css and base: /
Users/rajubitter/src/svn/openlaszlo/4.0.15/test/css/encoding
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG compiler.FileResolver -
Resolved utf8_with_BOM_no_charset_attr.css to /Users/rajubitter/src/
svn/openlaszlo/4.0.15/test/css/encoding/
utf8_with_BOM_no_charset_attr.css
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG cm.DependencyTracker -
addFile Path is /Users/rajubitter/src/svn/openlaszlo/4.0.15/test/css/
encoding/utf8_with_BOM_no_charset_attr.css
09 Jun 2009 01:03:06 (127.0.0.1 15) INFO css.CSSHandler -
creating CSSHandler
09 Jun 2009 01:03:06 (127.0.0.1 15) INFO css.CSSHandler -
Trying to parse CSS with charset setting of utf-8
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG utils.FileUtils -
Read the following bytes: text[name='german'] {
backgroundColor: #ffcc00;
buttonText: "öäüÖÄܧ";
}
text[nam
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG utils.FileUtils -
Testing for UTF-8 BOM!
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG utils.FileUtils -
Found BOM on file, encoding is UTF-8
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG css.CSSHandler -
Opening CSS file /Users/rajubitter/src/svn/openlaszlo/4.0.15/test/
css/encoding/utf8_with_BOM_no_charset_attr.css using encoding UTF-8
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG css.CSSHandler -
Skip first 3 bytes containing UTF-8 BOM
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - compiling CSSHandler using new unique
names
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - Conditional selector: [name="german"]
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - Attribute condition
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - simple selector:text
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - Cond string: { attrname: "name",
attrvalue: "german", simpleselector: "text"}
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - Conditional selector: [name="korean"]
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - Attribute condition
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - simple selector:text
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - Cond string: { attrname: "name",
attrvalue: "korean", simpleselector: "text"}
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - Conditional selector: [name="chinese"]
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - Attribute condition
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - simple selector:text
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - Cond string: { attrname: "name",
attrvalue: "chinese", simpleselector: "text"}
09 Jun 2009 01:03:06 (127.0.0.1 15) DEBUG
compiler.StyleSheetCompiler - whole stylesheet as css $lzc
$style._addRule(new $lzc$rule({ attrname: "name", attrvalue:
"german", simpleselector: "text"}, {backgroundColor: 0xFFCC00,
buttonText: "???????"}));
$lzc$style._addRule(new $lzc$rule({ attrname: "name", attrvalue:
"korean", simpleselector: "text"}, {backgroundColor: 0xFF00CC,
buttonText: "? ?? ??? ?? ?? ?? ?? ??"}));
$lzc$style._addRule(new $lzc$rule({ attrname: "name", attrvalue:
"chinese", simpleselector: "text"}, {backgroundColor: 0x00FFCC,
buttonText: "???????????????"}))
But the debug console in the browser shows the following error message:
ERROR: Unknown cohort for rule: #0 * (undefined)
ERROR: Unknown cohort for rule: #1 * (undefined)
ERROR: Unknown cohort for rule: #2 * (undefined)
WARNING @utf8_with_BOM_no_charset_attr.lzx#7: No CSS value found for
node «lz.text#3| #german» for property name buttonText
WARNING @utf8_with_BOM_no_charset_attr.lzx#7: No CSS value found for
node «lz.text#3| #german» for property name backgroundColor
WARNING @utf8_with_BOM_no_charset_attr.lzx#8: No CSS value found for
node «lz.text#4| #korean» for property name buttonText
WARNING @utf8_with_BOM_no_charset_attr.lzx#8: No CSS value found for
node «lz.text#4| #korean» for property name backgroundColor
WARNING @utf8_with_BOM_no_charset_attr.lzx#9: No CSS value found for
node «lz.text#5| #chinese» for property name buttonText
WARNING @utf8_with_BOM_no_charset_attr.lzx#9: No CSS value found for
node «lz.text#5| #chinese» for property name backgroundColor
Any idea what this might be?
- Raju
On Jun 8, 2009, at 9:42 PM, P T Withington wrote:
[cc-ing Wanli]
Wanli,
Maybe you also want to review (or have someone on your team review)
as this issue affects you?
---
Raju,
This looks good.
I think you might have trouble trying checking in your utf-16
examples to svn. I'm not sure svn will do the right thing. It
looks like it is treating them as binary files. Maybe that will
work. I don't know.
It looks to me like you may also have some tab characters in your
sources. You need to expand tabs to check in, otherwise the pre-
commit filter will reject your checkin.
I approve this change. You might want to check in your fix
separately from your tests, just in case the UTF-16 files cause svn
issues.
On 2009-06-08, at 14:44EDT, Raju Bitter wrote:
Added a "+" in debug output string concatenation.
Change 20090608-raju-Y by [email protected] on
2009-06-08 20:09:02 CEST
in /Users/rajubitter/src/svn/openlaszlo/trunk-cssunicode
for http://[email protected]/openlaszlo/trunk
Summary: Fix for CSS parser uses incorrect file encoding
New Features: Adds an optional @charset to the stylesheet tag, in
case the user wants to use a CSS file in a different encoding then
utf-8
Bugs Fixed: LPP-8045
Technical Reviewer: ptw
QA Reviewer: (pending)
Doc Reviewer: (pending)
Documentation:
Release Notes:
Details:
+ StyleSheetCompiler.java: Handling for @charset on stylesheet tag
added. Added a
2nd parameter with the encoding value to the CSSHandler.parse() call.
+ CSSHandler.java:
parse() method takes the encoding from the LZX stylesheet tag as a
2nd parameter.
getInputSource() method does a few more things now:
1) checks for a possible BOM on the CSS file
2) checks if a possible BOM conflicts with the value of the
stylesheet tag's @charset
3) if there's a BOM, the BOM bytes are removed from the input stream
+ FileUtils.java:
Added method public static String
detectBOMEncoding(BufferedInputStream in)
The method returns the BOM marker interpreted as one of the
following strings:
UTF-8
UTF-16LE
UTF-16BE
Tests:
+ test files in folder test/css/encoding
The following test exist:
1) iso8859-1_with_charset_attr.lzx
Reading an iso-8859-2 encoded CSS file with some German special
chars
2) utf16BE_with_BOM.lzx
Reading an utf-16 BE CSS file with BOM marker
3) utf16LE_with_BOM.lzx
Reading an utf-16 LE CSS file with BOM marker
4) utf8_with_BOM_no_charset_attr.lzx
Reading an utf-8 CSS file with BOM and no charset attribute on
the stylesheet tag
5) utf8_with_BOM_conflicting_charset_attr.lzx
Reading a CSS with @charset value of utf-16, but CSS having a
UTF-8 BOM marker, will
throw a compile error
Files:
A test/css/encoding
A test/css/encoding/iso8859-1_with_charset_attr.lzx
A test/css/encoding/utf16BE_with_BOM.lzx
A test/css/encoding/utf8_with_BOM_no_charset_attr.css
A test/css/encoding/utf16LE_with_BOM.css
A test/css/encoding/utf8_no_BOM_no_charset_attr.css
A test/css/encoding/utf8_with_BOM_conflicting_charset_attr.css
A test/css/encoding/utf8_with_BOM_no_charset_attr.lzx
A test/css/encoding/utf16LE_with_BOM.lzx
A test/css/encoding/utf8_no_BOM_no_charset_attr.lzx
A test/css/encoding/iso8859-1_with_charset_attr.css
A test/css/encoding/utf16BE_with_BOM.css
A test/css/encoding/utf8_with_BOM_conflicting_charset_attr.lzx
M WEB-INF/lps/server/src/org/openlaszlo/css/CSSHandler.java
M WEB-INF/lps/server/src/org/openlaszlo/utils/FileUtils.java
M WEB-INF/lps/server/src/org/openlaszlo/compiler/
StyleSheetCompiler.java
Changeset: http://svn.openlaszlo.org/openlaszlo/patches/20090608-raju-Y.tar