Re: [webkit-dev] Webkit compatibility in India - Transcoding Indic fonts

Maciej Stachowiak Wed, 19 Nov 2008 17:26:30 -0800

On Nov 19, 2008, at 10:42 AM, Jungshik Shin (신정식, 申政湜) wrote:

2008/11/6 Prunthaban Kanthakumar <[EMAIL PROTECTED]>

Now we can do the following,
1. Add an additional condition in the styleDidChange method to check whether the font-family is supported by our transcoder. (At present a fast look-up table should do, because we plan to support only a limited set of fonts.) This condition will be #ifdefed on ENABLE(TRANSCODER_SUPPORT).
Shouldn't this be triggered by (font-family, site) rather than just font-family?

Since we're looking at this as a legacy compatibility feature, and would like future sites to move to proper Unicode-encoded text, my first instinct would be {font, site} pairs. But that depends on whether we can achieve acceptable Indic browsing results with just a fixed list of sites.
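
As a rough illustration of the {font, site} pairing discussed above, here is a minimal Python sketch (not WebKit code). The font names, hosts, and the function names normalize_font_family and should_transcode are all hypothetical placeholders; the point is only that the check is keyed on the pair, so a listed font on an unlisted site is left alone.

```python
from urllib.parse import urlsplit

# Hypothetical allowlist of (lower-cased font-family, host) pairs.
# These names are placeholders, not real fonts or sites.
TRANSCODED_FONT_SITES = frozenset({
    ("examplelegacyfont", "news.example.in"),
    ("anotherlegacyfont", "portal.example.in"),
})


def normalize_font_family(name: str) -> str:
    """Strip surrounding whitespace and quotes, then lower-case the name."""
    return name.strip().strip("\"'").strip().lower()


def should_transcode(font_family: str, page_url: str) -> bool:
    """Return True only when both the font and the site are on the list."""
    host = urlsplit(page_url).hostname or ""
    return (normalize_font_family(font_family), host) in TRANSCODED_FONT_SITES
```

A fixed set like this keeps the check a single hash look-up, which matches the "fast look-up table" idea in the original proposal while narrowing it to known legacy sites.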

On a related note, I would like to mention here that we cannot go with the approach of 'one look-up table' per font-face and a single transcoder to do the look-up for all fonts. The problem is that many Indic languages use multiple code points to represent one character, and different fonts use different standards! For example, there are situations where one glyph in EOT needs to be transcoded to 5+ Unicode code points. The reverse situation is also possible. Due to these issues, we cannot go with a simple look-up table for all fonts. This forces us to write some specialized code to handle each font (there might also be some fonts where a one-to-one look-up table will be enough).
In October, I listed two alternatives for this transformation. One is adding ICU converters for Indic font encodings (ICU can deal with m-to-n mappings) and the other is implementing your own. The first was ruled out because it's not easy to add new converters on Mac OS X, where ICU is a part of the OS. There's another approach you can take: you can build ICU transliterator rules, and that seems to be the cleanest way to do this. You don't need to port or implement conversion code from another project (e.g. Padma); you just need to 'port' the conversion tables to ICU transliterator rules.
This transcoding will be invoked on the content of a text node already in Unicode, just as 'text-transform: capitalize' or 'text-transform: lowercase' is. An ICU transformer is for transforming a chunk of text in Unicode to another chunk of text in Unicode (http://www.icu-project.org/userguide/Transform.html). So, it appears to be almost a perfect fit.
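
To make the m-to-n point concrete, here is a minimal sketch in plain Python of a rule table applied with longest-match-first, which is the general shape a set of transliterator rules takes. It does not use the ICU API, and the rule table is invented for an imaginary font: the glyph codes on the left are hypothetical, and only the Tamil code points on the right are real Unicode characters.

```python
# Hypothetical rules for an imaginary legacy font. Keys are glyph codes as
# they appear in the page text; values are the Unicode text they stand for.
RULES = {
    "A": "\u0B95",              # 1 -> 1: TAMIL LETTER KA
    "X": "\u0B95\u0BCD\u0BB7",  # 1 -> 3: KA + VIRAMA + SSA
    "a": "\u0B85",              # 1 -> 1: TAMIL LETTER A
    "ab": "\u0B94",             # 2 -> 1: two glyph pieces form TAMIL LETTER AU
}


def build_transcoder(rules):
    """Return a function that rewrites text using longest-match-first rules."""
    max_len = max(len(source) for source in rules)

    def transcode(text):
        out = []
        i = 0
        while i < len(text):
            # Try the longest candidate first so "ab" wins over "a".
            for n in range(min(max_len, len(text) - i), 0, -1):
                chunk = text[i:i + n]
                if chunk in rules:
                    out.append(rules[chunk])
                    i += n
                    break
            else:
                # No rule matched: pass the character through unchanged.
                out.append(text[i])
                i += 1
        return "".join(out)

    return transcode


transcode = build_transcoder(RULES)
```

With these made-up rules, transcode("abXa") yields AU, then KA + VIRAMA + SSA, then A. A plain one-to-one table could express neither the "X" rule nor the "ab" rule, which is why per-font rule sets are being discussed rather than a single shared table.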

This sounds like it would work for any ICU-based port, though it would prevent the feature from working for ports that use something other than ICU for Unicode and text transcoding support, most notably the Qt port. Would it simplify the code significantly to make it an ICU transformer rather than something custom?


Regards,
Maciej

_______________________________________________
webkit-dev mailing list
[email protected]
http://lists.webkit.org/mailman/listinfo.cgi/webkit-dev
