Did a dynamic five language site (Japanese, Korean, Cantonese, Mandarin, 
English) using access for prototyping and oracle for publishing. The 
language code is simple, note that you can only specify one charset
per page: 

<meta http-equiv='content-type' content='text/html;charset=#bnkCharSet#'>

If you're using explorer and don't have support for that locale it will 
automatically try to download a font set for you, with Netscape you've 
to manually install the fonts. 

A list of common charsets can be found at: 
http://www.w3.org/International/O-charset-list.html

The database and interacting with it via forms was a little more tricky. 
After going through a few file formats we finally had the translators 
use notepad to translate into the different languages because double-byte 
and unicode characters were causing problems and losing certain characters 
in insert and update statements. They would save the files as text or html 
and we could simply copy paste them into the fields. Use the extended ascii 
set. Oracle has some nifty language support I would check out 
technet.oracle.com for more info, in access we used memo fields to store 
the html and translated content. On an English machine all except the html 
tags look garbled when you open the db but once you query it and throw it 
into a page with the above meta tag, the browser interprets the content and 
all's displayed fine. I have not tried this but I'm pretty sure if we had 
a Chinese version of windows/access we would see Chinese characters in 
access as well. Using ie goto ntt.co.jp and refuse to install the fonts or 
chose western European from the encoding menu under view, that's what the 
data looks like inside the field and this is what I'm referring to as 
'garbled'. One problem was updating the db from a textarea. The admins for 
the site were using English machines and were to be responsible for 
maintaining the site, but we had localized the admin pages where the 
database was to be updated using the charset code above so that when you 
copied and pasted the (garbled) text in to the textareas you could see 
the localized content and do some visual formatting if need be. Bad choice, 
since the page was 'labeled' as Japanese when you inserted the textarea 
content into the db it would come back out as Unicode. Lots of u1234, u2345 
stuff. Changed the the charset back to English and all was fine for a while. 
After a myriad of different problems in different languages similar to the 
above I switched from textareas to a modified version of activEdit 
(cfdev.com) to update the content. For some reason this dandy tag would 
always update the db in English (I'm assuming because it uses windows 
components), hence here's a solution for managing and translating localized 
content with the least amount of headache. 

-Setup acitvEdit instead of using textareas and specify the local charset 
 at the top of the page.

-Feed the panel formatted (tables, images, vs.) ENGLISH version of the 
 content. All the charsets that I used had English fonts embedded so this 
 was possible. 

-Have the translators TYPE into the panel over the English instead of 
 sending you some unrecognizable file which you have to sift through. 
 This is actually the most difficult part and a huge pia, because you 
 have to put in an incredible amount of effort to get the translators 
 to use your admin panel rather than his/her copy of word 2.0. I'll 
 explain below. This method actually solves a major organizational 
 problem. How do you know where one sentence ends and the other starts 
 if you don't know Chinese? If you end up having to send out text only 
 to get translated at least make sure you leave breadcrumbs behind)

-Update the db from activEdit, and serve it back out from the frontend 
 making sure that you specify the charset.

Well that's basically it. Except for one minor problem... the translators. 
I can't emphasize the importance of having them TYPE over the English 
content. Worst case copy and paste from NOTEPAD sentence by sentence, 
but don't let them know that upfront because more often than necessary 
they ended up copying and pasting from word documents or other similar 
fancy word processors. With activEdit that was a huge problem as word 
retains a whole bunch of utterly useless data along with the content. 
Weird div tags and incomprehensible font declarations really messes up 
your visual formatting not to mention that it usually spurts out ie only 
code. Export a localized document as html from word and you'll see what 
I mean. 

Did I mention that Arabic reads right to left? Think about it. ];-)

Hope that helps, good luck and please let me know if you're in need 
of a telecommuter. 

Benekli23:50P21I2001



<->Has anybody successfully built a multi-language dynamic website - where the
<->page data, etc, is kept within the database? How did you display the
<->languages on screen, in the database, etc..

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Structure your ColdFusion code with Fusebox. Get the official book at 
http://www.fusionauthority.com/bkinfo.cfm

Archives: http://www.mail-archive.com/[email protected]/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists

Reply via email to