I have some users who enter data into my web application through one of
two ways:
 
- copy/paste from microsoft word
- XML export from InDesign UTF-16
- XML export from Quark
 
In all 3 of the cases I've described above, the orign software is
putting through characters that do not display correctly on the web.
 
The problem I'm having is that some of the characters such as an
ellipsis mark or hyphen. When I run into these characters, they display
as the wrong character... sometimes a question mark. Othertimes a square
box... yet other times sequences of characters that are just totally
crazy.
 
My basic understanding of character encoding tells me that I want to
reduce all of the characters down to ASCII. I do not know a good way to
do this.
 
How can I accept text from each of the above mentioned sources, perhaps
others, and somehow *normalize* all of the character data into a set of
characters that will display properly on my page every time?
 
Thank you,
Jon


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
Introducing the Fusion Authority Quarterly Update. 80 pages of hard-hitting,
up-to-date ColdFusion information by your peers, delivered to your door four 
times a year.
http://www.fusionauthority.com/quarterly

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:253259
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: 
http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4

Reply via email to