On Sunday 21 December 2008 5:30:25 pm James Colannino wrote:
> Hey everyone.  I have a question.  I have a web scraper that grabs
> information from web pages that often contain characters such as vowels
> with umlots (I know I spelled that wrong.)
>
> The data is editable, so the characters show up unmodified in an
> editable text box.  However, when I try to import the data into a MySQL
> database, the first occurrence of such a character, along with the rest
> of the string, is truncated from the result.  Not all special characters
> cause the problem; vowels with macrons work, for example.
>
> I don't know if it's failing during the actual query or if the character
> is being filtered out at some earlier stage, but whatever the cause,
> it's not working.
>
> My question is, is there a way to replace these characters with their
> HTML equivalents?  For example, the a with an umlot over the top is
> ä in HTML, so before the query is made, and before the filtering on
> the string is done, I'd like to replace that special character with its
> HTML representation.  This allows the user to see the character while
> it's in its text box, yet at the same time allow it to be successfully
> imported into the database.
>
> I know about str_replace, but assuming it's the right function for the
> job, how would I go about representing these special characters in PHP
> so that it will understand what I'm trying to do?
>
> Thanks!
>
> James

You may find this useful:

http://www.garfieldtech.com/blog/unicode-8-vs-16

-- 
Larry Garfield
la...@garfieldtech.com

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to