Re: [PHP] regex - filtering out chinese utf8 characters

2009-07-30 Thread Jim Lucas
Merlin Morgenstern wrote:
> Hi there,
> 
> I am trying to filter out content that is not ASCII. Can I do this with
> regex? For example:
> 
> $regex = '[AZ][09]';
> if (preg_match($regex, $text)) {
>     return TRUE;
> }
> else {
>     return FALSE;
> }
> 
> The reason I need to do this is that I am doing a MySQL query with the
> text and I need to make sure it is not UTF-8. Otherwise I get the
> following error:
> 
> Error: Illegal mix of collations (latin1_swedish_ci,IMPLICIT)
> and (utf8_general_ci,COERCIBLE) for operation '='
> 
> I am new to regex and would be happy for a jump start to get this fixed.
> 
> Best regards, Merlin
> 

You might want to read up on iconv. I think it will do what you want.

http://us2.php.net/manual/en/book.iconv.php

specifically...

http://us2.php.net/manual/en/function.iconv.php
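
As a rough sketch of the iconv route, assuming the goal is text that fits
the latin1 column named in the error message: the helper name
utf8_to_latin1_or_null() is made up for illustration, and the nullable
return type assumes PHP 7.1 or later. Only iconv() itself is a PHP
built-in. Whether this alone resolves the collation error depends on the
connection and table settings, which the thread does not show.

```php
<?php
// Hypothetical helper (not from the thread): convert UTF-8 text to
// ISO-8859-1 (latin1), returning null when that is not possible
// without losing characters.
function utf8_to_latin1_or_null(string $text): ?string
{
    // '@' silences the notice iconv() raises on unconvertible input.
    $latin1 = @iconv('UTF-8', 'ISO-8859-1', $text);
    if ($latin1 === false) {
        return null;
    }

    // Round-trip check: some iconv implementations substitute a
    // placeholder character instead of failing, so compare the
    // converted-back text with the input.
    $back = @iconv('ISO-8859-1', 'UTF-8', $latin1);

    return $back === $text ? $latin1 : null;
}
```

A null result means the text contains characters (Chinese, for example)
that latin1 cannot hold, so the caller can reject or handle it before the
query is built.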


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php



Re: [PHP] regex - filtering out chinese utf8 characters

2009-07-30 Thread Daniel Kolbo
You have probably already been here:
http://www.regular-expressions.info/

But if not, that site is certainly useful for all things regex.

Sorry I can't be of more help for your specific question.

dK



Re: [PHP] regex - filtering out chinese utf8 characters

2009-07-30 Thread Stuart Connolly

Hi Merlin,

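I think the pattern you're looking for is '/[a-zA-Z0-9]/', which matches
any single alphanumeric character. Note that preg_match() needs the
delimiters and the hyphens, and that without anchors the pattern matches
as soon as one such character appears anywhere in the text.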
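
For the "is it all ASCII?" check itself, a negated byte range may be closer
to what you need. This is only a sketch: text_is_ascii() and
strip_non_ascii() are names invented here, and the only PHP built-ins used
are preg_match() and preg_replace(). It assumes "ASCII" means bytes 0x00
to 0x7F.

```php
<?php
// Hypothetical helpers (not from the thread).

// True when $text contains nothing outside the 7-bit ASCII range.
function text_is_ascii(string $text): bool
{
    // The pattern looks for any single byte outside 0x00-0x7F;
    // preg_match() returns 0 when no such byte is found.
    return preg_match('/[^\x00-\x7F]/', $text) === 0;
}

// Remove every non-ASCII byte from $text.
function strip_non_ascii(string $text): string
{
    // preg_replace() returns null only on a regex error; fall back to ''.
    return preg_replace('/[^\x00-\x7F]/', '', $text) ?? '';
}
```

Since every byte of a multi-byte UTF-8 character is above 0x7F, Chinese
text makes text_is_ascii() return false, and strip_non_ascii() drops those
characters whole.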


Cheers

Stuart








[PHP] regex - filtering out chinese utf8 characters

2009-07-30 Thread Merlin Morgenstern

Hi there,

I am trying to filter out content that is not ASCII. Can I do this with
regex? For example:

$regex = '[AZ][09]';
if (preg_match($regex, $text)) {
    return TRUE;
}
else {
    return FALSE;
}

The reason I need to do this is that I am doing a MySQL query with the
text and I need to make sure it is not UTF-8. Otherwise I get the
following error:

Error: Illegal mix of collations (latin1_swedish_ci,IMPLICIT) and
(utf8_general_ci,COERCIBLE) for operation '='


I am new to regex and would be happy for a jump start to get this fixed.

Best regards, Merlin
