Richard, Adam, Barry, Dave, David,

Thank you all for your helpful advice regarding expressions.

I was able to combine all your advice, and made some additional discoveries along the way.

The winning expression is:
"#^(.*)\s#iU"

First, I discovered that sometimes the source text had an unexpected space character at the very beginning. So I realized that I needed to do trim() to ensure proper functionality of the expression.

Next, it seems it's better to use \s than an actual space, " ". It might be the case that both are okay, but I've had success with \s, so I decided to stop experimenting.

I tried, as Adam suggested, to use substr() instead of a regular expression. The syntax he provided may work as far as finding the right text to extract. But I can't be sure because it seems to have problems with the character encoding.

The word being extracted is in UTF-8 encoded Japanese. The regular expression seems to input and output it fine. But I couldn't find a way of using the substr() function without the text coming out as ASCII gibberish.

So I'm calling this one solved for me, as I have working code. Although if anyone believes that it could be more efficient or something, of course I'm all ears.

Thank you for all your time and advice.

--
Dave M G

--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to