On Wed, 2006-08-09 at 02:57 -0400, Robert Cummings wrote:
> On Wed, 2006-08-09 at 15:30 +0900, Dave M G wrote:
> > PHP List,
> >
> > This regular expression stuff is way tricky.
> >
> > Thanks to help from this list, I have an expression that will select the
> > first word of a string, up to the first white space:
> > "#^(.*)\s#iU"
> >
> > But after some consideration, I realized that I wanted to keep both
> > parts of the original text. The first word, and then everything that
> > came after it, should be divided and stored in separate variables.
> >
> > I looked through the php.net manual, and found preg_split().
> >
> > At first I thought using my original expression would work, if I
> > included the PREG_SPLIT_DELIM_CAPTURE parameter. It would pull out the
> > first word, but also keep it.
> >
> > Turns out that's not true. It removes the first word and the space, and
> > looks for something before and after and only finds something after.
> > Thus, in my current situation, it returns only one variable, which
> > contains everything after the first space.
> >
> > I realized what I need to do in this case is not select everything up to
> > the first space, but to find the first space without selecting what
> > comes before it. So the expression I had wasn't suitable for preg_split().
> >
> > So then, what is the right expression?
> >
> > Looking back over previous discussion and resources about regular
> > expression syntax, I thought I had to say:
> > Start at the beginning: ^
> > Ignore anything that isn't a space: [^\s]
> > Select the space character: \s
> > Be case insensitive and not greedy: iU
> >
> > Thus, my expression should be (using hash marks, #, as delimiters):
> > #^[^\s]|s#iU
> >
> > More specifically, my preg_split() syntax is:
> > $parts = preg_split("#^[^\s]|s#iU", $word, PREG_SPLIT_DELIM_CAPTURE);
> >
> > But it returns a two element array, where the first element is empty,
> > and the second element is the whole original string.
> >
> > Where did I go wrong this time?
>
> Use preg_match() and pay special attention to the manual as it refers to
> the third parameter :) The expression you need follows:
>
> "#^([^\\s]*)\\s(.*)$#U"
>
> You don't need the insensitive modifier btw since you aren't actually
> matching anything that is applicable.
Also you don't need the ungreedy modifier either since you can't
overshoot with the above expression.
Cheers,
Rob.
--
.------------------------------------------------------------.
| InterJinn Application Framework - http://www.interjinn.com |
:------------------------------------------------------------:
| An application and templating framework for PHP. Boasting |
| a powerful, scalable system for accessing system services |
| such as forms, properties, sessions, and caches. InterJinn |
| also provides an extremely flexible architecture for |
| creating re-usable components quickly and easily. |
`------------------------------------------------------------'
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php