Re: regex alternation problem

Tim Chase Fri, 17 Apr 2009 15:06:37 -0700

s1 = "I am an american"

s2 = "I am american an "


for s in [s1, s2]:
    print re.findall(" (am|an) ", s)

# Results:
# ['am']
# ['am', 'an']

-------

I want the results to be the same for each string.  What am I doing
wrong?

In your first case, the regexp is consuming the " am " (fourcharacters, two of which are spaces), leaving no leading spacefor the second one to find. You might try using \b as aword-boundary:


  re.findall(r"\b(am|an)\b", s)

-tkc




--
http://mail.python.org/mailman/listinfo/python-list

Re: regex alternation problem

Reply via email to