Having the \D+ I can see it not working because it will stop at 3 in 3000. I
tried a couple of other things, but was unable to get it working. One of the regex
gurus hopefully will respond.
Sorry for not more than \D+
Wags ;)
-----Original Message-----
From: Peter Cline [mailto:[EMAIL PROTECTED]]
Sent: Friday, January 04, 2002 12:49
To: [EMAIL PROTECTED]
Subject: Strange (from my perspective) regex behavior
I am trying to extraxt some text from a file using a regular
expression. It is not behaving as expected and am totally perplexed as to why.
Here is an excerpt of the text
1. Top Story: Dynegy in Agreement to Get Enron Pipeline
2. M&A: Newmont-Normandy, Hewlett-Compaq, Pax TV, WorldCom
3. Investment Banking: Goldman, Sandler, Merrill Lynch
4. I.P.O.s/Offerings: Sirius Satellite Radio, Neuer Markt
5. Venture Capital: Lucent-Coller Capital, EM.TV
6. Private Equity: HSBC, Canada 3000, Edel Music, Kumho Tire
7. Legal: GE Capital Aviation, EchoStar-DirecTV
8. Correction: Daily Deal Echostar-DirecTV Article
/------------------advertisement--------------\
I want to extract the numbered list.
here is the regex I am using to do it:
m!((\d\.\s\D+)+)/[-]+advertisement!
For some reason this starts matching at number 7. If I eliminate
everything after / the regex matches from 1 to the / in item 4.
I am totally perplexed as to why this is happening. If anyone has insite,
I would be most appreciative.
Thanks
Peter Cline
Inet Developer
New York Times Digital
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]