On Tue, Jan 20, 2015 at 9:47 AM, Mike Martin <m...@redtux.org.uk> wrote: > Thanks for the idea about qr, I did try this before, but I've now relooked > at at it and got about 75% improvement. > > As regards the uninitialized point the errors were coming from regexes > (different ones) when the regex wasnt matching, so testing the result of > each regex match was not really an option. As an aside the source is really > horrible - job ad listings. > > Basically the idea is > > Take a load of Job Vacancy posts (xml files - loads of) > Parse the Information, getting rid of as much garbage as possible > Push a distinct list into a lookup hash > Do replace to this list against a long list of regexes > Spit out nicely formatted Clean Job Titles > > > Since qr was that significant, you don't have a static regex and there may well be other improvements. You might want to demo the list of regexes for the shooting gallery :)
-- Charles DeRykus -- To unsubscribe, e-mail: beginners-unsubscr...@perl.org For additional commands, e-mail: beginners-h...@perl.org http://learn.perl.org/