On Tue, Jan 20, 2015 at 9:47 AM, Mike Martin <m...@redtux.org.uk> wrote:
> Thanks for the idea about qr, I did try this before, but I've now relooked
> at at it and got about 75% improvement.
>
> As regards the uninitialized point the errors were coming from regexes
> (different ones) when the regex wasnt matching, so testing the result of
> each regex match was not really an option. As an aside the source is really
> horrible - job ad listings.
>
> Basically the idea is
>
> Take a load of Job Vacancy posts (xml files - loads of)
> Parse the Information, getting rid of as much garbage as possible
> Push a distinct list into a lookup hash
> Do replace to this list against a long list of regexes
> Spit out nicely formatted Clean Job Titles
>
>
>
 Since qr was that significant, you don't have a static regex and there may well
be other improvements.  You might want to demo the list of regexes for the
shooting gallery :)

-- 
Charles DeRykus

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/


Reply via email to