xml::xslt and regexes

Chris Cosner Tue, 09 Dec 2008 17:10:29 -0800

Question: What is the speediest tool to pull data from an xml feed thatwill only be a few hundred lines at most? Some regexes will be necessary.


Context:

I am playing with the google books data api. They provide a feed, whichyou can see an example of here:

http://code.google.com/apis/books/docs/gdata/developers_guide_protocol.html
(scroll about halfway down)

I can send search terms to the api and get back some information aboutthe first three results in Google Book Search to integrate with our ownsearch results. [Done] So in some cases the user may click through toGBS, and in others stay on our site. The GBS feed duplicates some tags,such as "dc:identifier" and the only way to distinguish them will bewith a regex on the contents, or by noting tag order.

With the CPAN module XML::XSLT I am able to transform this prettyrapidly. I tried using XML::Twig, but it seemed too slow for this purpose.


However, XML::XSLT does not support regexes.

So I expect that I'll just have to transform the text as far as possiblewith XML::XSLT and the use Perl directly to finish the job.



-Chris


--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/

xml::xslt and regexes

Reply via email to