I want to extract the daily political cartoon out of this page: http://search.csmonitor.com/commentary/index.html
I have used Lauren's JXL for extracting images (http://jpluck.sourceforge.net/examples/extract-images.xsl), and have successfully generated a page of images from this site. However my understanding of this xsl is that it looks at the original html and copies only the image containers to the output document.
Could anyone tell me the regexp to use (and how to use it), so that it only copies image containers that match the string "cartoon.jpg" ?
Also, for anyone wanting to learn more about JXL structures, this is a great place:
http://www.w3schools.com/xsl/xsl_functions.asp
Cheers, Stephen
_______________________________________________ plucker-list mailing list [EMAIL PROTECTED] http://lists.rubberchicken.org/mailman/listinfo/plucker-list

