Octavian Rasnita wrote:
Hi all,

I've made a spider that downloads all the web pages starting from a URL,
but I don't know how I could download the pages that are linked using
JavaScript.

If the link were something like <a
href="javascript:window.location='http://www.site.com/';">Site</a>
this would be simple to parse, but the problem is with JavaScript that
runs a function which opens a URL.

There is a Javascript.pm module which relies on the Mozilla JS engine. I'm
not sure how much it will help, but it might be useful if it works. To be
honest, I think you may have to search for things that look like web
addresses within your hrefs, i.e. you need to do additional processing on
the javascript: hrefs rather than just following them.
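
As a rough illustration of that "search for things that look like web addresses" idea, here is a minimal sketch in Python (the same regular expression should carry over to a Perl spider more or less unchanged). The names URL_IN_JS, JsHrefCollector and extract_js_urls are made up for this example, and the pattern assumes the script contains a quoted, absolute http:// or https:// address. It will not find URLs that a function assembles from pieces, or relative paths; for those you would really need to run the script in a JS engine.

```python
import re
from html.parser import HTMLParser

# Assumption for this sketch: the URL appears as a quoted, absolute
# http(s) address somewhere inside the javascript: href.
URL_IN_JS = re.compile(r"""["'](https?://[^"'\s]+)["']""")


class JsHrefCollector(HTMLParser):
    """Collects URL-looking strings from javascript: hrefs of <a> tags."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            # Only look at href values that start with the javascript: scheme;
            # ordinary links are left to the spider's normal link handling.
            if name == "href" and value and value.lower().startswith("javascript:"):
                self.urls.extend(URL_IN_JS.findall(value))


def extract_js_urls(html):
    """Return the quoted absolute URLs found in javascript: hrefs, in order."""
    parser = JsHrefCollector()
    parser.feed(html)
    return parser.urls
```

Each address returned by extract_js_urls() could then be added to the spider's queue in the same way as a normal link.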

Francis
--
Francis Turner, CIO Juelich Enzyme Products GmbH
[EMAIL PROTECTED] - http://www.biocatalysis.com
+49-611-962-4693(office) +49-173-291-7278(cell)

Enzymes are things invented by biologists that explain things which
otherwise require harder thinking. -- Jerome Lettvin
