I think you have to hack the parse-html plugin. Look in DOMContentUtils.java
in getOutlinks.java. You'll probably have to look for targets that start with
"javascript:" and do some string replacing.

Howie

Hi,

 Anyone here know how to make Nutch read "<a href=javascript(aaa);>" as
http://www.myurl.com/one.php?id=aaa ?

Thanks in advance.

Marco





Reply via email to