I think you have to hack the parse-html plugin. Look in DOMContentUtils.javain getOutlinks.java. You'll probably have to look for targets that start with
"javascript:" and do some string replacing.
Howie
Hi, Anyone here know how to make Nutch read "<a href=javascript(aaa);>" as http://www.myurl.com/one.php?id=aaa ? Thanks in advance. Marco
