Hi.

I am trying to crawl a page using nutch. That page exists behinds a
validator (struts), i.e. In order to get to the page, a button needs to be
clicked. Is there anyway this can be bypassed so web crawler can get to the
page without clicking this button?

Code:
<form name="loginForm" method="post" action="/check.do">
      <input type="hidden" name="forward" value="target_page">
       <input type="submit" name="org.apache.struts.taglib.html.CANCEL"
value="Continue" onclick="bCancel=true;">
 </form>

Any help is appreciated. Thanks.
-- 
View this message in context: 
http://www.nabble.com/Bypass-Validation-tp18453973p18453973.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to