nutch-user  

Bypass Validation

karthik085
Mon, 14 Jul 2008 14:51:13 -0700

Hi.

I am trying to crawl a page using nutch. That page exists behinds a
validator (struts), i.e. In order to get to the page, a button needs to be
clicked. Is there anyway this can be bypassed so web crawler can get to the
page without clicking this button?

Code:
<form name="loginForm" method="post" action="/check.do">
      <input type="hidden" name="forward" value="target_page">
       <input type="submit" name="org.apache.struts.taglib.html.CANCEL"
value="Continue" onclick="bCancel=true;">
 </form>

Any help is appreciated. Thanks.
-- 
View this message in context: 
http://www.nabble.com/Bypass-Validation-tp18453973p18453973.html
Sent from the Nutch - User mailing list archive at Nabble.com.