Well, NTLM is a windows thing with a username and password. I have a certificate. No username/password. The debug stuff would be helpful once I can get a bit farther...I don't know how to tell Nutch to crawl with the cert. I'm getting a 403 error -- it is not (using? finding?) the certs that I have passed in via -D arguments.
I appreciate you trying to help -- but I need knowledge on getting Nutch to use a cert. -- Chris On Wed, Mar 7, 2012 at 4:14 PM, remi tassing <[email protected]> wrote: > There are many debugging tips on the bottom of that page, did you try them? > > E.g. ParserChecker, debug-level log info, ... > > BTW, which authentication scheme is required by your site? For NTLMv2 is > poorly supported > > Remi > > On Wednesday, March 7, 2012, Christopher Gross <[email protected]> wrote: >> I have protocol-httpclient set. >> >> I can't see how I'm supposed to do the certs. I can't seem to get >> them to work by passing them in via -D args when I call the nutch >> script (-Djavax.net.ssl.trustStore=xxxx >> -Djavax.net.ssl.trustStorePassword=xxxxx ...etc). Is there something >> for them in the AuthenticationSchemes >> (http://wiki.apache.org/nutch/HttpAuthenticationSchemes) that is not >> shown on the page? >> >> If you have a specific page that could help please send that. >> >> -- Chris >> >> >> >> On Wed, Mar 7, 2012 at 3:40 PM, remi tassing <[email protected]> > wrote: >>> Try googling for Nutch+httpclient >>> >>> Remi >>> >>> On Wednesday, March 7, 2012, Christopher Gross <[email protected]> wrote: >>>> Is there any good documentation for setting up Nutch to crawl HTTPS >>>> sites using a certificate? I've poked around on the wiki and tried >>>> some google searches without much luck. >>>> >>>> I'm using Nutch 1.4. >>>> >>>> Thanks! >>>> >>>> -- Chris >>>> >>

