Best tool to use is the parsechecker, it is a quick neat way to see whether your protocol/fetch/authentication is working then whether your parser is extracting the text and metadata you require.
On Wed, Sep 19, 2012 at 8:30 PM, Max Dzyuba <[email protected]> wrote: > Hi Lewis, > > I used that website as an example. I don't specify the exact website that I > was using. I'm 100% sure that my website requires authentication and the > credentials I provide are verified too. So there is something I'm missing in > trying to make it work. > > Please help. > > > > > Best regards, > MaxLewis John Mcgibbney <[email protected]> wrote:Hi, > > On Wed, Sep 19, 2012 at 3:37 PM, Max Dzyuba <[email protected]> wrote: > >> >> 2012-09-19 16:26:16,106 INFO httpclient.HttpMethodDirector - No credentials >> available for BASIC 'realm'@host.org:80 >> >> >> >> I don't understand why Nutch complains about "No credentials available for >> BASIC 'realm'@host.org:80" since I've set up the default credentials which >> should be used for any page that asks for authentication. >> > > If I follow the above link I get a popup box saying that the site does > not require authentication credentials and that it is trying to trick > me. > > Are you sure its not just this site and that another solution is required? > > Lewis -- Lewis

