Re: [Nutch-general] Re: Getting info about failed fetches (404, 500, HostNotFound, etc.)

2005-07-19 Thread ogjunk-nutch
Thanks Andrzej, That turned out to be the problem with plugins - I updated the Nutch jar, but didn't rebuild and redeploy plugins. What is the best way to build plugin Jars? To rebuild plugins, I did "ant package", which did everything and more (javadoc, war, etc.). There are 2 Ant build scrip

Re: [Nutch-general] Re: Getting info about failed fetches (404, 500, HostNotFound, etc.)

2005-07-19 Thread Andrzej Bialecki
[EMAIL PROTECTED] wrote: I tried Nutch from SVN, but I'm getting this error: Exception in thread "fetcher3" java.lang.NoSuchMethodError: org.apache.nutch.parse.ParseData.<init>(Ljava/lang/String;[Lorg/apache/nutch/parse/Outlink;Ljava/util/Properties;)V at org.apache.nutch.parse.html.HtmlParse

Re: [Nutch-general] Re: Getting info about failed fetches (404, 500, HostNotFound, etc.)

2005-07-18 Thread ogjunk-nutch
Hi,
--- Andrzej Bialecki <[EMAIL PROTECTED]> wrote:
> [EMAIL PROTECTED] wrote:
> > Hello,
> >
> > I'm using Nutch's Fetcher for my Simpy.com project, and one of the
> > things I'd like to do is detect broken links (any type of error - wrong
> > host name, 404, 500, 302, etc.). From what I can

Re: Getting info about failed fetches (404, 500, HostNotFound, etc.)

2005-07-18 Thread Andrzej Bialecki
[EMAIL PROTECTED] wrote: Hello, I'm using Nutch's Fetcher for my Simpy.com project, and one of the things I'd like to do is detect broken links (any type of error - wrong host name, 404, 500, 302, etc.). From what I can tell, only successful fetches (200s and maybe 301/302s that result in 200)

Getting info about failed fetches (404, 500, HostNotFound, etc.)

2005-07-18 Thread ogjunk-nutch
Hello, I'm using Nutch's Fetcher for my Simpy.com project, and one of the things I'd like to do is detect broken links (any type of error - wrong host name, 404, 500, 302, etc.). From what I can tell, only successful fetches (200s and maybe 301/302s that result in 200) end up being written to di

Re: Getting info about failed fetches (404, 500, HostNotFound, etc.)

2005-07-18 Thread Stefan Groschupf
Hi, On 18.07.2005 at 20:52, <[EMAIL PROTECTED]> wrote: I'm not too familiar with Nutch's plugin system, but can I write a plugin that plugs into those 2 methods? You can only plug a plug-in into an existing extension point. Since these methods do not provide any extension point yet you c