The meta forward or script forward may be pushing the relative URL and not the absolute URL, which would produce the "no protocol" error, since all links should start with http:// (or some other known protocol).
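To illustrate what I mean, here is a rough sketch, not the actual Nutch code: parsing the forward target on its own fails with the same "no protocol" message, while resolving it against the URL of the page it came from gives an absolute link. The class name ForwardResolver, the method resolve and the host www.example.com are made up for the example; only the target string comes from the stack trace.

```java
import java.net.MalformedURLException;
import java.net.URL;

// Hypothetical helper for illustration only; not part of Nutch.
final class ForwardResolver {

    // Resolve a forward target (relative or absolute) against the URL
    // of the page that contained it.
    static URL resolve(String pageUrl, String target) throws MalformedURLException {
        URL base = new URL(pageUrl);   // the page URL must already be absolute
        return new URL(base, target);  // an absolute target simply overrides the base
    }

    public static void main(String[] args) throws MalformedURLException {
        String target = "include.php?path=start.php"; // from the exception
        try {
            new URL(target); // no base URL given, so there is no protocol to use
        } catch (MalformedURLException e) {
            System.out.println("failed: " + e.getMessage());
        }
        // www.example.com is an assumed placeholder for the crawled site.
        System.out.println(resolve("http://www.example.com/index.php", target));
    }
}
```

If the fetcher hands the raw target to the URL parser without the page URL as context, that would explain the exception.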
I use the Java 1.5 beta release and it seems to work best. I also exclude everything but http: right now, since I ran into timeout issues with FTP and errors with https.

-byron

--- Stefan Groschupf <[EMAIL PROTECTED]> wrote:
>
> On 28.05.2004 at 01:10, Byron Miller wrote:
>
> > Is the forward in the meta tags? If so the crawler may
> > not be reading the forward and just grabbing the root
> > page as a document.
>
> no!
>
> >> java.net.MalformedURLException: no protocol:
> >> include.php?path=start.php
>
> As you can see from this exception, it has a problem with the URL
> string parsing, I would say.
>
> By the way, does your fetch process hang as well sometimes?
> Which JVM do you use?
>
> Stefan
>
> -------------------------------------------------------
> This SF.Net email is sponsored by: Oracle 10g
> Get certified on the hottest thing ever to hit the
> market... Oracle 10g.
> Take an Oracle 10g class now, and we'll give you the
> exam FREE.
> http://ads.osdn.com/?ad_id=3149&alloc_id=8166&op=click
> _______________________________________________
> Nutch-developers mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
