Hi Remi,

This also looks like something we need to document and address.
Can you log a Jira issue? We will try to get on to it. Can you also have a
look through the existing issues in case there is something similar, and
relate them if so?

Thank you in advance

Lewis

On Tue, Jan 17, 2012 at 9:38 AM, remi tassing <tassingr...@gmail.com> wrote:

> Hi,
>
> The problem is really similar to this:
>
> http://old.nabble.com/java.lang.IllegalArgumentException:-Invalid-uri-td21856688.html
>
> Unfortunately, I have no clue on what to update in Nutch ...
>
> On Mon, Jan 16, 2012 at 4:41 PM, remi tassing <tassingr...@gmail.com> wrote:
>
> > Hello Markus,
> >
> > thanks for the help!
> >
> > Just to clarify a little bit. In my previous message, "uri1" represented a
> > normal, ordinary URL, I just didn't want to copy the exact URL.
> >
> > The weird part is that it all works in the browser...
> >
> >
> > On Mon, Jan 16, 2012 at 4:35 PM, Markus Jelsma <markus.jel...@openindex.io> wrote:
> >
> >> This? https://uri1...&From=stats
> >>
> >> That's not a correct or valid URL if you ask me.
> >>
> >> On Monday 16 January 2012 15:12:51 remi tassing wrote:
> >> > Hello ,
> >> >
> >> > this is a snapshot of the log:
> >> >
> >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> >> > java.lang.IllegalArgumentException: Invalid uri 'https://uri1...&From=stats': Invalid query
> >> >     at org.apache.commons.httpclient.HttpMethodBase.<init>(HttpMethodBase.java:222)
> >> >     at org.apache.commons.httpclient.methods.GetMethod.<init>(GetMethod.java:89)
> >> >     at org.apache.nutch.protocol.httpclient.HttpResponse.<init>(HttpResponse.java:79)
> >> >     at org.apache.nutch.protocol.httpclient.Http.getResponse(Http.java:154)
> >> >     at org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:224)
> >> >     at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:628)
> >> > fetch of https://uri1...&From=stats failed with: java.lang.IllegalArgumentException: Invalid uri 'https://uri1...&From=stats': Invalid query
> >> > -activeThreads=10, spinWaiting=10, fetchQueues.totalSize=96
> >> > -activeThreads=10, spinWaiting=10, fetchQueues.totalSize=96
> >> >
> >> > On Mon, Jan 16, 2012 at 4:05 PM, Markus Jelsma <markus.jel...@openindex.io> wrote:
> >> > > copy the stack trace please
> >> > >
> >> > > On Monday 16 January 2012 14:58:46 remi tassing wrote:
> >> > > > Hello all,
> >> > > >
> >> > > > I'm getting "invalid uri" error with some link that have three dots,
> >> > > > i.e. "...". They work perfectly well in browsers (IE and Chrome) but,
> >> > > > apparently, not with Nutch.
> >> > > >
> >> > > > Is this a known issue? Any idea on how to handle it?
> >> > > >
> >> > > > Remi
> >> > >
> >> > > --
> >> > > Markus Jelsma - CTO - Openindex
> >>
> >> --
> >> Markus Jelsma - CTO - Openindex
> >>
> >
>

--
*Lewis*