this may also be an issue of protocolhttp-client. 

> Hi Remi,
> 
> This also looks like we need to document and address it.
> 
> Can you log a Jira issue and we will try to get on to it. Can you also have
> a look through some of the existing issues in case there is something
> similar, possibly relate them.
> 
> Thank you in advance
> 
> Lewis
> 
> On Tue, Jan 17, 2012 at 9:38 AM, remi tassing <tassingr...@gmail.com> wrote:
> > Hi,
> > 
> > The problem is really similar to this:
> > 
> > http://old.nabble.com/java.lang.IllegalArgumentException:-Invalid-uri-td2
> > 1856688.html
> > 
> > Unfortunately, I have no clue on what to update in Nutch ...
> > 
> > On Mon, Jan 16, 2012 at 4:41 PM, remi tassing <tassingr...@gmail.com>
> > 
> > wrote:
> > > Hello Markus,
> > > 
> > > thanks for the help!
> > > 
> > > Just to clarify a little bit. In my previous message, "uri1"
> > > represented
> > 
> > a
> > 
> > > normal, ordinary URL, I just didn't want to copy the exact URL.
> > > 
> > > The weird part is that it all works in the browser...
> > > 
> > > 
> > > On Mon, Jan 16, 2012 at 4:35 PM, Markus Jelsma <
> > 
> > markus.jel...@openindex.io
> > 
> > > > wrote:
> > >> This? https://uri1...&From=stats
> > >> 
> > >> That's not a correct or valid URL if you ask me.
> > >> 
> > >> On Monday 16 January 2012 15:12:51 remi tassing wrote:
> > >> > Hello ,
> > >> > 
> > >> > this is a snapshot of the log:
> > >> > 
> > >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> > >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> > >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> > >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> > >> > java.lang.IllegalArgumentException: Invalid uri
> > >> > 'https://uri1...&From=stats': Invalid query
> > >> > at
> > 
> > org.apache.commons.httpclient.HttpMethodBase.<init>(HttpMethodBase.java:2
> > 22
> > 
> > >> > ) at
> > 
> > org.apache.commons.httpclient.methods.GetMethod.<init>(GetMethod.java:89)
> > 
> > >> > at
> > 
> > 
org.apache.nutch.protocol.httpclient.HttpResponse.<init>(HttpResponse.java:
> > >> > 79) at
> > >> 
> > >> org.apache.nutch.protocol.httpclient.Http.getResponse(Http.java:154)
> > >> 
> > >> > at
> > 
> > org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.ja
> > va
> > 
> > >> > :224) at
> > >> > 
> > >> > org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:628)
> > >> 
> > >> fetch
> > >> 
> > >> > of https://uri1...&From=stats failed with:
> > >> > java.lang.IllegalArgumentException: Invalid uri
> > >> > 'https://uri1...&From=stats': Invalid query
> > >> > -activeThreads=10, spinWaiting=10, fetchQueues.totalSize=96
> > >> > -activeThreads=10, spinWaiting=10, fetchQueues.totalSize=96
> > >> > 
> > >> > On Mon, Jan 16, 2012 at 4:05 PM, Markus Jelsma
> > >> > 
> > >> > <markus.jel...@openindex.io>wrote:
> > >> > > copy the stack trace please
> > >> > > 
> > >> > > On Monday 16 January 2012 14:58:46 remi tassing wrote:
> > >> > > > Hello all,
> > >> > > > 
> > >> > > > I'm getting "invalid uri" error with some link that have three
> > 
> > dots,
> > 
> > >> > > > i.e. "...". They work perfectly well in browsers (IE and Chrome)
> > >> 
> > >> but,
> > >> 
> > >> > > > apparently, not with Nutch.
> > >> > > > 
> > >> > > > Is this a known issue? Any idea on how to handle it?
> > >> > > > 
> > >> > > > Remi
> > >> > > 
> > >> > > --
> > >> > > Markus Jelsma - CTO - Openindex
> > >> 
> > >> --
> > >> Markus Jelsma - CTO - Openindex

Reply via email to