Hi Remi,

This also looks like we need to document and address it.

Can you log a Jira issue and we will try to get on to it. Can you also have
a look through some of the existing issues in case there is something
similar, possibly relate them.

Thank you in advance

Lewis

On Tue, Jan 17, 2012 at 9:38 AM, remi tassing <tassingr...@gmail.com> wrote:

> Hi,
>
> The problem is really similar to this:
>
> http://old.nabble.com/java.lang.IllegalArgumentException:-Invalid-uri-td21856688.html
>
> Unfortunately, I have no clue on what to update in Nutch ...
>
> On Mon, Jan 16, 2012 at 4:41 PM, remi tassing <tassingr...@gmail.com>
> wrote:
>
> > Hello Markus,
> >
> > thanks for the help!
> >
> > Just to clarify a little bit. In my previous message, "uri1" represented
> a
> > normal, ordinary URL, I just didn't want to copy the exact URL.
> >
> > The weird part is that it all works in the browser...
> >
> >
> > On Mon, Jan 16, 2012 at 4:35 PM, Markus Jelsma <
> markus.jel...@openindex.io
> > > wrote:
> >
> >> This? https://uri1...&From=stats
> >>
> >> That's not a correct or valid URL if you ask me.
> >>
> >> On Monday 16 January 2012 15:12:51 remi tassing wrote:
> >> > Hello ,
> >> >
> >> > this is a snapshot of the log:
> >> >
> >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> >> > -activeThreads=10, spinWaiting=9, fetchQueues.totalSize=96
> >> > java.lang.IllegalArgumentException: Invalid uri
> >> > 'https://uri1...&From=stats': Invalid query
> >> > at
> >> >
> >>
> org.apache.commons.httpclient.HttpMethodBase.<init>(HttpMethodBase.java:222
> >> > ) at
> >> >
> >>
> org.apache.commons.httpclient.methods.GetMethod.<init>(GetMethod.java:89)
> >> > at
> >> >
> >>
> org.apache.nutch.protocol.httpclient.HttpResponse.<init>(HttpResponse.java:
> >> > 79) at
> >> org.apache.nutch.protocol.httpclient.Http.getResponse(Http.java:154)
> >> > at
> >> >
> >>
> org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java
> >> > :224) at
> >> > org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:628)
> >> fetch
> >> > of https://uri1...&From=stats failed with:
> >> > java.lang.IllegalArgumentException: Invalid uri
> >> > 'https://uri1...&From=stats': Invalid query
> >> > -activeThreads=10, spinWaiting=10, fetchQueues.totalSize=96
> >> > -activeThreads=10, spinWaiting=10, fetchQueues.totalSize=96
> >> >
> >> > On Mon, Jan 16, 2012 at 4:05 PM, Markus Jelsma
> >> >
> >> > <markus.jel...@openindex.io>wrote:
> >> > > copy the stack trace please
> >> > >
> >> > > On Monday 16 January 2012 14:58:46 remi tassing wrote:
> >> > > > Hello all,
> >> > > >
> >> > > > I'm getting "invalid uri" error with some link that have three
> dots,
> >> > > > i.e. "...". They work perfectly well in browsers (IE and Chrome)
> >> but,
> >> > > > apparently, not with Nutch.
> >> > > >
> >> > > > Is this a known issue? Any idea on how to handle it?
> >> > > >
> >> > > > Remi
> >> > >
> >> > > --
> >> > > Markus Jelsma - CTO - Openindex
> >>
> >> --
> >> Markus Jelsma - CTO - Openindex
> >>
> >
> >
>



-- 
*Lewis*

Reply via email to