Re: nutch adds %20 in urls instead of spaces

2024-01-09 Thread Jim Anderson
unsubscribe On Tue, Jan 9, 2024 at 1:20 PM Steve Cohen wrote: > Hello, > > I am updating a nutch crawl that read files in directories that have > spaces. The urls show %20 instead of spaces. This doesn't seem to be what > the behavior was in the past. > > In nutch 1.10 I get these results > >

Re: Nutch 1.17 download available?

2020-06-08 Thread Jim Anderson
Hi Lewis, Thanks for the information. I'm going to stick with Nutch 1.16 for now. Jim On Sun, Jun 7, 2020 at 6:11 PM Lewis John McGibbney wrote: > Hi Jim, > Response below > > On 2020/06/06 14:23:24, Jim Anderson wrote: > > > > I cannot find a download for Nutch 1.17.

Nutch 1.17 download available?

2020-06-06 Thread Jim Anderson
Hi, When I look at the URL https://cwiki.apache.org/confluence/display/nutch/NutchTutorial#NutchTutorial-UsingIndividualCommandsforWhole-WebCrawling and look at the section: Setup Solr for search I see: NutchSolr 1.17 8.5.1 I cannot find a download for Nutch 1.17. Is Nutch 1.17