RE: Nutch not crawling all URLs

2022-01-12 Thread Roseline Antai
Hi Sebastian, Thank you. I did enjoy the holiday. Hope you did too. I have had a look at the protocol-selenium plugin, but it was a bit difficult to understand. It appears it only works with Firefox. Does it work at all with Chrome? I was also not sure of what values to set for the properties.

Re: Nutch not crawling all URLs

2022-01-12 Thread Sebastian Nagel
Hi Roseline, > the mail below went to my junk folder and I didn't see it. No problem. I hope you nevertheless enjoyed the holidays. Sorry for any delays, but I want to emphasize that Nutch is a community project and, when in doubt, it might take a few days until somebody finds the time to respond. >

RE: Nutch not crawling all URLs

2022-01-12 Thread Roseline Antai
Hi Sebastian, For some reason, the mail below went to my junk folder and I didn't see it. The notco page (https://notco.com/) was not indexed, no. When I enabled redirects, I was able to get a few pages, but they don't seem valid. Could you confirm if you received all the URLs I sent? Another