Hello, everyone!
I'm trying to fetch a page that has been moved temporarily or permanently, but
I always get the error: Redirect exceeded. The page I'm fetching redirects
only once, and I have set the maximum to 100 in the settings. The Fetcher keeps
trying to fetch the same URL every time. I'm using
Hi Dave and Ryan
Comments inline
Best
- Original message -
> From: "Ryan Suarez"
> To: user@nutch.apache.org
> Sent: Saturday, March 2, 2019 11:48:51
> Subject: [MASSMAIL]Re: Direct Nutch crawler to use different SOLR index writer?
> # ./crawl -h
> -D A Java property to pass to
Hi Dave,
One important thing you should know here is that Nutch doesn't use the
schema.xml file at all. This file is for reference only. You can modify it
(according to the Solr documentation) on the Solr side without making any changes to Nutch's
schema.xml. The only thing you really need to keep in mind
Glad to be useful. Regards
- Original message -
> From: "Dave Beckstrom"
> To: "user"
> Sent: Tuesday, March 5, 2019 11:06:35
> Subject: [MASSMAIL]JEXL and Exchanges
> Ryan and Roannel,
>
> Thank you guys so much for your replies. I didn't realize it but I was not
> seeing all of
Hi folks,
Is there any way in Nutch 1.15 to inject a remote seed file (accessible via
http or https)?
I mean this, for instance:
bin/nutch inject crawl http://example.org/seed
Regards
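If the injector does not accept an http(s) location directly, one possible workaround is to download the seed list into a local directory first and then inject from that directory. The following is only a minimal sketch under that assumption: the helper names `download_seed_list` and `inject_command`, the `urls` directory, and the `seed.txt` file name are hypothetical and not part of Nutch; only the `bin/nutch inject <crawldb> <url_dir>` form comes from the command line itself.

```python
import pathlib
import urllib.request


def download_seed_list(seed_url, seed_dir, filename="seed.txt"):
    """Copy a remote seed list into a local directory (hypothetical helper).

    seed_url: location of the seed file, e.g. http://example.org/seed
    seed_dir: local directory that will later be passed to the injector
    """
    target_dir = pathlib.Path(seed_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / filename
    # urlopen handles http:// and https:// (and file:// for local testing)
    with urllib.request.urlopen(seed_url) as response:
        target.write_bytes(response.read())
    return target


def inject_command(crawldb, seed_dir):
    """Build the injector call as an argument list (hypothetical helper)."""
    return ["bin/nutch", "inject", crawldb, seed_dir]


# Example use, run from the Nutch runtime directory (assumed layout):
#   import subprocess
#   download_seed_list("http://example.org/seed", "urls")
#   subprocess.run(inject_command("crawl", "urls"), check=True)
```

Keeping the download separate from the inject call means the seed file can be inspected, or the download retried, before anything is written to the crawldb.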
expected format (
> https://github.com/apache/nutch/blob/master/src/java/org/apache/nutch/service/resources/SeedResource.java#L92-L113
> )
>
> Best Regards,
> Jorge
>
> On Mon, Sep 16, 2019 at 4:59 PM Roannel Fernandez Hernandez
> wrote:
>
>> Hi folks,
>>