Redirect exceeded

2015-03-20 Thread Roannel Fernandez Hernandez
Hello, everyone! I'm trying to fetch a page that has been moved temporarily or permanently, but the fetch always fails with the error: Redirect exceeded. The page redirects only once, and I have set the maximum number of redirects to 100 in the settings. The Fetcher tries to fetch the same URL every time. I'm using

Re: [MASSMAIL]Re: Direct Nutch crawler to use different SOLR index writer?

2019-03-02 Thread Roannel Fernandez Hernandez
Hi Dave and Ryan, Comments inline. Best - Original message - > From: "Ryan Suarez" > To: user@nutch.apache.org > Sent: Saturday, March 2, 2019 11:48:51 > Subject: [MASSMAIL]Re: Direct Nutch crawler to use different SOLR index writer? > # ./crawl -h > -D A Java property to pass to

Re: [MASSMAIL]Re: Configuring Nutch to work with Solr?

2019-03-02 Thread Roannel Fernandez Hernandez
Hi Dave, One important thing you should know here is that Nutch doesn't use the schema.xml file at all. This file is for reference only. You can modify it (according to the Solr documentation) on the Solr side without making any changes to the Nutch schema.xml. The only thing you really need to keep in mind

Re: [MASSMAIL]JEXL and Exchanges

2019-03-06 Thread Roannel Fernandez Hernandez
Glad to be useful. Regards - Original message - > From: "Dave Beckstrom" > To: "user" > Sent: Tuesday, March 5, 2019 11:06:35 > Subject: [MASSMAIL]JEXL and Exchanges > Ryan and Roannel, > > Thank you guys so much for your replies. I didn't realize it but I was not > seeing all of

Injection from webservice

2019-09-16 Thread Roannel Fernandez Hernandez
Hi folks, Is there any way in Nutch 1.15 to inject a remote seed file (accessible via http or https)? I mean something like this, for instance: bin/nutch inject crawl http://example.org/seed Regards 1519-2019: 500th Anniversary of the Villa de San Cristóbal de La Habana. For Havana, the greatest. #Habana500

Re: [MASSMAIL]Re: Injection from webservice

2019-09-16 Thread Roannel Fernandez Hernandez
pected format ( > https://github.com/apache/nutch/blob/master/src/java/org/apache/nutch/service/resources/SeedResource.java#L92-L113 > ) > > Best Regards, > Jorge > > On Mon, Sep 16, 2019 at 4:59 PM Roannel Fernandez Hernandez > wrote: > >> Hi folks, >> >>