Re: Nutch data to Solr on HTTPS

2012-02-24 Thread Lewis John Mcgibbney
Hi, On Thu, Feb 23, 2012 at 7:27 PM, Christopher Gross cogr...@gmail.comwrote: Unless -- is 1.2 able to crawl https sites? If it can't do that then I may have to upgrade You should be able to get https sites yes, however I'm not overly familiar with the protocol-httpclient plugin. If

Nutch data to Solr on HTTPS

2012-02-23 Thread Christopher Gross
I have my Solr set up on a secure port -- and I think that is causing a problem for nutch (nothing else changed.) I don't see anything in the documentation regarding this. My nutch version is 1.2, Solr is 3.4. Here's the line from my runbot.sh script: $NUTCH_HOME/bin/nutch solrindex

Re: Nutch data to Solr on HTTPS

2012-02-23 Thread Christopher Gross
Meant to include this...the output from the runbot.sh script. Not that it really says a whole lot... - Index (Step 5 of 8) - SolrIndexer: starting at 2012-02-23 18:18:20 java.io.IOException: Job failed! -- Chris On Thu, Feb 23, 2012 at 1:26 PM, Christopher Gross cogr...@gmail.com

Re: Nutch data to Solr on HTTPS

2012-02-23 Thread Lewis John Mcgibbney
Yeah I can confirm it was 1.4 On Thu, Feb 23, 2012 at 7:05 PM, Christopher Gross cogr...@gmail.comwrote: I tried using 1.4, but I couldn't get that to work at all. What is wrong with your configuration, if this is all that is preventing you from migrating to 1.4 I would rather get it sorted

Re: Nutch data to Solr on HTTPS

2012-02-23 Thread Christopher Gross
I was getting it to do parts of the crawl, but it was not pushing the data to Solr (that was before I moved it to https). I had worked on that for two weeks, and was frustrated and needed to make progress with other parts of the project, so I bailed on the newer nutch and just rolled with 1.2,