Hi,
On Thu, Feb 23, 2012 at 7:27 PM, Christopher Gross cogr...@gmail.com wrote:
Unless -- is 1.2 able to crawl https sites? If it can't do that, then
I may have to upgrade.
Yes, you should be able to crawl https sites; however, I'm not overly
familiar with the protocol-httpclient plugin.
If
I have my Solr set up on a secure port -- and I think that is causing
a problem for Nutch (nothing else changed). I don't see anything in
the documentation regarding this.
My nutch version is 1.2, Solr is 3.4. Here's the line from my runbot.sh script:
$NUTCH_HOME/bin/nutch solrindex
Meant to include this...the output from the runbot.sh script. Not
that it really says a whole lot...
- Index (Step 5 of 8) -
SolrIndexer: starting at 2012-02-23 18:18:20
java.io.IOException: Job failed!
-- Chris
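The "Job failed!" line above is only the generic wrapper; the underlying exception is usually written to the Nutch log. A minimal sketch for checking whether the failure is SSL-related, assuming Nutch runs in local mode and logs to $NUTCH_HOME/logs/hadoop.log (the function name find_ssl_errors and the search patterns are illustrative, not part of Nutch):

```shell
#!/bin/sh
# Assumption: the log lives at $NUTCH_HOME/logs/hadoop.log (local-mode default);
# adjust LOG if your installation writes it elsewhere.

# find_ssl_errors LOGFILE
# Print (with line numbers) log lines that mention SSL or certificate problems.
# The patterns are a guess at typical Java SSL failures, not an exhaustive list.
find_ssl_errors() {
    grep -n -i -E 'SSLHandshakeException|SSLException|PKIX path|certificate' "$1"
}

LOG="${NUTCH_HOME:-.}/logs/hadoop.log"
if [ -f "$LOG" ]; then
    find_ssl_errors "$LOG" || echo "No SSL-related lines found in $LOG"
else
    echo "Log file not found: $LOG"
fi
```

If matching lines turn up, the JVM running Nutch probably does not trust the certificate on the Solr port; if none do, the last stack trace in the same log is the next place to look.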
On Thu, Feb 23, 2012 at 1:26 PM, Christopher Gross cogr...@gmail.com wrote:
Yeah, I can confirm it was 1.4.
On Thu, Feb 23, 2012 at 7:05 PM, Christopher Gross cogr...@gmail.com wrote:
I tried using 1.4, but I couldn't get that to work at all.
What is wrong with your configuration? If this is all that is preventing
you from migrating to 1.4, I would rather get it sorted.
I was getting it to do parts of the crawl, but it was not pushing the
data to Solr (that was before I moved it to https). I had worked on
that for two weeks, and was frustrated and needed to make progress
with other parts of the project, so I bailed on the newer nutch and
just rolled with 1.2,