I haven't tried that how-to but it should simply work as expected. If
you can inject, fetch, parse, update the db and invert links then
sending the data to Solr is exactly the same in Nutch 1.2 as it is
written in the guide. Those commands haven't changed.
Also you mention that you have been unable to run a search in Nutch 1.2
but you have trouble with Solr? It doesn't make sense. Are you indexing
your crawl in Nutch or in Solr? What error do you get in hadoop.log and
your Solr container's output?
On Tue, 26 Oct 2010 14:54:47 -0500, "Joe Rogoiyo" <[email protected]>
wrote:
It doesn't appear that the guide you're referring to will work with
Nutch
1.2, which has Solr integration. The guide refers to Nutch 1.0, and,
if I
understand correctly, Solr integration was implemented starting with
Nutch
1.1. I have been unable to run a search in Nutch 1.2, even though
I've
crawled a website successfully. Is it possible someone with more
knowledge
of Nutch 1.2/Solr integration could please post a How-To? I'm sure
many
would really appreciate it.
Joe
-----Original Message-----
From: Steve Cohen [mailto:[email protected]]
Sent: Tuesday, October 26, 2010 2:06 PM
To: [email protected]
Subject: Any changes to setting up solr with nutch 1.2?
Hello,
I am looking at the wiki page for running nutch and solr.
http://wiki.apache.org/nutch/RunningNutchAndSolr
I see this step:
*1.* Download Solr version 1.3.0 or
LucidWorks<http://wiki.apache.org/nutch/LucidWorks>for Solr from
Download page
and this step:
*5.* Configure Solr For the sake of simplicity we are going to use
the
example configuration of Solr as a base.
Do we still download a version of solr (presumably version 1.4 since
that is
what nutch 1.2 is using) and configure it?
Thanks,
Steve
--
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536600 / 06-50258350