This works, (with the dev trunk. if you're using the 0.9 release then there's a full tutorial on the nutch wiki) but it's not particularly refined. I'm assuming you've got a working nutch crawl or whatever already.
1. Get yourself a working solr test server running using the Getting Started section of the tutorial : http://lucene.apache.org/solr/tutorial.html 2. modify the solr schema using the one found at http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html(There's a missing field called boost with a type of int. There may well be a more up to date version of the schema somewhere which doesn't have this error) 3. run your inject/fetch (or crawl, which will index unnecessarily but hey ho, it's simple...) 4. run bin/nutch SolrInjector Sorry, that's kida vague. You'll have to work out what parameters the SolrInjector job wants for yourself, but if I can work it out then anyone can! (well OK. maybe not my mum but apart from her...) Andy 2009/3/6 Tony Wang <ivyt...@gmail.com> > Hi all, > > For those on the Solr user list who have already seen my question about > Nutch/Solr integration, I want to apologize for the redundant messages and > I > don't mean to spam the two mailing lists. I have been desperately seeking > information/documentation on making Nutch crawl for Solr indexing. > > So my question here is: Was anyone on the list able to make Solr use > Nutch's > crawl data? If yes, could you please share it with me and the rest of us? I > sincerely appreciate any input. > > Thanks in advance! > > Tony > > -- > Are you RCholic? www.RCholic.com > 温 良 恭 俭 让 仁 义 礼 智 信 >