[Nutch Wiki] Trivial Update of RunningNutchAndSolr by NickTkach
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification.

The following page has been changed by NickTkach:
http://wiki.apache.org/nutch/RunningNutchAndSolr

The comment on the change is:
Changed fields for copyField line to correct values

--
  * Add the fields that Nutch needs (url, content, segment, digest, host, site, anchor, title, tstamp, text--see [http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html FooFactory Article on Nutch + Solr])
  * Change defaultSearchField to 'text'
  * Change defaultOperator to 'AND'
- * Add lines to copyField section to copy cat name into the text field
+ * Add lines to copyField section to copy anchor, title, and content into the text field
  1. Start the Solr you just made (cd /tmp/mysolr; java -jar start.jar)
  1. Run a Nutch crawl using the bin/crawl.sh script.
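For readers following the schema changes listed in this notification, here is a rough sketch of what the relevant parts of Solr's schema.xml might look like afterwards. The element names (field, copyField, defaultSearchField, solrQueryParser) are standard schema.xml elements, but the field types and the indexed/stored/multiValued attributes are assumptions for illustration and are not taken from the wiki page; only a few of the Nutch fields are shown.

```xml
<!-- Sketch only: types and attributes below are assumed, not from the wiki page. -->
<fields>
  <field name="url"     type="string" indexed="true" stored="true"/>
  <field name="anchor"  type="text"   indexed="true" stored="true" multiValued="true"/>
  <field name="title"   type="text"   indexed="true" stored="true"/>
  <field name="content" type="text"   indexed="true" stored="true"/>
  <!-- Catch-all field; assumed multiValued because several sources are copied into it. -->
  <field name="text"    type="text"   indexed="true" stored="false" multiValued="true"/>
</fields>

<!-- Search the catch-all field by default, and require all query terms. -->
<defaultSearchField>text</defaultSearchField>
<solrQueryParser defaultOperator="AND"/>

<!-- Copy anchor, title, and content into the text field. -->
<copyField source="anchor"  dest="text"/>
<copyField source="title"   dest="text"/>
<copyField source="content" dest="text"/>
```

The remaining fields from the list (segment, digest, host, site, tstamp) would be declared the same way, with types chosen to suit how they are searched.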
[Nutch Wiki] Trivial Update of RunningNutchAndSolr by NickTkach
The following page has been changed by NickTkach:
http://wiki.apache.org/nutch/RunningNutchAndSolr

The comment on the change is:
Corrected line 3 of instructions (should have been nutch-trunk)

--
   1. Check out solr-trunk and nutch-trunk
   1. Go into the solr-trunk and run 'ant dist dist-solrj'
-  1. Get zip from [http://variogram.com/latest/SolrIndexer.zip Variogr.am] and unzip it to solr-trunk
+  1. Get zip from [http://variogram.com/latest/SolrIndexer.zip Variogr.am] and unzip it to nutch-trunk.
   1. Copy apache-solr-solrj-1.3-dev.jar and apache-solr-common-1.3-dev.jar to nutch-trunk/lib
   1. Get the zip file from [http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html FooFactory] for SOLR-20
   1. Unzip solr-client.zip somewhere, go into java/solr/src and run 'ant'
[Nutch Wiki] Trivial Update of RunningNutchAndSolr by NickTkach
The following page has been changed by NickTkach:
http://wiki.apache.org/nutch/RunningNutchAndSolr

--
  If you watch the output from your Solr instance (logs) you should see a bunch of messages scroll by when Nutch finishes crawling and posts new documents. If not, then you've got something not configured right. I'll try to add more notes here as people have questions/issues.
+ '''Troubleshooting:'''
+  * If you get errors about Type mismatch in value from map: (expected ObjectWritable, but received NutchWritable), then you likely are missing the two steps I just added in step 9 above. Sorry about that, I forgot about making the change there in SolrIndexer.
  --- I did everything but i got this error any idea??