[Nutch Wiki] Trivial Update of RunningNutchAndSolr by NickTkach

2008-04-14 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Nutch Wiki for change 
notification.

The following page has been changed by NickTkach:
http://wiki.apache.org/nutch/RunningNutchAndSolr

The comment on the change is:
Changed fields for copyField line to correct values

--
 * Add the fields that Nutch needs (url, content, segment, digest, host, 
site, anchor, title, tstamp, text--see 
[http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html 
FooFactory Article on Nutch + Solr])
 * Change defaultSearchField to 'text'
 * Change defaultOperator to 'AND'
-* Add lines to copyField section to copy cat  name into the text field
+* Add lines to copyField section to copy anchor, title, and content into 
the text field
   1. Start the Solr you just made (cd /tmp/mysolr; java -jar start.jar)
   1. Run a Nutch crawl using the bin/crawl.sh script.
  


[Nutch Wiki] Trivial Update of RunningNutchAndSolr by NickTkach

2008-04-04 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Nutch Wiki for change 
notification.

The following page has been changed by NickTkach:
http://wiki.apache.org/nutch/RunningNutchAndSolr

The comment on the change is:
Corrected line 3 of instructions (should have been nutch-trunk)

--
  
   1. Check out solr-trunk and nutch-trunk
   1. Go into the solr-trunk and run 'ant dist dist-solrj'
-  1. Get zip from [http://variogram.com/latest/SolrIndexer.zip  Variogr.am] 
and unzip it to solr-trunk
+  1. Get zip from [http://variogram.com/latest/SolrIndexer.zip  Variogr.am] 
and unzip it to nutch-trunk.
   1. Copy apache-solr-solrj-1.3-dev.jar and apache-solr-common-1.3-dev.jar to 
nutch-trunk/lib
   1. Get the zip file from 
[http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html  
FooFactory] for SOLR-20
   1. Unzip solr-client.zip somewhere, go into java/solr/src and run 'ant'


[Nutch Wiki] Trivial Update of RunningNutchAndSolr by NickTkach

2008-04-03 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Nutch Wiki for change 
notification.

The following page has been changed by NickTkach:
http://wiki.apache.org/nutch/RunningNutchAndSolr

--
  
  If you watch the output from your Solr instance (logs) you should see a bunch 
of messages scroll by when Nutch finishes crawling and posts new documents.  If 
not, then you've got something not configured right.  I'll try to add more 
notes here as people have questions/issues.
  
+ '''Troubleshooting:'''
+  * If you get errors about Type mismatch in value from map: (expected 
ObjectWritable, but received NutchWritable), then you likely are missing the 
two steps I just added in step 9 above.  Sorry about that, I forgot about 
making the change there in SolrIndexer.
  ---
  I did everything but i got this error any idea??