Right, I can create multiple collections with no problem, but what I'd really love is to put everything into the same collection and just add a field like "crawl_name" to the index.
Any way I can do that?

Thanks!

On Wed, Apr 8, 2015 at 9:15 AM, Iain Lopata <[email protected]> wrote:

> Katrina,
>
> When you specify the solr instance as the third parameter to bin/crawl
> try specifying the collection name in the path e.g.
> http://localhost:8080/solr/collection1
>
> Iain
>
> -----Original Message-----
> From: Katrina Riehl [mailto:[email protected]]
> Sent: Wednesday, April 8, 2015 8:51 AM
> To: [email protected]
> Subject: Adding field to Nutch / Solr
>
> Hello,
>
> I am new to using Nutch. I'm developing an application that crawls
> websites, and then indexes information about those websites into a Solr
> instance. The problem is, it's putting all the crawled documents into the
> same Solr collection.
>
> Is there a way for me to add a field specifying which crawl the index came
> from? Is there a command line option I can add when I start the crawl?
>
> Thank you so much for your help.
>
> --
> Katrina Riehl
> Continuum Analytics
> [email protected]

