Right, I can create multiple collections no problem... but, what I'd really
love is to put them into the same collection, just adding a field like
"crawl_name" to the index.

Any way I can do that?

Thanks!


On Wed, Apr 8, 2015 at 9:15 AM, Iain Lopata <[email protected]> wrote:

> Katrina,
>
> When you specify the solr instance as the third parameter to bin/crawl
> try  specifying the collection name in the path e.g.
> http://localhost:8080/solr/collection1
>
> Iain
>
> -----Original Message-----
> From: Katrina Riehl [mailto:[email protected]]
> Sent: Wednesday, April 8, 2015 8:51 AM
> To: [email protected]
> Subject: Adding field to Nutch / Solr
>
> Hello,
>
> I am new to using Nutch.  I'm developing an application that crawls
> websites, and then indexes information about those websites into a Solr
> instance.  The problem is, it's putting all the crawled documents into the
> same Solr collection.
>
> Is there a way for me to add a field specifying which crawl the index came
> from?  Is there a command line option I can add when I start the crawl?
>
> Thank you so much for your help.
>
> --
> Katrina Riehl
> Continuum Analytics
> [email protected]
>
>

Reply via email to