Every point is OK except one: if there's no partitioning for solrj, how
could, say 1000 documents, distributed across the nodes?  One-by-one? What
will be the strategy?

No need to open a new issue, my patch does similar job w/o using
CloudSolrServer, but CommonsHttpSolrServer(s). I'll give a shot for your
patch.

Best


On Tue, Jul 9, 2013 at 1:34 PM, Markus Jelsma <[email protected]>wrote:

> Yes, it only takes URL's for your ensemble because that is how
> CloudSolrServer works and it is the best method of connecting to a Solr
> cloud from Java. As said, there is no partitioning at all (SolrJ document
> routing is not yet committed) but your Solr nodes'
> DistributedUpdateRequestProcessor does the redistribution of incoming
> documents. Documents are also not send over Zookeeper, CloudSolrServer only
> uses the Zookeeper ensemble to find all nodes of the cluster and
> distinguish between masters and slaves so documents are sent to masters
> only.
>
> Depending on what your patch exactly does you may need to open a new
> issue. If it's also about writing data to a SolrCloud cluster, NUTCH-1377
> via Zookeeper is the only proper way to go.
>
> Cheers
>
> -----Original message-----
> > From:Tuğcem Oral <[email protected]>
> > Sent: Tuesday 9th July 2013 12:29
> > To: [email protected]
> > Subject: Re: Indexing from nutch 1.6 to solr 4.3.1 cloud
> >
> > Markus,
> >
> > I checked yours, they're quite similar but yours only takes zookeeper
> > ensemble urls, mine looks for all solr urls for a cluster. How could you
> > partition the documents? Sending them over zookeeper is enough?
> >
> > BTW my patch is ready, how could suppose to attach it?
> >
> > Best
> >
> >
> > On Tue, Jul 9, 2013 at 1:11 PM, Markus Jelsma <
> [email protected]>wrote:
> >
> > > I attached a patch for support of CloudSolrServer and a Zookeeper
> > > ensemble. Use solr.zookeeper.hosts and solr.collection to enable it.
> Patch
> > > also required NUTCH-1486.
> > > https://issues.apache.org/jira/browse/NUTCH-1377
> > >
> > >
> > >
> > > -----Original message-----
> > > > From:Tuğcem Oral <[email protected]>
> > > > Sent: Tuesday 9th July 2013 9:31
> > > > To: [email protected]
> > > > Subject: Re: Indexing from nutch 1.6 to solr 4.3.1 cloud
> > > >
> > > > So your org.apache.nutch.indexer.solr.SolrIndexer utility is not
> working
> > > > from nutch 1.6 I suppose, that might be used from nutch 2.1. Because
> in
> > > 1.6
> > > > you cannot do such a thing, as multiple solr instances (so
> solrcloud) and
> > > > partitioning is not supported on that version.
> > > >
> > > >
> > > > On Tue, Jul 9, 2013 at 12:55 AM, <[email protected]> wrote:
> > > >
> > > > > I give only one url to solrindex command and solrcloud takes care
> of
> > > > >  partitioning. I do not use solrj and actually did not understand
> > > Markus's
> > > > > comments. I use solr.4.2.0 with cloud feature.
> > > > >
> > > > > Thanks.
> > > > > Alex.
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > > -----Original Message-----
> > > > > From: Tuğcem Oral <[email protected]>
> > > > > To: user <[email protected]>
> > > > > Sent: Mon, Jul 8, 2013 1:26 pm
> > > > > Subject: Indexing from nutch 1.6 to solr 4.3.1 cloud
> > > > >
> > > > >
> > > > > @alex, i dont understand how could you give multiple solr urls
> while
> > > > > indexing from 1.6. Because solrindex handles given solr url with a
> > > single
> > > > > SolrServer instance, dont use List<SolrServer>, and also as @Marcus
> > > said,
> > > > > solrj doesnt support partitioning. The phrase you used "indexing
> using
> > > with
> > > > > nutch 1.6 and 2.1" seems a bit confusing for me, which version of
> > > solrj and
> > > > > solr (cloud) you are using is important i suppose.
> > > > >
> > > > > @erol, I can upload the patch tomorrow and notify you about it,
> > > > >
> > > > > Best,
> > > > >
> > > > > Tugcem
> > > > >
> > > > > On Monday, July 8, 2013, eakarsu wrote:
> > > > >
> > > > > > Tugcem,
> > > > > >
> > > > > > Can you please send me patch also?
> > > > > > I would like to test it
> > > > > >
> > > > > > Thanks
> > > > > >
> > > > > > Erol Akarsu
> > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > View this message in context:
> > > > > >
> > > > >
> > >
> http://lucene.472066.n3.nabble.com/Indexing-from-nutch-1-6-to-solr-4-3-1-cloud-tp4075737p4076346.html
> > > > > > Sent from the Nutch - User mailing list archive at Nabble.com.
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > TO
> > > > >
> > > > >
> > > > >
> > > >
> > > >
> > > > --
> > > > TO
> > > >
> > >
> >
> >
> >
> > --
> > TO
> >
>



-- 
TO

Reply via email to