Thanks Shawan. My index meets Atomic update req, so I want to use DIH
because of its convenience
I am in a solr cloud with 5 shards (with a separate zookeeper
ensemble), so I will have to put 5 entity tags so that i can give 5
diff urls , one for each shard ?

thanks
neeraj


On Wed, Feb 24, 2016 at 7:34 PM, Shawn Heisey <apa...@elyograg.org> wrote:
> On 2/23/2016 11:10 PM, Neeraj Bhatt wrote:
>> Hello
>>
>> We have a solr cloud stored and indexed data of around 25 lakh documents
>> We recently moved to solr 5.4.1 but are unable to move our indexed
>> data. What approach we should follow
>>
>> 1. data import handler works in solr cloud ? what should we give in
>> url like  url="http://192.168.34.218:8080/solr/client_sku_shard1_replica3";
>> , this will have shard name, so all documents won't be imported
>
> SolrEntityProcessor in DIH will only work if your index meets the
> requirements for Atomic Updates.  Basically, every field must be stored,
> unless it is a copyField destination:
>
> https://wiki.apache.org/solr/Atomic_Updates#Caveats_and_Limitations
>
>> 2. direct copying of index will work ? There are some schema changes
>> like from solr.Int to solr.TrieInt etc
>
> If the schema uses different classes, you will not be able to use the
> old index directly.  The schema would need to be completely unchanged,
> but it sounds like your old schema is using classes that are no longer
> present in 5.x.
>
>> 3. write code to fetch from solr 4.10.2 and push into 5.4.1 this is
>> time consuming, though can be improved by using multithreading
>
> This has the same requirements as SolrEntityProcessor.
>
> A complete reindex in 5.x from the original data source would be the
> best option, but if your index meets the Atomic Update requirements, you
> could go with one of the options that you numbered 1 or 3.
>
> Thanks,
> Shawn
>

Reply via email to