Hi Erick,
Ah, yes, I guess you are correct that I could just avoid using cursorMark
this way. The only (smallish, I think) issue is that I would need to
extract the last id from the CSV output. Oh, and I am using DataStax
DSE, so the uniqueKey is a combination of two fields...but I think I can ma
The cursorMark stuff has to deal with shards: what happens when more
than one document on different shards has the same sort value, what
if all the docs in the response packet have the same sort value, what
happens when you want to return docs by score, and the like.
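
To make the "extract the last id from the CSV output" step concrete, here is a rough sketch, not a tested export script. It assumes a single string uniqueKey field named "id" (a composite key such as the DSE one would need extra handling), that the wt=csv response includes its header row, and that the next chunk is requested with a sort on id plus an exclusive range filter. The function names are made up for illustration.

```python
import csv
import io


def last_id_from_csv(csv_text, id_field="id"):
    """Return the id of the final row of a wt=csv chunk, or None if it has no rows."""
    reader = csv.DictReader(io.StringIO(csv_text))
    last = None
    for row in reader:
        last = row[id_field]
    return last


def next_chunk_params(last_id, rows=1000, id_field="id"):
    """Build request parameters for the chunk that follows last_id.

    Assumption: documents are sorted by the id field, so an exclusive
    lower bound on that field skips everything already exported.
    """
    params = {
        "q": "*:*",
        "wt": "csv",
        "rows": str(rows),
        "sort": f"{id_field} asc",
    }
    if last_id is not None:
        # Quote the value and escape backslashes and quotes inside it.
        escaped = last_id.replace("\\", "\\\\").replace('"', '\\"')
        params["fq"] = f'{id_field}:{{"{escaped}" TO *]'
    return params
```

A driver loop would fetch a chunk with these parameters, feed it to the indexer, call last_id_from_csv on it, and stop once that returns None.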
For your case you can use a sor
Hi,
I need to index 800M docs that exist in an older Solr into a new schema.
As all fields are stored, I thought I was very lucky as I could:
- use wt=csv
- combined with cursorMark
to easily script something that would export/index in chunks of 1M docs
or so. CSV output being very e