Hi Aaron! There are two good options for data loading: using DataStreamer or IgniteCache.loadCache(...) <https://apacheignite.readme.io/docs/3rd-party-store#section-loadcache->. The second option is good when initial data is stored in some database.
If you worry about overhead on data rebalancing, you can start the cluster and start streaming data once all nodes are up. In this case records will appear at their final destination at once, without need to move to other nodes. Denis пн, 25 сент. 2017 г. в 14:31, [email protected] <[email protected]>: > hi All, > > If we have dozen of nodes to cache millions data from DB; > > When init, what's the best way to loading those data? we use the data > streamer to load data, while all our entry include a partition ID when > insert into DB. > > As the nodes are started one by one, if loading from one Node and then > re-balance this seems impossible & wasting. > > Not sure whether there any guideline or best practice/advice for such > scenario. > > Thanks for our time! > > > Regards > Aaron > ------------------------------ > [email protected] >
