Same data set mean that I separate original data into 2 parts & input them from 2 separate programs. E.g. a data set with id 1 - 100. Program A input id 1-50. Program B input 51 - 100.
2017-04-21 17:24 GMT+08:00 Andrey Mashenkov <andrey.mashen...@gmail.com>: > Hi Woo, > > DataStreamer is designed to fill cache with maximum throughput. By > default, streamer will not rewrite cache data, until allowOverwite option > is set. > > Why you need to input same set of data? Why do you expected data input > time will change significantly with 2 programs compared to 1 if data set is > put twice? > Or I missed smth? > > Anyway, if you do not get a speed up but you sure you should, then a > bottleneck have to be found at first. > > On Fri, Apr 21, 2017 at 5:02 AM, woo charles <ignite.charl...@gmail.com> > wrote: > >> If the data is buffered in client side, the bottleneck should be also in >> client side. >> If I use 2 programs to input same set data, it should be a significant >> change in data input time. >> Is it right? >> >> 2017-04-21 6:46 GMT+08:00 Dmitriy Setrakyan <dsetrak...@apache.org>: >> >>> >>> On Wed, Apr 19, 2017 at 10:16 PM, woo charles <ignite.charl...@gmail.com >>> > wrote: >>> >>>> When I call addData() in streamer. this data will send & buffer in >>>> server node. is that correct? >>>> If I correct, this data will buffer in random server node or only the >>>> one it directly connected? >>>> >>> >>> addData() will buffer the data on the client side. As a matter of fact, >>> there are multiple buffers on the client side, which each buffer associated >>> with some server node. >>> >>> Ignite will never send the data to a random node. The data is always >>> sent exactly to the node where it will be cached. >>> >>> D. >>> >> >> > > > -- > Best regards, > Andrey V. Mashenkov >