hello Digambar, im newbie too, but according from http://predictionio.incubator. apache.org/deploy/monitoring/ how this help, but all my storage data are about 2.5 million which include events+items+user.
it's so incredible project, what kind a project you working on it? hehehe good luck! so many thanks, Yohan -Christian Yonathan S- On Tue, Aug 30, 2016 at 2:21 PM, Digambar Bhat <[email protected]> wrote: > Hello, > > I am using PredictionIO since last one year. It's working fine for me. > > Earlier importing, training was working flawlessly. But now training is > very slow as events are increased. Training almost taking 9-10 hours. > > Currently, events are about 15 million and items are about 10 million. > > Architecture is like below: > Spark and elastic search is on two machines. Hadoop and hbase is on > another two separate machines. > > Each machine has following configuration: > 160GB ram, CPUs 40, Cores per socket 10, cpu MHz 3000 > > So please let me know what is right configuration for such large events. > Also let me know what possibility should I consider as my events are going > to increase to billion. Will it work for such large data set? > > Thanks in advance. > > Thanks, > Digambar >
