Re: Slow cache loading with an object key using affinity key

2021-01-24 Thread vtchernyi
Hi,Please take a look at my post [1]. It is not exactly about you question, but it works in production for row count you mentionRegards,Vladimir[1] https://www.gridgain.com/resources/blog/how-fast-load-large-datasets-apache-ignite-using-key-value-api22:39, 22 января 2021 г., gvaidya :Hi Alex,The

Re: Slow cache loading with an object key using affinity key

2021-01-22 Thread gvaidya
Hi Alex, The data distribution of itemid is very even. For the ~537K records data set there are exactly 12 records per itemid for all values of itemid. I have reviewed all the documentation in the links you provided. I also switched to jdk8 (i was using jdk15 earlier), set my jvm options per the

Re: Slow cache loading with an object key using affinity key

2021-01-21 Thread akorensh
Hi, Can you actually see what the data distribution of itemid is. AffinityKey will direct Ignite to use that specific column (itemid) for partition mapping. If the data in itemid is in a narrow range, then all data will be put into fewer partitions. This might affect loading times.

Re: Slow cache loading with an object key using affinity key

2021-01-21 Thread gvaidya
Hi Alex, I am simply using GridGain/Ignite web console to generate a project for testing cacheload of a single table from my persistent storage. For evaluation/testing purposes, I am running a single node in my cluster on a 2CPU 32Gb Windows machine. For testing, I created 2 projects. Test1 is

Re: Slow cache loading with an object key using affinity key

2021-01-21 Thread akorensh
Hi, Can you give more context regarding your data load. How big is your cluster?* Are the load times comparable if you use only one node for both use cases? * If you remove the affinity column in the second use case, does the load speed up in a cluster env? If you lower the amount

Slow cache loading with an object key using affinity key

2021-01-20 Thread gvaidya
I have just began experimenting with Apache Ignite as an IMDG. I have imported a single table (~42 million, 15 columns - mostly integer/decimal datatypes) in a new cluster created in GridGain Web console and created a new JAVA project. I initially defined the cache as partitioned and did not