Re: [VOTE] Apache CarbonData 1.6.0(RC3) release

2019-08-21 Thread Raghunandan S
Hi all, the PMC vote has passed for the Apache CarbonData 1.6.0 release. The result is as below: +1 (binding): 5 (Jacky, Kumar Vishal, Ravindra, David CaiQiang, Liang Chen); +1 (non-binding): 2. Thanks all for your votes. Regards On Mon, Aug 19, 2019 at 9:52 AM Liang Chen wrote: > +1 from my side

Adapt to SparkSessionExtensions

2019-08-21 Thread Ajith shetty
Hi Community, From https://issues.apache.org/jira/browse/SPARK-18127, Spark provides SparkSessionExtensions in order to extend the capabilities of Spark. Carbon can use this to avoid the tight coupling due to CarbonSession in the Spark environment.

Re: [DISCUSSION] Cache Pre Priming

2019-08-21 Thread Akash Nilugal
On 2019/08/21 02:39:45, tao li wrote: > hi Akash, > I have a few questions. > 1. About the ways to pre-prime: there are 2 ways. One is to cache during data > loading, another is when the cache server is started. > I think the latter is not desirable, because loading the cache may take

Re: [DISCUSSION] Cache Pre Priming

2019-08-21 Thread Akash Nilugal
Hi Litao, Initially the first count(*) query used to take around 32 seconds, as it had to load into cache, and the second query takes 1.5 to 2 seconds I think, so with pre-priming we can achieve more improvement in the first query. Regards, Akash On 2019/08/21 03:03:55, tao li wrote: > hi

Re: [DISCUSSION] Cache Pre Priming

2019-08-21 Thread tao li
Hi Akash, how much of the performance difference between the first and second queries comes from index caching and how much from Hadoop caching? We should break it down and look at a time-cost analysis on the driver side. On 2019/08/21 09:42:10, Akash Nilugal

Re: [DISCUSSION] Cache Pre Priming

2019-08-21 Thread Akash Nilugal
Hi Chetan, As mentioned in the design, loading to cache will be an async operation, and we will load only the corresponding segment to cache, so there won't be any performance hit. Logs will be added. On 2019/08/21 13:18:05, chetan bhat wrote: > Hi Akash, > > 1. Will the performance of end to end dataload
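
A minimal sketch of the asynchronous pre-priming idea described in this reply, assuming a hypothetical in-memory cache and a hypothetical `load_segment_index` helper (none of these names are CarbonData API): the data load returns without waiting, only the newly loaded segment is cached, and a log line marks completion.

```python
import logging
import threading
from concurrent.futures import ThreadPoolExecutor

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("preprime")

# Hypothetical stand-ins; none of these names come from CarbonData.
index_cache = {}
cache_lock = threading.Lock()
executor = ThreadPoolExecutor(max_workers=2)


def load_segment_index(table, segment_id):
    """Pretend to read the index of one segment (placeholder work)."""
    return f"index-of-{table}-{segment_id}"


def preprime_segment(table, segment_id):
    """Load only the given segment into the cache and log completion."""
    index = load_segment_index(table, segment_id)
    with cache_lock:
        index_cache[(table, segment_id)] = index
    log.info("pre-prime finished for %s segment %s", table, segment_id)


def finish_data_load(table, segment_id):
    """Called when a data load completes; schedules pre-priming and returns at once."""
    return executor.submit(preprime_segment, table, segment_id)


future = finish_data_load("sales", 3)
future.result()  # only for the demo; a real load would not block here
print(sorted(index_cache))
```

Because the cache load runs on a separate worker, the load path itself only pays the cost of submitting the task.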

Re: [DISCUSSION] Cache Pre Priming

2019-08-21 Thread tao li
Hi Akash, count(*) can only load one table; if there are many tables, it would be better to have a command to trigger the cache load. On 2019/08/21 09:42:10, Akash Nilugal wrote: > Hi Litao, > > Initially with first time count(*) , it used to take around 32seconds as it > used to load into

Re: [DISCUSSION] Cache Pre Priming

2019-08-21 Thread Akash Nilugal
Hi Litao, Basically, if the first query takes x amount of time in total, and y of that is spent connecting to the index server, loading the cache, and returning, then with pre-priming we can save this y time. If not all the segments are loaded, we save less than y, so we still get the benefit,
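
The estimate in this message can be written as a small calculation. This is an illustrative sketch only: the function name, the linear scaling with the fraction of segments cached, and the sample numbers are assumptions, not measurements from the thread.

```python
def first_query_time(x, y, cached_fraction):
    """Estimated first-query time after pre-priming.

    x: total first-query time without pre-priming
    y: part of x spent on index-server connection, cache load, and return
    cached_fraction: share of segments already pre-primed (0.0 to 1.0)
    Assumes the saving scales linearly with cached_fraction.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    if y > x:
        raise ValueError("y cannot exceed x")
    return x - y * cached_fraction


# Made-up numbers for illustration.
print(first_query_time(10.0, 8.0, 1.0))  # all segments cached: full y saved
print(first_query_time(10.0, 8.0, 0.5))  # half cached: less than y saved
```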

Re: [DISCUSSION] Cache Pre Priming

2019-08-21 Thread chetan bhat
Hi Akash, 1. Will the performance of the end-to-end data load operation be impacted if the segment datamap is loaded to cache once the load is finished? 2. Will there be a notification in the logs stating that the loading of the datamap cache is completed? Regards On 2019/08/15 12:03:09, Akash Nilugal