Hi all,
The PMC vote has passed for the Apache CarbonData 1.6.0 release. The result is
as below:
+1(binding): 5(Jacky, Kumar Vishal, Ravindra, David CaiQiang, Liang Chen)
+1(non-binding): 2
Thanks all for your votes.
Regards
On Mon, Aug 19, 2019 at 9:52 AM Liang Chen wrote:
> +1 from my side
>
>
Hi Community
From https://issues.apache.org/jira/browse/SPARK-18127, Spark provides
SparkSessionExtensions in order to extend the capabilities of Spark. Carbon can
use this to avoid the tight coupling caused by CarbonSession in the Spark
environment.
On 2019/08/21 02:39:45, tao li wrote:
> Hi Akash,
> I have a few questions.
> 1. About the ways to pre-prime: there are two ways. One is to cache during
> data loading; the other is when the cache server starts.
> I think the latter is not desirable, because loading the cache may take
Hi Litao,
Initially, the first count(*) took around 32 seconds because it had to load
the cache, and the second query takes 1.5 to 2 seconds, I think. So with
pre-prime we can achieve more improvement in the first-time query.
Regards,
Akash
On 2019/08/21 03:03:55, tao li wrote:
> hi
Hi Akash,
How much of the performance difference between the first and second queries
is due to caching the index, and how much is due to Hadoop caching?
We should break it down and analyze where the time is spent on the
driver side.
On 2019/08/21 09:42:10, Akash Nilugal
Hi Chetan,
As mentioned in the design, loading to cache will be an async operation, and we
will load only the corresponding segment to cache, so there won't be any
performance hit on the data load.
Logs will be added.
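To illustrate the async behaviour described above, here is a minimal Python sketch, not CarbonData code. The names `index_cache`, `load_segment_to_cache` and `data_load` are hypothetical stand-ins. The idea is only that the data load submits the cache priming for its own segment to a background worker and returns without waiting, and that a log line is written once priming completes.

```python
import logging
import time
from concurrent.futures import ThreadPoolExecutor

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("preprime")

# Hypothetical stand-in for the index server cache: (table, segment) -> index.
index_cache = {}

# Background workers that do the priming, separate from the load path.
executor = ThreadPoolExecutor(max_workers=2)


def load_segment_to_cache(table, segment_id):
    """Simulate loading one segment's index into the cache."""
    time.sleep(0.05)  # stands in for the expensive index read
    index_cache[(table, segment_id)] = f"index-of-{table}-{segment_id}"
    log.info("pre-prime finished for table %s, segment %s", table, segment_id)


def data_load(table, segment_id):
    """Finish the load, then hand cache priming to a background worker."""
    # ... writing the segment itself is omitted in this sketch ...
    # submit() returns immediately, so the load does not wait for priming.
    return executor.submit(load_segment_to_cache, table, segment_id)


if __name__ == "__main__":
    future = data_load("t1", 0)
    future.result()  # only for the demo: wait so the log line is visible
    executor.shutdown()
```

Only the newly loaded segment is primed, and the returned future is there purely so a caller can wait in a demo or test.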
On 2019/08/21 13:18:05, chetan bhat wrote:
> Hi Akash,
>
> 1. Will the performance of end to end dataload
Hi Akash,
count(*) can only load one table. If there are many tables, it would be better
to have a command to trigger the cache load.
On 2019/08/21 09:42:10, Akash Nilugal wrote:
> Hi Litao,
>
> Initially with first time count(*) , it used to take around 32seconds as it
> used to load into
Hi Litao,
Basically, if the total first-time query takes x amount of time, and y of that
is spent connecting to the index server, loading the cache and returning, then
with pre-prime we can save this y time. If not all the segments are loaded, we
save less than y, so we still get the benefit.
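The x and y reasoning above can be written as a small Python sketch. The function name and the linear scaling with the fraction of pre-primed segments are assumptions made for illustration; the thread only says the saving is y when all segments are loaded and less than y otherwise. The 32 seconds is the figure quoted earlier in the thread, while the value used for y is a made-up number, since the real split is exactly what is being asked.

```python
def first_query_time(x, y, primed_fraction):
    """Estimate the first-query time after pre-priming.

    x: total first-query time without pre-prime
    y: part of x spent connecting to the index server, caching and returning
    primed_fraction: share of segments already in cache, from 0.0 to 1.0

    Assumption for illustration only: the saving grows linearly with
    primed_fraction, reaching the full y when every segment is primed.
    """
    if not 0.0 <= primed_fraction <= 1.0:
        raise ValueError("primed_fraction must be between 0.0 and 1.0")
    if y < 0 or y > x:
        raise ValueError("y must be between 0 and x")
    return x - y * primed_fraction


if __name__ == "__main__":
    x = 32.0  # seconds, first count(*) as reported in the thread
    y = 20.0  # seconds, hypothetical cache-loading share of x
    for fraction in (0.0, 0.5, 1.0):
        print(fraction, first_query_time(x, y, fraction))
```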
Hi Akash,
1. Will the performance of the end-to-end data load operation be impacted if
the segment datamap is loaded to cache once the load is finished?
2. Will there be a notification in the logs stating that the loading of the
datamap cache is completed?
Regards
On 2019/08/15 12:03:09, Akash Nilugal