GitHub user foryou2030 opened a pull request:
https://github.com/apache/incubator-carbondata/pull/123
[CARBONDATA-204] Clear query statistics map when timeout
# Why raise this PR?
I found two issues with query statistics:
1. Query statistics that are never printed are kept in queryStatisticsMap
indefinitely, which can cause "out of memory" errors when running for a long time.
2. In some scenarios the driver can't record "sql_parse_time", so the driver
statistics logs are not output. We should always output block_allocation_time
and block_identification_time.
# How to solve?
1. Add a function that checks whether a query's statistics have timed out; once
they have, remove the queryId from the map.
2. Add a conditional check on the queryStatisticsMap size: if the query
statistics contain only block_allocation_time and block_identification_time,
then output them.
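The two steps above could be sketched roughly as follows. This is only an illustration, not the code in this PR: the class `QueryStatisticsRegistry`, the nested `Tracked` type, and the methods `record`, `evictExpired`, and `hasOnlyBlockStatistics` are hypothetical names, and the timeout value and the caller-supplied clock are assumptions made to keep the example self-contained.

```java
import java.util.Iterator;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch; names and structure are assumptions, not CarbonData's classes.
class QueryStatisticsRegistry {
  static final String BLOCK_ALLOCATION_TIME = "block_allocation_time";
  static final String BLOCK_IDENTIFICATION_TIME = "block_identification_time";

  // Per-query record: when the query was first seen, plus its statistics.
  static final class Tracked {
    final long createdAtMillis;
    final Map<String, Long> stats = new ConcurrentHashMap<>();

    Tracked(long createdAtMillis) {
      this.createdAtMillis = createdAtMillis;
    }
  }

  private final Map<String, Tracked> queryStatisticsMap = new ConcurrentHashMap<>();
  private final long timeoutMillis;

  QueryStatisticsRegistry(long timeoutMillis) {
    this.timeoutMillis = timeoutMillis;
  }

  // Record one statistic; the caller passes the current time explicitly.
  void record(String queryId, String name, long value, long nowMillis) {
    queryStatisticsMap
        .computeIfAbsent(queryId, id -> new Tracked(nowMillis))
        .stats.put(name, value);
  }

  // Step 1: drop every queryId whose statistics are older than the timeout.
  // Returns the number of entries removed.
  int evictExpired(long nowMillis) {
    int removed = 0;
    Iterator<Map.Entry<String, Tracked>> it = queryStatisticsMap.entrySet().iterator();
    while (it.hasNext()) {
      if (nowMillis - it.next().getValue().createdAtMillis > timeoutMillis) {
        it.remove();
        removed++;
      }
    }
    return removed;
  }

  // Step 2: true when the query has exactly the two block statistics,
  // i.e. sql_parse_time was never recorded but the rest can still be output.
  boolean hasOnlyBlockStatistics(String queryId) {
    Tracked tracked = queryStatisticsMap.get(queryId);
    if (tracked == null) {
      return false;
    }
    Set<String> names = tracked.stats.keySet();
    return names.size() == 2
        && names.contains(BLOCK_ALLOCATION_TIME)
        && names.contains(BLOCK_IDENTIFICATION_TIME);
  }

  int size() {
    return queryStatisticsMap.size();
  }
}
```

In this sketch a caller would invoke `evictExpired` periodically (or before each insert) so that entries which are never printed cannot accumulate, and would use `hasOnlyBlockStatistics` to decide whether to log the two block timings on their own.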
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/foryou2030/incubator-carbondata fix_stat
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-carbondata/pull/123.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #123
----
commit e10868a5154ccc15196b23428db09005c3affc85
Author: foryou2030 <[email protected]>
Date: 2016-09-02T10:22:03Z
clear query statistics map
----