[
https://issues.apache.org/jira/browse/PHOENIX-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ramkrishna.s.vasudevan updated PHOENIX-1453:
--------------------------------------------
Attachment: Phoenix-1453_1.patch
An updated patch:
-> Collects the rowCount as a List<Long>.
-> Stores it in the GuidePostsInfo.
-> Serializes it along with the guidePosts as the value part of the cell. To
maintain backward compatibility (BC) we also write a version byte, which is
currently -1. I am afraid we cannot use zero or a positive number for it,
because the first item that we serialize here is the byteCount, and that could
be 0 or a positive number.
-> Adds test cases that check the rowCount with splits and also for the
multi-CF case.
-> For the TableAlreadyExistsException case, the table is updated with the new
column as follows:
{code}
addColumnsIfNotExists(metaConnection,
QueryConstants.CREATE_STATS_TABLE_METADATA, TimeKeeper.SYSTEM.getCurrentTime(),
PhoenixDatabaseMetaData.GUIDE_POSTS_ROW_COUNT);
{code}
What timestamp should we use here?
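To illustrate the version-byte reasoning in the serialization item above, here is a minimal sketch. It is not the code from the patch: the class GuidePostsCodecSketch, its methods, and both byte layouts are hypothetical, the guidePosts themselves are left out for brevity, and it assumes the byteCount is written as a fixed 8-byte big-endian long. With a different encoding (for example a variable-length long) the same argument would need to be rechecked.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; not the actual GuidePostsInfo serialization.
class GuidePostsCodecSketch {
    // A legacy value starts with a non-negative byteCount, so the high-order
    // byte of that big-endian long is never negative. A negative marker
    // therefore cannot be confused with the start of a legacy value.
    static final byte VERSION_MARKER = -1;

    /** Decoded value; rowCounts is empty when the legacy layout was read. */
    static final class Decoded {
        final long byteCount;
        final List<Long> rowCounts;

        Decoded(long byteCount, List<Long> rowCounts) {
            this.byteCount = byteCount;
            this.rowCounts = rowCounts;
        }
    }

    // Assumed legacy layout: [byteCount:long]
    static byte[] encodeLegacy(long byteCount) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(bytes);
        out.writeLong(byteCount);
        out.flush();
        return bytes.toByteArray();
    }

    // Assumed new layout: [marker:byte][byteCount:long][n:int][rowCount:long]*n
    static byte[] encode(long byteCount, List<Long> rowCounts) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(bytes);
        out.writeByte(VERSION_MARKER);
        out.writeLong(byteCount);
        out.writeInt(rowCounts.size());
        for (long rowCount : rowCounts) {
            out.writeLong(rowCount);
        }
        out.flush();
        return bytes.toByteArray();
    }

    // Reads either layout by inspecting the first byte (value must be non-empty).
    static Decoded decode(byte[] value) throws IOException {
        DataInputStream in = new DataInputStream(new ByteArrayInputStream(value));
        List<Long> rowCounts = new ArrayList<>();
        if (value[0] == VERSION_MARKER) {
            in.readByte(); // skip the marker
            long byteCount = in.readLong();
            int n = in.readInt();
            for (int i = 0; i < n; i++) {
                rowCounts.add(in.readLong());
            }
            return new Decoded(byteCount, rowCounts);
        }
        // Legacy value: no marker, no row counts.
        return new Decoded(in.readLong(), rowCounts);
    }
}
```

Under these assumptions, a value written by an older version decodes with an empty rowCount list, while a value written with the marker round-trips both the byteCount and the row counts.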
> Collect row counts per region in stats table
> --------------------------------------------
>
> Key: PHOENIX-1453
> URL: https://issues.apache.org/jira/browse/PHOENIX-1453
> Project: Phoenix
> Issue Type: Sub-task
> Reporter: James Taylor
> Assignee: ramkrishna.s.vasudevan
> Attachments: Phoenix-1453.patch, Phoenix-1453_1.patch
>
>
> We currently collect guideposts per equal chunk, but we should also capture
> row counts. Should we have a parallel array with the guideposts that count
> rows per guidepost, or is it enough to have a per region count?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)