[ 
https://issues.apache.org/jira/browse/PHOENIX-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan updated PHOENIX-1453:
--------------------------------------------
    Attachment: Phoenix-1453_1.patch

An updated patch.
-> Collects the rowCount as a List<Long>.
-> Stores it in the GuidePostsInfo.
-> Serializes it along with the guidePosts as the value part of the cell. To 
maintain backward compatibility (BC) we also write a version byte, which is 
currently -1. I am afraid we cannot use zero or a positive number for it, 
because the first item serialized in the old format is the byteCount, which 
can be 0 or a positive number, so a reader could not tell the two formats 
apart.
-> Adds test cases that check the rowCount with splits and also for the 
multi-CF case.
-> For the TableAlreadyExistsException case, where we update the table with 
the new column:
{code}
addColumnsIfNotExists(metaConnection,
        QueryConstants.CREATE_STATS_TABLE_METADATA, TimeKeeper.SYSTEM.getCurrentTime(),
        PhoenixDatabaseMetaData.GUIDE_POSTS_ROW_COUNT);
{code}
Which timestamp should we use here?
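
To illustrate the version-byte reasoning above, here is a minimal sketch. It is not the code from the patch: the class name GuidePostsCodecSketch, its methods, and the fixed-width big-endian long encoding are assumptions made for illustration (the real GuidePostsInfo serialization may encode the byteCount differently), and the guidePosts themselves are left out for brevity. Under these assumptions a non-negative byteCount never starts with a negative byte, so a leading -1 unambiguously marks the new format.

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical codec, for illustration only; not Phoenix's actual GuidePostsInfo.
final class GuidePostsCodecSketch {
    // Negative marker: a non-negative long written big-endian
    // always has a first byte in the range 0..127.
    static final byte VERSION = -1;

    // Old format (assumed): the value starts directly with the byteCount.
    static byte[] serializeLegacy(long byteCount) {
        return ByteBuffer.allocate(Long.BYTES).putLong(byteCount).array();
    }

    // New format (assumed): version byte, byteCount, then the row counts.
    static byte[] serialize(long byteCount, List<Long> rowCounts) {
        ByteBuffer buf = ByteBuffer.allocate(
                1 + Long.BYTES + Integer.BYTES + Long.BYTES * rowCounts.size());
        buf.put(VERSION);
        buf.putLong(byteCount);
        buf.putInt(rowCounts.size());
        for (long rowCount : rowCounts) {
            buf.putLong(rowCount);
        }
        return buf.array();
    }

    // Returns the row counts, or an empty list for old-format values.
    static List<Long> readRowCounts(byte[] value) {
        if (value.length == 0 || value[0] != VERSION) {
            return Collections.emptyList(); // old format: no row counts stored
        }
        ByteBuffer buf = ByteBuffer.wrap(value);
        buf.get();      // skip the version byte
        buf.getLong();  // skip the byteCount
        int n = buf.getInt();
        List<Long> rowCounts = new ArrayList<>(n);
        for (int i = 0; i < n; i++) {
            rowCounts.add(buf.getLong());
        }
        return rowCounts;
    }
}
```

With a version byte of 0 or 1 the reader in this sketch could not distinguish a new-format value from an old-format value whose byteCount happens to begin with that byte, which is the concern raised above.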

> Collect row counts per region in stats table
> --------------------------------------------
>
>                 Key: PHOENIX-1453
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-1453
>             Project: Phoenix
>          Issue Type: Sub-task
>            Reporter: James Taylor
>            Assignee: ramkrishna.s.vasudevan
>         Attachments: Phoenix-1453.patch, Phoenix-1453_1.patch
>
>
> We currently collect guideposts per equal chunk, but we should also capture 
> row counts. Should we have a parallel array with the guideposts that count 
> rows per guidepost, or is it enough to have a per region count?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
