[jira] [Commented] (PHOENIX-1453) Collect row counts per region in stats table

James Taylor (JIRA) Thu, 20 Nov 2014 00:04:26 -0800

    [ 
https://issues.apache.org/jira/browse/PHOENIX-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14219114#comment-14219114
 ]


James Taylor commented on PHOENIX-1453:
---------------------------------------

Nice work and great turn around on the patch, [~ramkrishna]. Here's some 
feedback:
- Storing GUIDE_POST_ROW_COUNT as a BIGINT ARRAY, parallel to GUIDE_POSTS will 
give us the most flexibility. That way, on the client side, we can calculate 
per guidepost how many rows will be traversed.
- For the backward compatibility, we'll want to dynamically add the new 
SYSTEM.STATS column in the TableNotFoundException in 
ConnectionQueryServicesImpl.init() method. You'll want to call 
ConnectionQueryServicesImpl.addColumnsIfNotExists() for the new column - see 
prior Phoenix init() methods for an example. You'll also want to bump up the 
MetaDataProtocol.MIN_SYSTEM_TABLE_TIMESTAMP, as that what causes the 
TableNotFoundException versus NewerTableFoundException to be called.
- Need some unit tests around this.
Thanks!

> Collect row counts per region in stats table
> --------------------------------------------
>
>                 Key: PHOENIX-1453
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-1453
>             Project: Phoenix
>          Issue Type: Sub-task
>            Reporter: James Taylor
>            Assignee: ramkrishna.s.vasudevan
>         Attachments: Phoenix-1453.patch
>
>
> We currently collect guideposts per equal chunk, but we should also capture 
> row counts. Should we have a parallel array with the guideposts that count 
> rows per guidepost, or is it enough to have a per region count?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (PHOENIX-1453) Collect row counts per region in stats table

Reply via email to