[ https://issues.apache.org/jira/browse/PHOENIX-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14219114#comment-14219114 ]
James Taylor commented on PHOENIX-1453:
---------------------------------------

Nice work and great turnaround on the patch, [~ramkrishna]. Here's some feedback:
- Storing GUIDE_POST_ROW_COUNT as a BIGINT ARRAY, parallel to GUIDE_POSTS, will give us the most flexibility. That way, on the client side, we can calculate per guidepost how many rows will be traversed.
- For backward compatibility, we'll want to dynamically add the new SYSTEM.STATS column in the TableNotFoundException handling in the ConnectionQueryServicesImpl.init() method. You'll want to call ConnectionQueryServicesImpl.addColumnsIfNotExists() for the new column; see prior Phoenix init() methods for an example. You'll also want to bump up MetaDataProtocol.MIN_SYSTEM_TABLE_TIMESTAMP, as that is what causes the TableNotFoundException versus NewerTableFoundException to be thrown.
- We need some unit tests around this.

Thanks!

> Collect row counts per region in stats table
> --------------------------------------------
>
>                 Key: PHOENIX-1453
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-1453
>             Project: Phoenix
>          Issue Type: Sub-task
>            Reporter: James Taylor
>            Assignee: ramkrishna.s.vasudevan
>         Attachments: Phoenix-1453.patch
>
>
> We currently collect guideposts per equal chunk, but we should also capture
> row counts. Should we have a parallel array with the guideposts that count
> rows per guidepost, or is it enough to have a per region count?

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
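
To illustrate the first feedback item (a row-count array kept parallel to the guideposts), here is a minimal Java sketch of how a client might estimate the rows a scan would traverse. It is not Phoenix code: the class GuidePostRowEstimator, its method names, and the sample keys and counts are all hypothetical, and it assumes rowCounts[i] holds the rows attributed to guidePosts[i] and that guideposts are sorted in unsigned byte order.

```java
/** Hypothetical client-side helper for illustration only; not part of Phoenix. */
public final class GuidePostRowEstimator {

    private GuidePostRowEstimator() {}

    /** Unsigned lexicographic comparison of two row keys. */
    static int compareKeys(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int diff = (a[i] & 0xff) - (b[i] & 0xff);
            if (diff != 0) {
                return diff;
            }
        }
        return a.length - b.length;
    }

    /**
     * Sums the counts of all guideposts whose key lies in [startKey, stopKey).
     * Assumption: rowCounts is parallel to guidePosts, one count per guidepost.
     */
    static long estimateRows(byte[][] guidePosts, long[] rowCounts,
                             byte[] startKey, byte[] stopKey) {
        if (guidePosts.length != rowCounts.length) {
            throw new IllegalArgumentException("arrays must be parallel");
        }
        long total = 0;
        for (int i = 0; i < guidePosts.length; i++) {
            byte[] gp = guidePosts[i];
            if (compareKeys(gp, startKey) >= 0 && compareKeys(gp, stopKey) < 0) {
                total += rowCounts[i];
            }
        }
        return total;
    }

    public static void main(String[] args) {
        // Made-up sample data: four guideposts and their parallel row counts.
        byte[][] guidePosts = { {'c'}, {'f'}, {'k'}, {'p'} };
        long[] rowCounts = { 100L, 250L, 175L, 300L };
        // Scan range [d, m) covers guideposts 'f' and 'k'.
        long estimate = estimateRows(guidePosts, rowCounts,
                new byte[] {'d'}, new byte[] {'m'});
        System.out.println(estimate); // prints 425
    }
}
```

A single per-region count would only allow an estimate at region granularity; the parallel array lets the estimate follow the scan boundaries more closely, which is the flexibility the comment refers to.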