[ https://issues.apache.org/jira/browse/PHOENIX-3536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15866342#comment-15866342 ]
James Taylor commented on PHOENIX-3536: --------------------------------------- Thanks for the patch, [~Jeongdae Kim]. What kind of impact does this have on performance? [~sergey.soldatov] - would it be possible for you to review this? [~Jeongdae Kim] - this might need to be rebased after PHOENIX-3346 is committed. > Remove creating unnecessary phoenix connections in MR Tasks of Hive > ------------------------------------------------------------------- > > Key: PHOENIX-3536 > URL: https://issues.apache.org/jira/browse/PHOENIX-3536 > Project: Phoenix > Issue Type: Improvement > Reporter: Jeongdae Kim > Assignee: Jeongdae Kim > Labels: HivePhoenix > Attachments: PHOENIX-3536.1.patch > > > PhoenixStorageHandler creates phoenix connections to make QueryPlan in > getSplit phase(prepare MR) and getRecordReader phase(Map) while running MR > Job. > in phoenix, it spends too many times to create the first phoenix > connection(QueryServices) for specific URL. (checking and loading phoenix > schema information) > i found it is possible to remove creating query plan again in Map > phase(getRecordReader()) by serializing QueryPlan created from Input format > ans passing this plan to record reader. > this approach improves scan performance by removing trying to unnecessary > connection in map phase. -- This message was sent by Atlassian JIRA (v6.3.15#6346)