Jeongdae Kim created PHOENIX-3536:
-------------------------------------
Summary: Remove creating unnecessary phoenix connections in MR
Tasks of Hive
Key: PHOENIX-3536
URL: https://issues.apache.org/jira/browse/PHOENIX-3536
Project: Phoenix
Issue Type: Improvement
Reporter: Jeongdae Kim
Assignee: Jeongdae Kim
PhoenixStorageHandler creates phoenix connections to make QueryPlan in getSplit
phase(prepare MR) and getRecordReader phase(Map) while running MR Job.
in phoenix, it spends too many times to create the first phoenix
connection(QueryServices) for specific URL. (checking and loading phoenix
schema information)
i found it is possible to remove creating query plan again in Map
phase(getRecordReader()) by serializing QueryPlan created from Input format ans
passing this plan to record reader.
this approach improves scan performance by removing trying to unnecessary
connection in map phase.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)