Chris Tarnas created PHOENIX-2938:
-------------------------------------
Summary: HFile support for SparkSQL DataFrame saves
Key: PHOENIX-2938
URL: https://issues.apache.org/jira/browse/PHOENIX-2938
Project: Phoenix
Issue Type: Improvement
Reporter: Chris Tarnas
Priority: Minor
Currently when saving a DataFrame in Spark it is persisted as upserts. Having
an option to do saves natively via HFiles, as the MapReduce loader does, would
be a great performance improvement for large bulk loads. The current work
around to reduce the load on the regionservers would be to save to csv from
Spark then load via the MapReduce loader.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)