[jira] [Created] (PHOENIX-2938) HFile support for SparkSQL DataFrame saves

Chris Tarnas (JIRA) Wed, 25 May 2016 16:45:10 -0700

Chris Tarnas created PHOENIX-2938:
-------------------------------------

             Summary: HFile support for SparkSQL DataFrame saves
                 Key: PHOENIX-2938
                 URL: https://issues.apache.org/jira/browse/PHOENIX-2938
             Project: Phoenix
          Issue Type: Improvement
            Reporter: Chris Tarnas
            Priority: Minor



Currently when saving a DataFrame in Spark it is persisted as upserts. Having 
an option to do saves natively via HFiles, as the MapReduce loader does, would 
be a great performance improvement for large bulk loads. The current work 
around to reduce the load on the regionservers would be to save to csv from 
Spark then load via the MapReduce loader.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (PHOENIX-2938) HFile support for SparkSQL DataFrame saves

Reply via email to