Daniel Frank created AIRFLOW-514:
------------------------------------

             Summary: HiveCli hook should be able to load a pandas DataFrame 
                 Key: AIRFLOW-514
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-514
             Project: Apache Airflow
          Issue Type: Improvement
            Reporter: Daniel Frank
            Assignee: Daniel Frank
            Priority: Minor


Currently the Hive hooks can return query results as a pandas.DataFrame. Many 
of our workflows involve retrieving a Hive table as a pandas.DataFrame, 
modifying it, and saving it (perhaps elsewhere). To save the DataFrame, we 
have to manually translate the column types, write the data to disk, and call 
load_file(), which is repetitive and tedious. This workflow could easily be 
automated with a load_df method on HiveCliHook.
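
As a rough illustration of the three manual steps described above (translate
the types, write to disk, call load_file()), here is a minimal sketch. It is
not the implementation proposed in this issue. The helper names
infer_field_dict and load_df, the dtype-to-Hive type mapping, and the
temporary file name are assumptions made for the example. The hook is assumed
to expose load_file() accepting filepath, table, delimiter and field_dict
keyword arguments.

```python
import os
import tempfile
from collections import OrderedDict

# Assumed mapping from numpy dtype "kind" codes to Hive column types.
# Anything not listed here falls back to STRING.
DTYPE_KIND_TO_HIVE = {
    "b": "BOOLEAN",
    "i": "BIGINT",
    "u": "BIGINT",
    "f": "DOUBLE",
    "M": "TIMESTAMP",
    "O": "STRING",
    "U": "STRING",
}


def infer_field_dict(df):
    """Map each DataFrame column name to a Hive type, preserving column order."""
    return OrderedDict(
        (str(col), DTYPE_KIND_TO_HIVE.get(dtype.kind, "STRING"))
        for col, dtype in df.dtypes.items()
    )


def load_df(hook, df, table, delimiter=",", **kwargs):
    """Write df to a temporary delimited file and pass it to hook.load_file().

    `hook` is any object with a load_file() method (for example a HiveCliHook).
    Extra keyword arguments are forwarded to load_file() unchanged.
    """
    field_dict = infer_field_dict(df)
    with tempfile.TemporaryDirectory() as tmp_dir:
        path = os.path.join(tmp_dir, "data.csv")
        # No header or index: the column names travel in field_dict instead.
        df.to_csv(path, sep=delimiter, header=False, index=False)
        hook.load_file(
            filepath=path,
            table=table,
            delimiter=delimiter,
            field_dict=field_dict,
            **kwargs
        )
    return field_dict
```

With a real hook this might be called as load_df(HiveCliHook(), df,
"my_db.my_table"), where the table name is a placeholder. Values that contain
the delimiter or newlines would need extra handling that this sketch leaves
out.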



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
