Daniel Frank created AIRFLOW-514: ------------------------------------ Summary: HiveCli hook should be able to load a pandas DataFrame Key: AIRFLOW-514 URL: https://issues.apache.org/jira/browse/AIRFLOW-514 Project: Apache Airflow Issue Type: Improvement Reporter: Daniel Frank Assignee: Daniel Frank Priority: Minor
Currently the hive cli hook can load_df, which returns a pandas.DataFrame. Many of our workflows involve retrieving a hive table in pandas.DataFrame form, modifying it and saving it (perhaps elsewhere). In order to save the dataframe we have to manually translate the types, save to disk and run load_file() which is repetitive and tedious. This workflow could be easily automated with a load_df method for HiveCliHook -- This message was sent by Atlassian JIRA (v6.3.4#6332)