drobert opened a new pull request #18960:
URL: https://github.com/apache/airflow/pull/18960


   The 'tutorial' basic ETL example at 
https://airflow.apache.org/docs/apache-airflow/stable/tutorial.html#testing is 
not functional. 
   
   This PR:
   
   - allows the file save path to be configurable (at least called out/editable)
   - ensures the CSV file is properly formatted with newlines and no blank lines
   - removes a postgres constraint id violation
   - modifies the csv import code to support a header row
   
   related: https://github.com/apache/airflow/issues/18950 
   
   ## Not in this PR
   
   - the upstream data has malformed IDs. They're represented in scientific 
notation rather than `long` (`bigint`) and the result is duplicate IDs
   - the example relies on a configured `Connection`, which is a concept that 
hasn't been introduced yet and isn't clear from the documentation. This can be 
in a subsequent PR.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to