drobert opened a new pull request #18960: URL: https://github.com/apache/airflow/pull/18960
The 'tutorial' basic ETL example at https://airflow.apache.org/docs/apache-airflow/stable/tutorial.html#testing is not functional. This PR: - allows the file save path to be configurable (at least called out/editable) - ensures the CSV file is properly formatted with newlines and no blank lines - removes a postgres constraint id violation - modifies the csv import code to support a header row related: https://github.com/apache/airflow/issues/18950 ## Not in this PR - the upstream data has malformed IDs. They're represented in scientific notation rather than `long` (`bigint`) and the result is duplicate IDs - the example relies on a configured `Connection`, which is a concept that hasn't been introduced yet and isn't clear from the documentation. This can be in a subsequent PR. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
