[
https://issues.apache.org/jira/browse/AIRFLOW-6685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
alex l updated AIRFLOW-6685:
----------------------------
Description:
This PR includes a new operator in `CheckOperator` that allows users to perform
a threshold data quality check.
`ThresholdCheckOperator` will check a single value, sql result against a
threshold range, and will fail a task if it is outside this range. The lower
and upper bound of the threshold can be defined as either a numeric values, or
sql-statements that returns a numeric value.
was:
Add Data Quality Operators to improve data quality testing on data
workflows/pipelines. This includes 3 operators:
* BaseDataQualityOperator
** contains shared attributes and methods that data quality check operators
utilize
** a base class that can be used to create other dq operators
* DataQualityThresholdCheckOperator
** will check a single value, sql result against a threshold range, and will
fail a task if it is outside this range.
* DataQulaityThresholdSQLCheckOperator
** Similar to DataQualityThresholdCheckOperator, but thresholds are
sql-evaluated values, for dynamic threshold ranging.
> Add Data Quality Operators
> ---------------------------
>
> Key: AIRFLOW-6685
> URL: https://issues.apache.org/jira/browse/AIRFLOW-6685
> Project: Apache Airflow
> Issue Type: New Feature
> Components: operators
> Affects Versions: 2.0.0
> Reporter: alex l
> Assignee: alex l
> Priority: Major
>
> This PR includes a new operator in `CheckOperator` that allows users to
> perform a threshold data quality check.
> `ThresholdCheckOperator` will check a single value, sql result against a
> threshold range, and will fail a task if it is outside this range. The lower
> and upper bound of the threshold can be defined as either a numeric values,
> or sql-statements that returns a numeric value.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)