lopezvit opened a new issue, #39127:
URL: https://github.com/apache/airflow/issues/39127

   ### Description
   
   Sometimes you just need the latest value of a field (e.g. `updatedAt`) so
   that downstream operators can use that value in their own queries.
   This can be done with `SELECT MAX(updatedAt) [...]`, but that would require
   a lot of rewriting, whereas simply adding a new param `ordering_fields` would
   solve the same issue by allowing a query similar to:
   `SELECT updatedAt FROM [...] ORDER BY updatedAt DESC LIMIT 1`
   
   Example implementation (not tested):
   
       def generate_query(self, hook: BigQueryHook) -> str:
           """Generate a SELECT query for the given dataset and table ID."""
           query = "select "
           if self.selected_fields:
               query += self.selected_fields
           else:
               query += "*"
           query += (
               f" from `{self.table_project_id or hook.project_id}"
               f".{self.dataset_id}.{self.table_id}`"
           )
           # ORDER BY has to come before LIMIT in the generated SQL.
           if self.ordering_fields:
               query += f" order by {self.ordering_fields}"
           query += f" limit {self.max_results}"
           return query
   
   ### Use case/motivation
   
   The operator `BigQueryGetDataOperator` should have one more param,
   `ordering_fields`, so that the generated query also includes an `ORDER BY`
   clause.
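
   To illustrate the intended behaviour, here is a minimal standalone sketch
   of the query building. The function name `build_select_query` and its
   arguments are hypothetical and only mirror the operator attributes used in
   the example above; this is not the provider's actual API, and it has not
   been run against BigQuery.

```python
from typing import Optional


def build_select_query(
    project_id: str,
    dataset_id: str,
    table_id: str,
    max_results: int,
    selected_fields: Optional[str] = None,
    ordering_fields: Optional[str] = None,
) -> str:
    """Build a SELECT query with an optional ORDER BY clause (hypothetical helper)."""
    # Fall back to all columns when no fields are selected.
    query = f"select {selected_fields or '*'}"
    query += f" from `{project_id}.{dataset_id}.{table_id}`"
    # ORDER BY is placed before LIMIT so the newest row is the one kept.
    if ordering_fields:
        query += f" order by {ordering_fields}"
    query += f" limit {max_results}"
    return query


if __name__ == "__main__":
    print(
        build_select_query(
            "my-project", "my_dataset", "my_table", 1,
            selected_fields="updatedAt",
            ordering_fields="updatedAt DESC",
        )
    )
```

   With `max_results=1` and `ordering_fields="updatedAt DESC"`, the single row
   returned carries the latest `updatedAt`, which downstream tasks could then
   read from the operator's result.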
   
   ### Related issues
   
   https://github.com/apache/airflow/issues/24460
   
   ### Are you willing to submit a PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of Conduct](https://github.com/apache/airflow/blob/main/CODE_OF_CONDUCT.md)
   

