judahrand commented on PR #24249:
URL: https://github.com/apache/beam/pull/24249#issuecomment-1319954241

   > From the conversation in https://issues.apache.org/jira/browse/BEAM-7610 
and related PRs, this should have been fixed by #14146
   > 
   > @apilloud could you take a look at this one since you have more context 
here?
   
   It's an issue we're definitely having in Dataflow SQL. A query which looks 
roughly like this:
   
   ```
   SELECT
       window_start,
       window_end,
       COALESCE(ARRAY_AGG(foo), []) AS data
   FROM HOP(
          (SELECT * FROM pubsub.topic.`project_id`.`topic`),
          DESCRIPTOR(event_timestamp),
          "INTERVAL 5 MINUTE",
          "INTERVAL 15 MINUTE"
      ) as data
      GROUP BY window_start, window_end
   ```
   
   results in an error downstream which says that Data Catalog does not support 
`NULLABLE ARRAYS`. This means that the output schema of the query must not be 
correctly handling the `COALESCE`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to