Thanks for the suggestion. Both documentation and expansion time warnings are added in https://github.com/apache/beam/pull/34563 .
Yi On Mon, Mar 31, 2025 at 2:43 PM Kenneth Knowles <k...@apache.org> wrote: > One way that you might be able to reach some users is to issue a warning > in the code for SqlTransform.expand(), with the deprecation timeline. Maybe > this is under "etc" but I wanted to mention it because the other items were > all documentation. > > Kenn > > On Mon, Mar 31, 2025 at 1:34 PM Yi Hu via dev <dev@beam.apache.org> wrote: > >> Thanks for inputs! From discussion we have agreed on moving forward >> >> Next step: >> >> - Draft PR to note the deprecation status in Documentation, including >> CHANGES, Javadoc, etc >> >> - The earliest release to stop publishing ZetaSQL artifacts is pushed >> further, not earlier than 1 full quarter before the next Release (2.65.0 + >> 3 months would be 2.68.0), and only when it causes issues on maintenance. >> >> Thanks, >> >> Yi >> >> >> On Wed, Mar 26, 2025 at 12:18 PM Kenneth Knowles <k...@apache.org> wrote: >> >>> +1 to this deprecation. Thanks for putting together a clear summary. >>> >>> FWIW it also has significantly worse performance than Calcite SQL >>> dialect, since it calls out to a ZetaSQL subprocess for most calculations, >>> and that is less optimized than Beam's Fn API. >>> >>> Kenn >>> >>> On Tue, Mar 25, 2025 at 4:18 PM Robert Bradshaw via user < >>> u...@beam.apache.org> wrote: >>> >>>> I'm in favor of deprecating this and cleaning it up, but it depends on >>>> usage. I suspect it is low (or possibly non-existent, especially as there's >>>> little upside to moving away from the default). I cc'd user@ just in >>>> case anyone wants to chime in there. This may be a good thing to add to our >>>> release notes as well (perhaps we can get it in the one that's just about >>>> to go out). >>>> >>>> Unless there is strong, justified pushback, I'd get the deprecation >>>> status (e.g. on the javadocs, website) right away. For actual removal, I >>>> agree with the idea of waiting until it actually causes issues or we move >>>> to the next major beam release, though I might push back at 2.66 being a >>>> bit too quick even if the first condition is hit before then and might give >>>> people at least a quarter's notice. >>>> >>>> - Robert >>>> >>>> >>>> On Mon, Mar 24, 2025 at 2:27 PM Yi Hu via dev <dev@beam.apache.org> >>>> wrote: >>>> >>>>> Hi everyone, >>>>> >>>>> I would like to bring up discussion for deprecating Beam SQL's ZetaSQL >>>>> component [1]. >>>>> Beam SQL currently serves with two SQL dialects (i) Apache Calcite and >>>>> (ii) ZetaSQL dialects, see documentation [2] due to the following reasons >>>>> >>>>> - Developments in Beam for ZetaSQL dialect effectively stalled since >>>>> early 2022 (See change history [3]) >>>>> >>>>> - Despite incomplete support status, there is no new bug / feature >>>>> request opened ever since we migrated to use GitHub Issue, suggesting >>>>> minimal adoption [4] >>>>> >>>>> - We still need to keep zetasql up-to-date if its dependency conflicts >>>>> with other google dependencies, as a result ZetaSQL component introduces >>>>> maintenance burden when upgrading GCP-BOM (e.g. [5]). >>>>> >>>>> - One of the main reason that using ZetaSQL dialect, per [2], was >>>>> because >>>>> >>>>> > ZetaSQL is more compatible with BigQuery, so it’s especially useful >>>>> in pipelines that write to or read from BigQuery tables. >>>>> >>>>> As of today, as GCP BigQuery now supports using GoogleSQL >>>>> (open-sourced as ZetaSQL) querying data that's stored outside of BigQuery >>>>> via BigQuery Connections API / Federated query [6, 7]. This largely >>>>> provides an alternative for using Beam's ZetaSQL interacting with >>>>> BigQuery. >>>>> >>>>> For these reasons, I propose initiating the process of deprecating >>>>> Beam SQL's ZetaSQL component. There are two decisions needed to be made: >>>>> >>>>> Firstly, agree on when to document the deprecated status for ZetaSQL >>>>> component in javadoc, beam website, currently I recommend do it in the >>>>> release that currently HEAD belongs, that is Beam 2.65.0 (cut April 30, >>>>> 2025) >>>>> >>>>> Secondly, stop publishing ZetaSQL artifacts. This is a breaking >>>>> change, and I think we can leave the deprecated status as is until the >>>>> following situation emerges, whichever comes first, and no earlier than >>>>> Beam 2.66.0 (cut Jun 11, 2025) >>>>> >>>>> - Continued support for ZetaSQL component involving significant >>>>> burdens, like conflict with other Beam dependencies, supported Java >>>>> versions, etc, or >>>>> - When Beam moved to the next release major release (3) >>>>> >>>>> Thanks for your attention, and any input welcomed! >>>>> >>>>> Regards, >>>>> Yi >>>>> >>>>> [1] >>>>> https://github.com/apache/beam/tree/master/sdks/java/extensions/sql/zetasql >>>>> [2] https://beam.apache.org/documentation/dsls/sql/overview/ >>>>> [3] >>>>> https://github.com/benEng/beam/commits/master/sdks/java/extensions/sql/zetasql/src/main/java/org/apache/beam/sdk/extensions/sql/zetasql/SupportedZetaSqlBuiltinFunctions.java >>>>> [4] >>>>> https://github.com/apache/beam/issues?q=is%3Aissue%20%20label%3Azetasql%20 >>>>> [5] https://github.com/apache/beam/pull/32902 >>>>> [6] https://cloud.google.com/bigquery/docs/connections-api-intro >>>>> [7] https://cloud.google.com/bigquery/docs/federated-queries-intro >>>>> >>>>> -- >>>>> >>>>> Yi Hu, (he/him/his) >>>>> >>>>> Software Engineer >>>>> >>>>> >>>>>