[
https://issues.apache.org/jira/browse/SPARK-55279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kent Yao reassigned SPARK-55279:
--------------------------------
Assignee: Kent Yao
> [SQL] Add sketch_funcs group for DataSketches SQL functions
> -----------------------------------------------------------
>
> Key: SPARK-55279
> URL: https://issues.apache.org/jira/browse/SPARK-55279
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 4.2.0
> Reporter: Kent Yao
> Assignee: Kent Yao
> Priority: Major
> Labels: pull-request-available
>
> All DataSketches-related expression functions should have their own
> 'sketch_funcs' group instead of being grouped under 'misc_funcs'.
> This improves consistency with how other specialized function categories are
> organized and makes the documentation clearer for users.
> Functions to move from misc_funcs to sketch_funcs:
> - HLL sketch functions: hll_sketch_estimate, hll_union
> - Theta sketch functions: theta_sketch_estimate, theta_union,
> theta_difference, theta_intersection
> - KLL sketch functions: kll_sketch_to_string_*, kll_sketch_get_n_*,
> kll_sketch_get_rank_*, kll_sketch_get_quantile_*, kll_sketch_get_pmf_*,
> kll_sketch_get_cdf_*, kll_sketch_merge_*
> - Tuple sketch functions: tuple_sketch_* expression functions
> - ApproxTopK: approx_top_k_estimate
> Note: Aggregate functions (like hll_sketch_agg, theta_sketch_agg,
> kll_sketch_agg_*, etc.) remain in 'agg_funcs' as they are aggregates.
> Changes:
> - Move all sketch-related expression functions from misc_funcs to sketch_funcs
> - Add sketch_funcs to the groups set in gen-sql-functions-docs.py
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]