Kent Yao created SPARK-55279:
--------------------------------
Summary: [SQL] Add sketch_funcs group for DataSketches SQL
functions
Key: SPARK-55279
URL: https://issues.apache.org/jira/browse/SPARK-55279
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 4.2.0
Reporter: Kent Yao
All DataSketches-related expression functions should have their own
'sketch_funcs' group instead of being grouped under 'misc_funcs'.
This improves consistency with how other specialized function categories are
organized and makes the documentation clearer for users.
Functions to move from misc_funcs to sketch_funcs:
- HLL sketch functions: hll_sketch_estimate, hll_union
- Theta sketch functions: theta_sketch_estimate, theta_union, theta_difference,
theta_intersection
- KLL sketch functions: kll_sketch_to_string_*, kll_sketch_get_n_*,
kll_sketch_get_rank_*, kll_sketch_get_quantile_*, kll_sketch_get_pmf_*,
kll_sketch_get_cdf_*, kll_sketch_merge_*
- Tuple sketch functions: tuple_sketch_* expression functions
- ApproxTopK: approx_top_k_estimate
Note: Aggregate functions (like hll_sketch_agg, theta_sketch_agg,
kll_sketch_agg_*, etc.) remain in 'agg_funcs' as they are aggregates.
Changes:
- Move all sketch-related expression functions from misc_funcs to sketch_funcs
- Add sketch_funcs to the groups set in gen-sql-functions-docs.py
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]