Kent Yao created SPARK-55279:
--------------------------------

             Summary: [SQL] Add sketch_funcs group for DataSketches SQL 
functions
                 Key: SPARK-55279
                 URL: https://issues.apache.org/jira/browse/SPARK-55279
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 4.2.0
            Reporter: Kent Yao


All DataSketches-related expression functions should have their own 
'sketch_funcs' group instead of being grouped under 'misc_funcs'.

This improves consistency with how other specialized function categories are 
organized and makes the documentation clearer for users.

Functions to move from misc_funcs to sketch_funcs:
- HLL sketch functions: hll_sketch_estimate, hll_union
- Theta sketch functions: theta_sketch_estimate, theta_union, theta_difference, 
theta_intersection
- KLL sketch functions: kll_sketch_to_string_*, kll_sketch_get_n_*, 
kll_sketch_get_rank_*, kll_sketch_get_quantile_*, kll_sketch_get_pmf_*, 
kll_sketch_get_cdf_*, kll_sketch_merge_*
- Tuple sketch functions: tuple_sketch_* expression functions
- ApproxTopK: approx_top_k_estimate

Note: Aggregate functions (like hll_sketch_agg, theta_sketch_agg, 
kll_sketch_agg_*, etc.) remain in 'agg_funcs' as they are aggregates.

Changes:
- Move all sketch-related expression functions from misc_funcs to sketch_funcs
- Add sketch_funcs to the groups set in gen-sql-functions-docs.py



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to