Musxeto opened a new issue, #5854:
URL: https://github.com/apache/texera/issues/5854

   ### Feature Summary
   
   ### Feature Summary
   
   Currently, the Aggregate operator only supports `SUM`, `COUNT`, `AVERAGE`, 
`MIN`, `MAX`, and `CONCAT`. It lacks common statistical aggregations such as 
Median, Variance, Standard Deviation, and Mode. Adding these functions will 
provide data scientists with more powerful statistical tools out-of-the-box 
when building workflows.
   
   ### Proposed Solution or Design
   
   1. Add `MEDIAN`, `VARIANCE`, `STDDEV`, and `MODE` to 
`AggregationFunction.java`.
   2. Implement the computation logic for these functions in 
`AggregationOperation.scala` (and any related distributed aggregation classes).
   3. Update the frontend UI schema for the Aggregate operator to reflect the 
new options.
   4. Add comprehensive unit tests in `AggregateOpSpec.scala` covering 
positive, negative, and edge cases.
   
   ### Affected Area
   
   Workflow Engine (Amber), Workflow UI


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to