aglinxinyuan opened a new pull request, #4211:
URL: https://github.com/apache/texera/pull/4211

   ### What changes were proposed in this PR?
   Introduce the Sklearn Testing Operator, which accepts any number of machine 
learning models (from 1 to n) and computes `accuracy`, `F1` score, `precision`, 
and `recall` for each model, appending these metrics to the output columns.
   
   There is a port dependency between the model and the data: the data port 
must be finished first. The data table is then used as an internal state for 
testing. This operator can accept any number of models; each model will be 
tested against the same data table.
   
   **Input single model:**
   <img width="797" alt="Screenshot 2025-07-04 at 22 54 33" 
src="https://github.com/user-attachments/assets/d14326f2-4f5f-4476-9eda-eb464ea8049c";
 />
   
   **Input multiple models:**
   <img width="822" alt="Screenshot 2025-07-04 at 22 47 09" 
src="https://github.com/user-attachments/assets/4333ca08-3717-407a-b978-05995197f8c8";
 />
   
   ### Was this PR authored or co-authored using generative AI tooling?
   No


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to