CarlosJuncher03 opened a new issue, #6385:
URL: https://github.com/apache/hop/issues/6385

   ### What would you like to happen?
   
   Description: Issue Type: Improvement / Feature Request Priority: Major
   
   Summary: Currently, the Golden Data Set validation in Pipeline Unit Tests 
performs a strict comparison: the output rows must exactly match the values in 
the dataset. If a single character differs (or if the sort order changes), the 
test fails.
   
   This rigidity makes it difficult to test pipelines that generate dynamic 
data, such as:
   
   Timestamps/Dates: (e.g., sysdate or execution_date).
   
   Generated IDs: (e.g., UUIDs or sequence numbers).
   
   Variable outputs: (e.g., Live API responses).
   
   Proposed Solution: Enhance the Golden Data Set configuration to allow 
Validation Rules (Assertions) per field, rather than just static values.
   
   Instead of only checking Value == Expected, allow the user to define rules 
such as:
   
   Not Null: Pass if the field has any value.
   
   Data Type Check: Pass if the value is a specific type (e.g., String, 
Integer).
   
   Numeric Ranges: Pass if Value > X or Value < Y.
   
   String Length: Pass if Length > X.
   
   Regex Matching: Pass if the string matches a pattern (e.g., 
^[0-9]{3}-[0-9]{2}$).
   
   Contains: Pass if the string contains a specific substring.
   
   Benefit: This would drastically improve the usability of Unit Tests for 
real-world scenarios where data is dynamic. Developers could validate the 
structure and integrity of the data without needing to know the exact runtime 
values beforehand.
   
   ### Issue Priority
   
   Priority: 3
   
   ### Issue Component
   
   Component: Pipelines


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to