Hyukjin Kwon created SPARK-21914:
------------------------------------

             Summary: Running examples as tests in SQL builtin function 
documentation
                 Key: SPARK-21914
                 URL: https://issues.apache.org/jira/browse/SPARK-21914
             Project: Spark
          Issue Type: Test
          Components: SQL
    Affects Versions: 2.3.0
            Reporter: Hyukjin Kwon


It looks we have added many examples in {{ExpressionDescription}} for builtin 
functions.
Actually, if I have seen correctly, we have fixed many examples so far in some 
minor PRs and sometimes require to add the examples as tests sql and golden 
files.

As we have formatted examples in {{ExpressionDescription.examples}} - 
https://github.com/apache/spark/blob/ba327ee54c32b11107793604895bd38559804858/sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/ExpressionDescription.java#L44-L50,
 and we have `SQLQueryTestSuite`, I think we could run the examples as tests 
like Python's doctests.

Rough way I am thinking:

1. Loads the example in {{ExpressionDescription}}.
2. identify queries by {{>}}.
3. identify the rest of them as the results.
4. run the examples by reusing {{SQLQueryTestSuite}} if possible.
5. compare the output by reusing {{SQLQueryTestSuite}} if possible.

Advantages of doing this I could think for now:

- Reduce the number of PRs to fix the examples
- De-duplicate the test cases that should be added into sql and golden files.
- Correct documentation with correct examples.
- Reduce reviewing costs for documentation fix PRs.




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to