[ 
https://issues.apache.org/jira/browse/SPARK-36504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Haejoon Lee updated SPARK-36504:
--------------------------------
    Description: 
There are many codes in pandas-on-Spark are not being tested, for example:
 * (Series|DataFrame).to_clipboard

             !Screen Shot 2021-08-13 at 9.56.48 AM.png|width=548,height=125!    
        
 * `value` and `method` argument for Series.fillna

             !Screen Shot 2021-08-13 at 9.59.23 AM.png|width=551,height=34!

 

The red line above screen capture means that "this line is not being tested".

Now the test coverage of pandas-on-Spark is 89.93% for total, 93.43% for 
frame.py (which is including DataFrame API), 89.04% for indexing.py (which is 
including Index API) and 93.43% for series.py (which is including Series API).

Not necessarily cover the 100% of codes, since some test such as 
`DataFrame.to_delta` is untestable for now, but we should cover the codes as 
much as possible for healthy of project.

You can find more missing tests and percentage of coverage in [code cov 
report|[https://app.codecov.io/gh/apache/spark]|https://app.codecov.io/gh/apache/spark].

 

 

  was:
There are many codes in pandas-on-Spark are not being tested, for example:
 * (Series|DataFrame).to_clipboard

             !Screen Shot 2021-08-13 at 9.56.48 AM.png|width=548,height=125!    
        
 * `value` and `method` argument for Series.fillna

             !Screen Shot 2021-08-13 at 9.59.23 AM.png|width=551,height=34!

 

The red line above screen capture means that "this line is not being tested".

Now the test coverage of pandas-on-Spark is 89.93% for total, 93.43% for 
frame.py (which is including DataFrame API), 89.04% for indexing.py (which is 
including Index API) and 93.43% for series.py (which is including Series API).

Not necessarily cover the 100% of codes, since some test such as 
`DataFrame.to_delta` is untestable for now, but we should cover the codes as 
much as possible for healthy of project.

You can find more missing tests and percentage of coverage in [code cov 
report|[https://app.codecov.io/gh/apache/spark]|https://app.codecov.io/gh/apache/spark].].

 

 


> Improve test coverage for pandas API on Spark
> ---------------------------------------------
>
>                 Key: SPARK-36504
>                 URL: https://issues.apache.org/jira/browse/SPARK-36504
>             Project: Spark
>          Issue Type: Umbrella
>          Components: PySpark
>    Affects Versions: 3.3.0
>            Reporter: Haejoon Lee
>            Priority: Major
>         Attachments: Screen Shot 2021-08-13 at 9.56.48 AM.png, Screen Shot 
> 2021-08-13 at 9.59.23 AM.png
>
>
> There are many codes in pandas-on-Spark are not being tested, for example:
>  * (Series|DataFrame).to_clipboard
>              !Screen Shot 2021-08-13 at 9.56.48 AM.png|width=548,height=125!  
>           
>  * `value` and `method` argument for Series.fillna
>              !Screen Shot 2021-08-13 at 9.59.23 AM.png|width=551,height=34!
>  
> The red line above screen capture means that "this line is not being tested".
> Now the test coverage of pandas-on-Spark is 89.93% for total, 93.43% for 
> frame.py (which is including DataFrame API), 89.04% for indexing.py (which is 
> including Index API) and 93.43% for series.py (which is including Series API).
> Not necessarily cover the 100% of codes, since some test such as 
> `DataFrame.to_delta` is untestable for now, but we should cover the codes as 
> much as possible for healthy of project.
> You can find more missing tests and percentage of coverage in [code cov 
> report|[https://app.codecov.io/gh/apache/spark]|https://app.codecov.io/gh/apache/spark].
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to