[ 
https://issues.apache.org/jira/browse/MADLIB-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15172701#comment-15172701
 ] 

Frank McQuillan commented on MADLIB-970:
----------------------------------------

[~gautamsm] Is this a universal rule?  Or should there be an input param with 
"20% of cells < 5" as the default? 

> Log a warning message when running Chi squared independence test  if more 
> than 20% of the cells in the contingency table have expected values < 5.
> --------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: MADLIB-970
>                 URL: https://issues.apache.org/jira/browse/MADLIB-970
>             Project: Apache MADlib
>          Issue Type: New Feature
>          Components: Module: Inferential Statistics
>            Reporter: Gautam Muralidhar
>            Priority: Minor
>
> Log a warning message when running Chi squared independence test  if more 
> than 20% of the cells in the contingency table have expected values < 5. It 
> might be acceptable to not proceed with the computation as well if more than 
> 20% of the cells in the contingency table have expected values < 5.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to