[
https://issues.apache.org/jira/browse/MADLIB-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15172701#comment-15172701
]
Frank McQuillan commented on MADLIB-970:
----------------------------------------
[~gautamsm] Is this a universal rule? Or should there be an input param with
"20% of cells < 5" as the default?
> Log a warning message when running Chi squared independence test if more
> than 20% of the cells in the contingency table have expected values < 5.
> --------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: MADLIB-970
> URL: https://issues.apache.org/jira/browse/MADLIB-970
> Project: Apache MADlib
> Issue Type: New Feature
> Components: Module: Inferential Statistics
> Reporter: Gautam Muralidhar
> Priority: Minor
>
> Log a warning message when running Chi squared independence test if more
> than 20% of the cells in the contingency table have expected values < 5. It
> might be acceptable to not proceed with the computation as well if more than
> 20% of the cells in the contingency table have expected values < 5.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)