Anybody any ideas?
Any help would be appreciated!
Cheers,
Mika
--
View this message in context:
http://www.nabble.com/Odd-results-with-Chi-square-test.-%28Not-an-R-problem%2C-but-general-statistics%2C-I-think.%29-tp25026167p25041900.html
Sent from the R help mailing list archive at
Hi,
I am working on a system which automatically answers user questions (such
systems are commonly called Question Answering systems). I evaluated
different versions of the same system on a publicly available test sets.
Naturally, there is a fixed number of questions in the test set, and the
I'm far from an expert on stats but what I think you are saying is if you try
and compare Baseline with Version 3 you don't think your p-value is as good as
version 1 and 2. I'm not 100% sure you are meant to do that with p-values but
I'll let someone else comment on that!.
3 matches
Mail list logo