Wow. You do have a rationale for your answer to 4. I completely disagree with it but it's a rational.

At 02:42 PM 7/2/2010, David Pennock wrote:
Oops, I meant 1,000,000 for question 5.

I'm not a statistician. I reasoned this way: If I somehow "explore" lots of hypotheses I need to correct for that, otherwise I may overfit. In 2, I only checked one randomly chosen hypothesis and stopped exploring after that.

In 4 and 5, I've in some way explored all 1,000,000 hypotheses at least implicitly either via my own intuition or via algorithmic pruning, so I need to correct for that.

Correcting for all 1,000,000 is overkill in some cases and few people would actually do it, but to be "safe" I was advocating that you technically should.

I could easily have reasoned incorrectly. :-) Thanks for livening up this list.

thanks,
Dave

Rich Neapolitan wrote:
Dear David,
Thanks for the reply. Question 5 does not have a yes or no answer. I find it quite odd that you say yes to 4 and no to 2. Can you explain? If you were a classic frequentist, you'd say no to both. I would say yes to 2 and no to 4.
By the way, I just submitted another post with my answers.
Best,
Rich

At 07:45 AM 7/2/2010, David Pennock wrote:
I'll bite.

1. yes
2. no
3. yes
4. yes
5. yes

Rich Neapolitan wrote:
Let's have some fun in this group again instead of just posting about conferences, post docs, and new books, etc. I offer you this quiz about the use of the Bonferroni (or any other) correction:
I have 1,000,000 hypotheses that are not mutually exclusive.
1. I test them all. Do I apply the Bonferroni correction?
2. I plan to test them all, but I run out of resources after testing only one of them. Do I apply the Bonferroni correction? 3. I test one of them, and a year later test the others. Do I apply the Bonferroni correction to the one tested early? If so, when? 4. I only test the first one because that is the one I suspect. Do I apply the Bonferroni correction? 5. I run an algorithm that prunes unlikely hypotheses. I end up testing only 100,000. Do I apply the Bonferroni correction for 100,000 or for 1,000,000 hypotheses. I have two colleagues who make their living as statisticians, one is a good Bayesian and the other a good frequentist (oxymoron?). They both have different answers, and neither agrees with me. It would be interesting to learn how some of you view this matter.
Best,
Rich
Rich Neapolitan
Professor and Chair of Computer Science
Northeastern Illinois University
5500 N. St. Louis
Chicago, Il 60625

------------------------------------------------------------------------
_______________________________________________
uai mailing list
uai@ENGR.ORST.EDU
https://secure.engr.oregonstate.edu/mailman/listinfo/uai
Rich Neapolitan
Professor and Chair of Computer Science
Northeastern Illinois University
5500 N. St. Louis
Chicago, Il 60625

Rich Neapolitan
Professor and Chair of Computer Science
Northeastern Illinois University
5500 N. St. Louis
Chicago, Il 60625 
_______________________________________________
uai mailing list
uai@ENGR.ORST.EDU
https://secure.engr.oregonstate.edu/mailman/listinfo/uai

Reply via email to