On 01/07/2011 06:13 AM, Spencer Graves wrote:
> A more insidious problem, that may not affect the work of Jonah Lehrer, is political corruption in the way research is funded, with less public and more private funding of research
Maybe I'm too pessimistic, but the term _political_ corruption reminds me that I can just as easily imagine a "funding bias"* in public funding. And I'm not sure it is (or would be) less of a problem just because the interests of private funding are easier to spot.

* I think of bias on both sides: the funding agency selecting the studies to support, and the researcher subconsciously complying with the expectations of the funding agency.

On 01/07/2011 08:06 AM, Peter Langfelder wrote:
> From a purely statistical and maybe somewhat naive point of view,
> published p-values should be corrected for the multiple testing that
> is effectively happening because of the large number of published
> studies. My experience is also that people will often try several
> statistical methods to get the most significant p-value but neglect to
> share that fact with the audience and/or at least attempt to correct
> the p-values for the selection bias.
Even if the number of all the tests were known, I have the impression that the corrected p-value would be kind of the right answer to the wrong question. I'm not particularly interested in the probability of arriving at the presented findings if the null hypothesis were true; I'd rather know the probability that the conclusions are true. Switching to the language of clinical chemistry: I'm presented with the sensitivity of a test, but what I really want to know is the positive predictive value. What is still missing, even with corrected p-values, is the "prevalence of good ideas" of the publishing scientist (which is not even known for all scientists). And I'm not sure this prevalence doesn't decrease as the scientist generates and tests more and more ideas. I found my rather hazy thoughts about this much better expressed in the books of Beck-Bornholdt and Dubben (which I'm afraid are available only in German).
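To put rough numbers on the sensitivity/PPV analogy, here is a back-of-the-envelope sketch in R. All values are made up purely for illustration; "prevalence" stands for my hypothetical prevalence of good ideas among the tested hypotheses, not anything measured:

prevalence  <- 0.1    # assumed: 1 in 10 tested hypotheses is actually true
sensitivity <- 0.8    # assumed power: P(significant result | hypothesis true)
alpha       <- 0.05   # significance level: P(significant result | hypothesis false)

## positive predictive value: P(hypothesis true | significant result)
ppv <- sensitivity * prevalence /
       (sensitivity * prevalence + alpha * (1 - prevalence))
ppv   # 0.64 with these made-up numbers: about a third of the
      # "significant" findings would still be wrong

The lower the prevalence of good ideas, the faster the PPV drops - and that is exactly the quantity a corrected p-value alone does not tell me.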

Conclusion: try to be/become a good scientist, with a high prevalence of good ideas - or at least with a high prevalence of good ideas among the tested hypotheses. That includes thinking first about which hypotheses are the ones worth testing, and not giving in to the temptation to try out more and more things as one gets more familiar with the experiment/data set/problem. The latter I find very difficult. I once gave a presentation where I explicitly talked about why I did not do any data-driven optimization of my models; yet in the discussion I was very prominently told that I should in addition try these other pre-processing techniques and these other modeling techniques - even by people whom I know to be very much aware of and concerned about optimistically biased validation results. These were of course very valid questions (and easy to comply with), but I conclude it is common/natural/human to have more and more ideas and to want to try them out. Also, after several years in the field and with the same kind of samples, I of course run the risk of my ideas being overfit to our kind of samples - a cost I have to pay for the gain due to experience/expertise.
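As a toy illustration of the "try several methods and report the best p-value" temptation (a made-up simulation, not anyone's real data): even on pure noise, keeping only the most significant of a handful of analyses pushes the false-positive rate above the nominal 5 %.

set.seed(2)
best.p <- replicate(2000, {
  x <- rnorm(20)    # pure noise: the null hypothesis is true
  y <- rnorm(20)
  p <- c(t.test(x, y)$p.value,       # first analysis
         wilcox.test(x, y)$p.value,  # "let's also try a rank-based test"
         ks.test(x, y)$p.value)      # "... and compare the distributions"
  min(p)            # report only the most significant result
})
mean(best.p < 0.05)  # noticeably above the nominal 0.05

Nothing in that simulation is fraud - each individual test is perfectly legitimate - which is, I think, exactly why the temptation is so human.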

Some more thoughts:
- reproducibility: I'm an analytical chemist. We put huge amounts of work into round-robin trials in order to measure the "natural" variability between different labs on very well-defined systems. We also put huge amounts of work into calibration transfer, i.e. making quantitative predictive models work on a different instrument. This is always a whole lot of work, and for some classes of problems it is at the moment considered basically impossible even between two instruments of the same model and manufacturer.
The quoted results on the mice are not very astonishing to me... ;-)

- Talking about (not so) astonishing differences between replications of experiments: I find myself moving from reporting ± 1 standard deviation to reporting e.g. the 5th to 95th percentile. Not only because my data distributions are often not symmetric, but also because I find I'm not able to directly perceive the real spread of the data from a standard-deviation error bar. This is all about perception; of course I can reflect on what the error bar means. Such a reflection also tells me that one student having a really unlikely number of right guesses is unlikely but not impossible: there is no statistical law stating that unlikely events happen only with large sample sizes/numbers of tests. Yet the immediate perception is completely different.
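A small sketch of what I mean, with simulated skewed data (standing in for my real measurements, which these are not):

set.seed(1)
x <- rlnorm(100)   # log-normal, i.e. skewed, strictly positive "measurements"

mean(x) + c(-1, 1) * sd(x)          # mean +- 1 sd: for skewed data this interval
                                    # can be misleading, even extending below zero
quantile(x, probs = c(0.05, 0.95))  # 5th to 95th percentile: the spread that is
                                    # actually observed

The same quantiles are then what I would put into the error bars of a plot.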

- I happily agree with the idea of publishing findings (conclusions) together with the data and the data-analysis code used to arrive at them. But I'm aware that part of this agreement is due to the fact that I'm quite interested in the data-analytical methods themselves (as well as in the particular chemical-analytical problem at hand, I'd say, but rather more so than my purely experimental colleagues). This means that, psychologically, I'm happy enough with my work if I can introduce a new method (or variant) even if the results for the chemical-analytical problem aren't that good, neither for the standard method nor for the variant. I remember a discussion between someone from a data-analysis/method-developing group and an experimental scientist. The "methods guy" was complaining that the experimental people are so reluctant to make their data public; for him the data sets were a prerequisite for testing his data-analysis ideas. For the experimental guy they were the _product_ of long work, which he felt was not properly appreciated by the methods people.

On a more "practical" level, publishing code and data implies additional documentation work. A large part of my documentation and meta-information is hand-written in lab books (particularly for the experimental data) and/or written in German. For me to take the effort of converting this to electronic formats and into English, that effort must be appropriately rewarded.

Ravi, I also agree with your point that we should make better use of our experimental data. However, in practice this is very difficult in my field (vibrational spectroscopy for medical diagnosis). We have the calibration-transfer problems mentioned above, and we do not yet have standardized sample-treatment protocols either. This makes it difficult to combine different series of measurements. As long as I'm not talking about measurements of reference substances, the chances that someone else will be able to make much use of my data are therefore rather low.

For the moment, these points taken together mean that I'd happily share my data (and code) if asked, but not without being asked. And, considering the current publication rules in my field, I first want to play a bit more with my data myself before I leave it to everyone else.

My 2 ct,

Claudia

