Leiter Reports: A Philosophy Blog

News and views about philosophy, the academic profession, academic freedom, intellectual culture, and other topics. The world’s most popular philosophy blog, since 2003.


Good discussion of problems with the notion of “statistical significance” as deployed by social scientists?

I'm especially interested in discussions in the context of political science, but more general critical discussions (by philosophers or social scientists) are welcome. Links to discussions that are available online would also be useful. Thanks!


19 responses to “Good discussion of problems with the notion of “statistical significance” as deployed by social scientists?”

  1. Philippe Lemoine

    Raymond Nickerson, "Null Hypothesis Significance Testing: A Review of an Old and Continuing Controversy", Psychological Methods, 2000, Vol. 5, No. 2, pp. 241-301 is a very good overview of the debate about that issue.

  2. https://xkcd.com/882/ is scarcely sophisticated philosophical commentary, but gets the basic point across pretty effectively.
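The point of that comic (test enough hypotheses and one will "hit" at p < 0.05 by chance alone) can be shown with a short simulation. This is a generic illustrative sketch, not taken from any of the linked sources:

```python
import random

random.seed(0)

# Under a true null hypothesis, the p-value of a well-calibrated test is
# uniformly distributed on [0, 1].  So running 20 independent tests of
# true nulls (e.g. 20 jelly-bean colours) at alpha = 0.05 produces at
# least one "significant" result with probability 1 - 0.95**20, ~64%.

trials = 10_000
alpha = 0.05
n_tests = 20

hits = 0
for _ in range(trials):
    pvals = [random.random() for _ in range(n_tests)]  # 20 true nulls
    if any(p < alpha for p in pvals):
        hits += 1

print(f"Analytic:  {1 - (1 - alpha) ** n_tests:.3f}")
print(f"Simulated: {hits / trials:.3f}")
```

Both numbers come out near 0.64, which is why a single uncorrected "green jelly beans" result is weak evidence.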

  3. Matthew Rellihan

    There's a very accessible discussion of the issue in chapter nine of Jordan Ellenberg's book _How Not to Be Wrong_.

  4. There's a nice piece on Aeon: 'It's time for science to abandon the term "statistically significant"' (https://aeon.co/essays/it-s-time-for-science-to-abandon-the-term-statistically-significant).

  5. Colquhoun, the author of the Aeon piece, has a discussion of p values that I find more helpful than the Aeon piece: http://rsos.royalsocietypublishing.org/content/1/3/140216

  6. Pretty much all of Andrew Gelman's blog. A good recent synopsis of problems and solutions here: http://onlinelibrary.wiley.com/doi/10.1111/brv.12315/full

  7. Great piece. Colquhoun goes into greater detail in this video:

  8. The obvious go-to source here is Andrew Gelman's blog, andrewgelman.com, an ongoing and extremely sophisticated discussion of precisely these issues.
    Gelman is professor of statistics and political science at Columbia.

  9. There is a Stanford Encyclopedia of Philosophy article on the philosophy of statistics. The section on problems with classical statistics covers issues related to statistical significance, centered on p-values, though it's not specific to the social sciences.

    Minor point for anyone who goes there: the author talks several times about p-values (and thus statistical significance) as bearing on whether or not one accepts or rejects the null hypothesis. E.g. "After all, the test leads to the advice to either reject the hypothesis or accept it, and this seems conceptually very close to giving a verdict of truth or falsity."

    In statistics we NEVER accept a null hypothesis. We either reject it, or fail to reject it. And the decision should never be handed over to a p-value alone, if at all avoidable, as if something magical happens at (say) p = 0.05 that doesn't happen at p = 0.050000001.

    In fact, to the best of my recollection, in none of my statistics classes were we ever confronted with phrases like, 'So the results are statistically significant.' There are only ever pieces of evidence, of which p-values are but one, and people disagree about how strong they are, which is entirely context-dependent. (Would you get on a plane that, assuming everything was ship shape, still had a 5% chance of crashing?)
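The distinction between "fail to reject" and "accept" can be made concrete with a simulation. The sketch below (illustrative only; a z-test with known variance, not from any linked source) shows that when the null hypothesis is actually false, a small-sample test still fails to reject it most of the time, so a non-significant result is not evidence that the null is true:

```python
import math
import random

random.seed(1)

# The null (mean = 0) is FALSE: the true mean is 0.3, sigma = 1, n = 30.
# A two-sided z-test at alpha = 0.05 rejects when |z| > 1.96.
mu_true, sigma, n, trials = 0.3, 1.0, 30, 10_000
crit = 1.96

misses = 0
for _ in range(trials):
    xbar = sum(random.gauss(mu_true, sigma) for _ in range(n)) / n
    z = xbar * math.sqrt(n) / sigma
    if abs(z) < crit:          # p > 0.05: "not significant"
        misses += 1

print(f"True effect missed in {misses / trials:.0%} of samples")
```

With this effect size and sample size the test's power is only about 0.38, so roughly 62% of samples yield a "non-significant" result despite a real effect.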

  10. Many thanks for these excellent pointers.

  11. The American Statistical Association released a statement earlier this year:
    http://www.nature.com/news/statisticians-issue-warning-over-misuse-of-p-values-1.19503
    http://amstat.tandfonline.com/doi/pdf/10.1080/00031305.2016.1154108

    Other useful sources:

    "A Dirty Dozen: Twelve P-Value Misconceptions" http://www.perfendo.org/docs/BayesProbability/twelvePvaluemisconceptions.pdf

    "Why P Values Are Not a Useful Measure of Evidence in Statistical Significance Testing" http://tap.sagepub.com/content/18/1/69.abstract

  12. Philippe Lemoine

    Another thing I noticed is that, when people criticize the obsession of researchers with p-value (usually for good reasons), they often say that researchers should use confidence intervals. But, as this excellent paper shows, this can also lead to bad consequences: http://link.springer.com/article/10.3758/s13423-015-0947-8.
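What a 95% confidence interval does and does not guarantee can be checked directly. The sketch below (a known-sigma z-interval, illustrative only, not from the linked paper) confirms the long-run coverage property: about 95% of intervals computed this way contain the true mean, which is distinct from any single interval having a 95% probability of containing it:

```python
import math
import random

random.seed(2)

# Repeatedly sample, compute a 95% CI for the mean each time, and count
# how often the interval covers the true mean.
mu, sigma, n, trials = 10.0, 2.0, 25, 10_000
half = 1.96 * sigma / math.sqrt(n)   # half-width of known-sigma z interval

covered = sum(
    1
    for _ in range(trials)
    if abs(sum(random.gauss(mu, sigma) for _ in range(n)) / n - mu) < half
)
print(f"Coverage: {covered / trials:.1%}")   # close to 95%
```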

  13. Matías Vernengo

    Deirdre McCloskey is worth reading: http://www.deirdremccloskey.com/docs/jsm.pdf

  14. Statistician William Briggs has a long-running discussion of this issue, including a recent response to the ASA statement.
    http://wmbriggs.com/?s=statistical+significance

  15. Mostly on the bio/med sciences, but John Ioannidis's (Stanford Med) "Why Most Published Research Findings Are False" is a bracing commentary on these issues. (And open access.) Though, insofar as social sciences such as economics and political science are based on computer simulations of theoretical idealizations, p-values may not apply at all.

    http://journals.plos.org/plosmedicine/article?id=10.1371/journal.pmed.0020124

    From Ioannidis's abstract: "Simulations show that for most study designs and settings, it is more likely for a research claim to be false than true. Moreover, for many current scientific fields, claimed research findings may often be simply accurate measures of the prevailing bias. In this essay, I discuss the implications of these problems for the conduct and interpretation of research."
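The core of Ioannidis's argument is a Bayes calculation: the probability that a "significant" finding is true depends on the prior plausibility of the hypotheses being tested, not just on alpha. A minimal sketch of that calculation (simple Bayes in prior-probability terms; it ignores the bias and multiple-teams factors that Ioannidis also models):

```python
def ppv(prior, power=0.8, alpha=0.05):
    """Positive predictive value: probability that a statistically
    significant finding is true, given the prior probability that the
    tested hypothesis is true."""
    true_pos = power * prior        # true effects correctly detected
    false_pos = alpha * (1 - prior)  # true nulls falsely "detected"
    return true_pos / (true_pos + false_pos)

for prior in (0.5, 0.1, 0.01):
    print(f"prior {prior:>4}: PPV = {ppv(prior):.2f}")
```

With well-powered tests of plausible hypotheses (prior 0.5) the PPV is about 0.94, but in exploratory fields where only 1 in 100 tested hypotheses is true, most "significant" findings are false even before bias is factored in.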
