The legacy of social psychology

To anyone teaching psychology.

In this post I express some concerns about the prestige given to ‘classic’ studies, which are widely taught in undergraduate social psychology courses around the world. I argue that rather than just demonstrating a bunch of clever but dodgy experiments, we could teach undergraduates to evaluate studies for themselves. To exemplify this, I briefly demonstrate statistical power, Bayes factors, the p-checker app, and the GRIM test.

psychology’s foundations are built not of theory but with the rock of classic experiments

Christian Jarrett

Here is an out-of-context quote from Sanjay Srivastava from a while back:

[Image: quote from Sanjay Srivastava]

This got me thinking about why and how we teach classic studies.

Psychologists usually lack the luxury of well-behaved theories. Some have thus proposed that the classic experiments, which have survived in the literature to the present day, serve as the bedrock of our knowledge [1]. In the introduction to a book retelling the stories of classic studies in social psychology [2], the authors note that classic studies have “played an important role in setting the research agenda for the field as it has progressed over time” and “serve as common points of reference for researchers, teachers and students alike”. The authors continue by pointing out that many of these classics lacked methodological sophistication, but that this is in fact a feature of their enduring appeal, as laypeople can understand the “points” the studies make. Exposing the classics to modern statistical methods would thus, on this view, miss their point.

Now, this makes me wonder: if the point of a study is not to assess the existence of a phenomenon, what in the world might it be? One answer would be that they serve as historical examples of practices no longer considered scientific, but I doubt this is what’s normally meant. Notwithstanding, I wanted to dip into the “foundations” of our knowledge by demonstrating the use of some more-or-less recently developed tools on a widely known article. According to Google Scholar, the Festinger and Carlsmith cognitive dissonance experiment [3] has been cited over three thousand times, so its influence is hard to downplay.

[Image: neyman.PNG]

But first, a necessary digression: statistical power is the probability of detecting a “significant” effect, given that a true effect of the postulated size exists. As explained in Brunner & Schimmack [4], it is an interesting anomaly that the statistical power of studies in psychology is usually small, yet almost all of them end up reporting “significant” results. As to how small: average power doubtfully exceeds 50% [5–7], and for small (conventional?) effect sizes, the mean has been estimated to be as low as 24%. As a recent replication project on the ego depletion effect [8] exemplified, a highly “replicable” (as judged by the published record) phenomenon may turn out to be a fluke once null findings are taken into account. This has recently made psychologists consider the uncomfortable possibility that entire research lines consisting of “accumulated scientific evidence” may in fact not contain that much evidence [9,10].

So, what was the statistical power of Festinger and Carlsmith’s study? Using G*Power [11], it turns out that they had roughly an 80% chance of detecting a humongous effect of d = 0.9, and only a coin flip’s chance of finding a (still large) effect of d = 0.64. And when an underpowered study does find an effect, under current publication practices the estimate is likely to be exaggerated, and may even be of the wrong sign [12]. This would be a nice opportunity to demonstrate these concepts to students.
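The same numbers are easy to reproduce outside G*Power too; here is a minimal sketch in Python using statsmodels (my choice of library, not something the original analysis used):

```python
from statsmodels.stats.power import TTestIndPower

# Post-hoc power for a two-sided, two-sample t-test with
# n = 20 per condition (as in Festinger & Carlsmith), alpha = .05.
analysis = TTestIndPower()
for d in (0.64, 0.9):
    power = analysis.power(effect_size=d, nobs1=20, ratio=1.0, alpha=0.05)
    print(f"d = {d:.2f}: power = {power:.2f}")
# d = 0.64: power = 0.50  (the coin flip)
# d = 0.90: power ≈ 0.79  (the ~80% chance above)
```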

Considering the low power, it may not come as a surprise that the evidence the study provided was weak to begin with. A Bayes factor (BF) quantifies the evidence for one hypothesis relative to another. In this case, a BF of ~3 moves an impartial observer from being 50% sure the effect is real to being 75% sure, or a skeptic from being 25% sure to being 50% sure that the effect is small rather than nil.
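The arithmetic behind those updates is just Bayes’ rule in odds form (posterior odds = BF × prior odds), which makes for a one-function classroom demo:

```python
def update(prior_prob: float, bf10: float) -> float:
    """Posterior probability of H1, given a prior probability and BF10."""
    prior_odds = prior_prob / (1 - prior_prob)
    posterior_odds = bf10 * prior_odds  # posterior odds = BF x prior odds
    return posterior_odds / (1 + posterior_odds)

print(update(0.50, 3))  # impartial observer: 0.50 -> 0.75
print(update(0.25, 3))  # skeptic: 0.25 -> 0.50
```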

It would be relatively simple to introduce Bayes factors with this study. The choice of prior scale does not matter much here, at least among reasonable options, as exemplified by a plot made in JASP with two clicks:

Figure 1: Bayes factor robustness check for the main finding of the dissonance study. Plotted in JASP 0.8.0.0, using n = 20 for both groups, a t-value of 2.48, and a Cauchy prior scale of 0.4.
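If you want the same robustness check outside JASP, something like the following sketch works, assuming the pingouin package’s JZS Bayes factor implementation (again my choice of tool, not the post’s):

```python
import numpy as np
import pingouin as pg

# JZS Bayes factors for t = 2.48 with n = 20 per group, across a
# range of Cauchy prior scales, mirroring JASP's robustness plot.
for r in np.arange(0.2, 1.01, 0.2):
    bf10 = pg.bayesfactor_ttest(t=2.48, nx=20, ny=20, r=r)
    print(f"prior scale r = {r:.1f}: BF10 = {float(bf10):.2f}")
```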

Nowadays it is easy to check whether a paper correctly reports its test statistics and their associated p-values. The p-checker app (this link feeds the relevant statistics to the app) can do this, and it turns out that most of the t-values in the paper are incorrectly rounded down (assuming that “significant at the 0.08 level” means p < 0.08). You can demonstrate this by including the link on your slides, using it to go to p-checker, and choosing “p-values correct?”.
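The underlying check is also trivial to reproduce by hand: recompute the p-value from a reported test statistic and its degrees of freedom, then compare it to the reported one. A sketch using the paper’s headline comparison (t = 2.48 on 38 df, from two groups of 20):

```python
from scipy import stats

# Recompute a two-tailed p-value from a reported t statistic.
t_reported, df = 2.48, 38
p = 2 * stats.t.sf(abs(t_reported), df)
print(f"t({df}) = {t_reported}: two-tailed p = {p:.4f}")  # ~0.018
```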

Finally, you can look at the study using the GRIM test [13], which evaluates whether reported means are mathematically possible given the sample size and the granularity of the measurements. As it turns out, a quarter of the reported means in the table with the main results do not pass the test. One more time: 25% of the reported means are mathematically impossible. The most likely explanation is shoddy reporting of means or accidental misreporting of sample sizes, but I find it telling that, to my knowledge at least, the issue has not come up in over fifty years of scientific scrutiny.

Figure 2: Main results table of the Festinger & Carlsmith study. Circled means are mathematically impossible given the reported sample sizes.
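The test itself fits on a single slide. A minimal sketch, using illustrative values rather than the paper’s actual cell means (only the n = 20 per condition is from the study):

```python
def grim_consistent(reported_mean: float, n: int, decimals: int = 2) -> bool:
    """Can `reported_mean` arise as the mean of n integer-valued responses?

    The mean of n integers must equal (some integer total) / n, so take
    the integer total closest to the reported mean and check whether it
    reproduces the reported value at the reporting precision.
    """
    closest_total = round(reported_mean * n)
    return round(closest_total / n, decimals) == round(reported_mean, decimals)

# Hypothetical example values with n = 20 per condition:
print(grim_consistent(1.35, 20))  # True:  27 / 20 = 1.35
print(grim_consistent(2.33, 20))  # False: no integer total / 20 yields 2.33
```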

Now, even though I have doubts about this study, as well as about the process by which the theory has “evolved” [14], this does not mean that cognitive dissonance effects do not exist. It is just that the research may not have been able to capture the essence of this everyday phenomenon (which, if it exists, can influence behaviour without the help of academics). Under the traditional paradigm of psychological science, fraught with publication bias and unhelpful incentives [10], a Registered Replication Report (RRR)-type effort would be needed, and even that could test only one operationalisation. As an undergraduate, I would have been exhilarated to hear early on about how and why such initiatives work, and why the curatescience.org approach is much more informative than any single experiment.

Returning to the notion that psychology’s bedrock consists of classic experiments rather than theories, as in the natural sciences [1]: perhaps we need a more solid foundation, regardless of whether some flashy findings from decades ago happened to spur a progressive-ish [15,16] line of research.

How would such a foundation come to be? Maybe teaching could play a role?

Bibliography

  1. Jarrett, C. Foundations of sand? The Psychologist 21, 756–759 (2008).
  2. Smith, J. R. & Haslam, S. A. Social psychology: Revisiting the classic studies. (SAGE Publications, 2012).
  3. Festinger, L. & Carlsmith, J. M. Cognitive consequences of forced compliance. The Journal of Abnormal and Social Psychology 58, 203–210 (1959).
  4. Brunner, J. & Schimmack, U. How replicable is psychology? A comparison of four methods of estimating replicability on the basis of test statistics in original studies. (2016).
  5. Button, K. S. et al. Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci 14, 365–376 (2013).
  6. Cohen, J. Things I have learned (so far). American Psychologist 45, 1304–1312 (1990).
  7. Sedlmeier, P. & Gigerenzer, G. Do studies of statistical power have an effect on the power of studies? Psychological Bulletin 105, 309–316 (1989).
  8. Hagger, M. S. et al. A multi-lab pre-registered replication of the ego-depletion effect. Perspectives on Psychological Science (2016).
  9. Earp, B. D. & Trafimow, D. Replication, falsification, and the crisis of confidence in social psychology. Front. Psychol 6, 621 (2015).
  10. Smaldino, P. E. & McElreath, R. The Natural Selection of Bad Science. arXiv preprint arXiv:1605.09511 (2016).
  11. Faul, F., Erdfelder, E., Lang, A.-G. & Buchner, A. G*Power 3: a flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav Res Methods 39, 175–191 (2007).
  12. Gelman, A. & Carlin, J. Beyond power calculations: Assessing Type S (sign) and Type M (magnitude) errors. Perspectives on Psychological Science 9, 641–651 (2014).
  13. Brown, N. J. L. & Heathers, J. A. J. The GRIM Test: A Simple Technique Detects Numerous Anomalies in the Reporting of Results in Psychology. Social Psychological and Personality Science (2016). doi:10.1177/1948550616673876
  14. Aronson, E. in The science of social influence: Advances and future progress (ed. Pratkanis, A. R.) 17–82 (Psychology Press, 2007).
  15. Lakatos, I. History of science and its rational reconstructions. (Springer, 1971).
  16. Meehl, P. E. Appraising and amending theories: The strategy of Lakatosian defense and two principles that warrant it. Psychological Inquiry 1, 108–141 (1990).

 

22 thoughts on “The legacy of social psychology”

  1. See also Abelson’s “Statistics as Principled Argument”, pp. 35-36, 98-100, 162-163, and 191-193.


  2. “The most likely explanation for this is shoddy reporting of means or accidental misreporting of sample sizes…”

    You’ve overlooked one other possibility…faulty data input.

    Assuming a correctly reported sample size, and deriving the sum of the ratings for the suspect averages, the sums include single decimals that, as whole numbers, are within their respective rating scales. This obviously isn’t possible for whole-number ratings.

    It’s possible that data were incorrectly entered as a decimal value instead of a whole number. If so, the effect would be that the reported averages are all lower than actual. I don’t know if that changes the statistical analysis or not.

    Given it was 1959, and things weren’t computerized, I find it odd that no one caught a decimal value in a sum of whole numbers…3 times.

