
Effect Sizes and P Values: What Should Be Reported and What Should Be Replicated?

Overview
Specialty Psychiatry
Date 1996 Mar 1
PMID 8851245
Citations 67
Abstract

Despite publication of many well-argued critiques of null hypothesis testing (NHT), behavioral science researchers continue to rely heavily on this set of practices. Although we agree with most critics' catalogs of NHT's flaws, this article also takes the unusual stance of identifying virtues that may explain why NHT continues to be so extensively used. These virtues include providing results in the form of a dichotomous (yes/no) hypothesis evaluation and providing an index (p value) that has a justifiable mapping onto confidence in repeatability of a null hypothesis rejection. The most-criticized flaws of NHT can be avoided when the importance of a hypothesis, rather than the p value of its test, is used to determine that a finding is worthy of report, and when p approximately equal to .05 is treated as insufficient basis for confidence in the replicability of an isolated non-null finding. Together with many recent critics of NHT, we also urge reporting of important hypothesis tests in enough descriptive detail to permit secondary uses such as meta-analysis.
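The abstract's claim that a p value maps onto confidence in replicating a null hypothesis rejection can be illustrated with a rough calculation. The sketch below is not taken from the article: it assumes a two-sided z test, an exact replication with the same sample size, and a true effect exactly equal to the originally observed one. The function name `replication_probability` is hypothetical.

```python
from statistics import NormalDist


def replication_probability(p_observed: float, alpha: float = 0.05) -> float:
    """Estimate the chance that an exact replication rejects the null
    in the same direction as the original study.

    Assumptions (illustrative, not from the article): two-sided z test,
    identical sample size, and true effect equal to the observed effect.
    """
    std_normal = NormalDist()  # mean 0, standard deviation 1
    # z statistic implied by the original two-sided p value
    z_observed = std_normal.inv_cdf(1 - p_observed / 2)
    # critical z for the replication's two-sided test at level alpha
    z_critical = std_normal.inv_cdf(1 - alpha / 2)
    # The replication's z is distributed as Normal(z_observed, 1) under
    # these assumptions, so P(z > z_critical) = Phi(z_observed - z_critical).
    return std_normal.cdf(z_observed - z_critical)


if __name__ == "__main__":
    for p in (0.05, 0.01, 0.005, 0.001):
        print(f"p = {p}: replication probability ~ {replication_probability(p):.2f}")
```

Under these assumptions, an original result at p = .05 gives a replication probability of about .50, which is consistent with the abstract's point that p approximately equal to .05 is an insufficient basis for confidence in replicability. Smaller original p values give higher estimates (about .80 at p = .005).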

Citing Articles

Cortico-striatal action control inherent of opponent cognitive-motivational styles.

Avila C, Sarter M Elife. 2025; 13.

PMID: 39968969 PMC: 11839163. DOI: 10.7554/eLife.100988.


Polarization and reflectance are linked to climate, size and mechanistic constraints in a group of scarab beetles.

Ospina-Rozo L, Medina I, Hugall A, Rankin K, Roberts N, Roberts A Sci Rep. 2024; 14(1):29349.

PMID: 39592655 PMC: 11599573. DOI: 10.1038/s41598-024-80325-1.


Reinterpretation of the results of randomized clinical trials.

Habibzadeh F PLoS One. 2024; 19(6):e0305575.

PMID: 38875254 PMC: 11178203. DOI: 10.1371/journal.pone.0305575.


Cortico-striatal action control inherent of opponent cognitive-motivational styles.

Avila C, Sarter M bioRxiv. 2024.

PMID: 38559086 PMC: 10979997. DOI: 10.1101/2024.03.12.584623.


On the use of receiver operating characteristic curve analysis to determine the most appropriate p value significance threshold.

Habibzadeh F J Transl Med. 2024; 22(1):16.

PMID: 38178182 PMC: 10765856. DOI: 10.1186/s12967-023-04827-8.