A Machine Learning Model Can Predict Which Young Cannabis Users Will Develop Use Disorder
Using just five personality and behavioral factors — biological sex, delinquency, conscientiousness, neuroticism, and openness — a Bayesian machine learning model predicted cannabis use disorder within 5 years of first use with moderate accuracy.
Quick Facts
What This Study Found
The model achieved AUC values of 0.68 (training), 0.64, and 0.75 (two validation datasets) for predicting CUD within 5 years of first cannabis use. The five risk factors were biological sex, delinquency, and personality traits of conscientiousness, neuroticism, and openness. Calibration was excellent (E/O ratios of 0.95–1.0).
Key Numbers
5 risk factors. Training AUC: 0.68. Validation AUCs: 0.64 and 0.75. E/O ratios: 0.95, 0.98, and 1.0 (excellent calibration). Predicts CUD risk within 5 years of first cannabis use. Personalized absolute risk output.
How They Did This
Bayesian machine learning model trained on the National Longitudinal Study of Adolescent to Adult Health. Five-fold cross-validation assessed performance (AUC and E/O ratio). Independent validation on two external datasets. Model provides personalized absolute risk scores for individual patients.
Why This Research Matters
Currently, clinicians have no validated tool to identify which young cannabis users are most likely to develop use disorder. A simple 5-factor model could be integrated into routine clinical screening to enable early, targeted intervention before problems develop.
The Bigger Picture
Moving from population-level risk factors to personalized risk prediction represents a shift toward precision prevention in substance use. A brief personality and behavior assessment could identify the specific young cannabis users who would benefit most from intervention.
What This Study Doesn't Tell Us
Moderate AUC (0.64–0.75) means the model misses some who develop CUD and flags some who don't. Based on an older cohort — cannabis products and patterns have changed. Limited to 5 factors for simplicity, potentially missing important predictors. Requires first cannabis use as entry point.
Questions This Raises
- ?Would adding cannabis use patterns (frequency, product type) improve prediction?
- ?How would clinicians use personalized CUD risk scores in practice?
- ?Would this model perform differently in the current era of legal cannabis and high-potency products?
Trust & Context
- Key Stat:
- Evidence Grade:
- Well-validated machine learning model with independent external validation on two datasets, demonstrating good calibration and moderate discrimination.
- Study Age:
- Published 2025.
- Original Title:
- Absolute Risk Prediction for Cannabis Use Disorder in Adolescence and Early Adulthood Using Bayesian Machine Learning.
- Published In:
- Drug and alcohol review, 44(6), 1680-1690 (2025)
- Authors:
- Wang, Tingfang, Boden, Joseph M(4), Biswas, Swati, Choudhary, Pankaj K
- Database ID:
- RTHC-07917
Evidence Hierarchy
Frequently Asked Questions
Can you really predict who will become addicted to cannabis?
The model provides a probability estimate, not a certainty. With an AUC of 0.64–0.75, it correctly ranks risk better than chance but isn't perfect. It's designed as a screening aid, not a definitive diagnosis.
What personality traits increase CUD risk?
Higher neuroticism, lower conscientiousness, and higher openness were associated with increased CUD risk. These traits relate to emotional instability, impulsivity, and novelty-seeking — all established risk factors for substance use disorders.
Read More on RethinkTHC
- cannabis-dependence-physical-psychological-addiction-science
- cannabis-perception-vs-evidence-gap
- cannabis-use-disorder-test
- cross-addiction-quit-weed-start-drinking
- is-weed-addictive
- is-weed-addictive-science
- quitting-weed-and-alcohol
- rehab-for-weed-addiction-necessary
- signs-of-cannabis-use-disorder
- weed-vape-pen-addiction
Cite This Study
https://rethinkthc.com/research/RTHC-07917APA
Wang, Tingfang; Boden, Joseph M; Biswas, Swati; Choudhary, Pankaj K. (2025). Absolute Risk Prediction for Cannabis Use Disorder in Adolescence and Early Adulthood Using Bayesian Machine Learning.. Drug and alcohol review, 44(6), 1680-1690. https://doi.org/10.1111/dar.14098
MLA
Wang, Tingfang, et al. "Absolute Risk Prediction for Cannabis Use Disorder in Adolescence and Early Adulthood Using Bayesian Machine Learning.." Drug and alcohol review, 2025. https://doi.org/10.1111/dar.14098
RethinkTHC
RethinkTHC Research Database. "Absolute Risk Prediction for Cannabis Use Disorder in Adoles..." RTHC-07917. Retrieved from https://rethinkthc.com/research/wang-2025-absolute-risk-prediction-for
Access the Original Study
Study data sourced from PubMed, a service of the U.S. National Library of Medicine, National Institutes of Health.
This study breakdown was produced by the RethinkTHC research team. We analyze and report published research findings without making health recommendations. All interpretations are based solely on the published abstract and study data.