A Machine Learning Model Can Predict Which Young Cannabis Users Will Develop Use Disorder

Using just five personality and behavioral factors — biological sex, delinquency, conscientiousness, neuroticism, and openness — a Bayesian machine learning model predicted cannabis use disorder within 5 years of first use with moderate accuracy.

Wang, Tingfang et al.·Drug and alcohol review·2025·Strong Evidencelongitudinal
RTHC-07917LongitudinalStrong Evidence2025RETHINKTHC RESEARCH DATABASErethinkthc.com/research

Quick Facts

Study Type
longitudinal
Evidence
Strong Evidence
Sample
Not reported

What This Study Found

The model achieved AUC values of 0.68 (training), 0.64, and 0.75 (two validation datasets) for predicting CUD within 5 years of first cannabis use. The five risk factors were biological sex, delinquency, and personality traits of conscientiousness, neuroticism, and openness. Calibration was excellent (E/O ratios of 0.95–1.0).

Key Numbers

5 risk factors. Training AUC: 0.68. Validation AUCs: 0.64 and 0.75. E/O ratios: 0.95, 0.98, and 1.0 (excellent calibration). Predicts CUD risk within 5 years of first cannabis use. Personalized absolute risk output.

How They Did This

Bayesian machine learning model trained on the National Longitudinal Study of Adolescent to Adult Health. Five-fold cross-validation assessed performance (AUC and E/O ratio). Independent validation on two external datasets. Model provides personalized absolute risk scores for individual patients.

Why This Research Matters

Currently, clinicians have no validated tool to identify which young cannabis users are most likely to develop use disorder. A simple 5-factor model could be integrated into routine clinical screening to enable early, targeted intervention before problems develop.

The Bigger Picture

Moving from population-level risk factors to personalized risk prediction represents a shift toward precision prevention in substance use. A brief personality and behavior assessment could identify the specific young cannabis users who would benefit most from intervention.

What This Study Doesn't Tell Us

Moderate AUC (0.64–0.75) means the model misses some who develop CUD and flags some who don't. Based on an older cohort — cannabis products and patterns have changed. Limited to 5 factors for simplicity, potentially missing important predictors. Requires first cannabis use as entry point.

Questions This Raises

  • ?Would adding cannabis use patterns (frequency, product type) improve prediction?
  • ?How would clinicians use personalized CUD risk scores in practice?
  • ?Would this model perform differently in the current era of legal cannabis and high-potency products?

Trust & Context

Key Stat:
Evidence Grade:
Well-validated machine learning model with independent external validation on two datasets, demonstrating good calibration and moderate discrimination.
Study Age:
Published 2025.
Original Title:
Absolute Risk Prediction for Cannabis Use Disorder in Adolescence and Early Adulthood Using Bayesian Machine Learning.
Published In:
Drug and alcohol review, 44(6), 1680-1690 (2025)
Database ID:
RTHC-07917

Evidence Hierarchy

Meta-Analysis / Systematic Review
Randomized Controlled Trial
Cohort / Case-Control
Cross-Sectional / ObservationalSnapshot without intervening
This study
Case Report / Animal Study
What do these levels mean? →

Frequently Asked Questions

Can you really predict who will become addicted to cannabis?

The model provides a probability estimate, not a certainty. With an AUC of 0.64–0.75, it correctly ranks risk better than chance but isn't perfect. It's designed as a screening aid, not a definitive diagnosis.

What personality traits increase CUD risk?

Higher neuroticism, lower conscientiousness, and higher openness were associated with increased CUD risk. These traits relate to emotional instability, impulsivity, and novelty-seeking — all established risk factors for substance use disorders.

Read More on RethinkTHC

Cite This Study

RTHC-07917·https://rethinkthc.com/research/RTHC-07917

APA

Wang, Tingfang; Boden, Joseph M; Biswas, Swati; Choudhary, Pankaj K. (2025). Absolute Risk Prediction for Cannabis Use Disorder in Adolescence and Early Adulthood Using Bayesian Machine Learning.. Drug and alcohol review, 44(6), 1680-1690. https://doi.org/10.1111/dar.14098

MLA

Wang, Tingfang, et al. "Absolute Risk Prediction for Cannabis Use Disorder in Adolescence and Early Adulthood Using Bayesian Machine Learning.." Drug and alcohol review, 2025. https://doi.org/10.1111/dar.14098

RethinkTHC

RethinkTHC Research Database. "Absolute Risk Prediction for Cannabis Use Disorder in Adoles..." RTHC-07917. Retrieved from https://rethinkthc.com/research/wang-2025-absolute-risk-prediction-for

Access the Original Study

Study data sourced from PubMed, a service of the U.S. National Library of Medicine, National Institutes of Health.

This study breakdown was produced by the RethinkTHC research team. We analyze and report published research findings without making health recommendations. All interpretations are based solely on the published abstract and study data.