FreshRSS

🔒
☐ ☆ ✇ BMJ Open

Predictive machine-learning model for screening iron deficiency without anaemia: a retrospective cohort study

Por: Efros · O. · Soffer · S. · Mudrik · A. · Robinson · R. · Kenet · G. · Nadkarni · G. N. · Klang · E. — Agosto 13th 2025 at 05:11
Objectives

This study aimed to develop and validate a machine-learning (ML) model to predict iron deficiency without anaemia (IDWA) using routinely collected electronic health record (EHR) data. The primary hypothesis was that an ML model could achieve better accuracy in identifying low ferritin levels (

Design

A retrospective cohort study.

Setting

Data were derived from secondary and tertiary care facilities within the eight-hospital Mount Sinai Health System, an urban academic health system.

Participants

The study included 211 486 adult patients (aged ≥18 years) with normal haemoglobin levels (≥130 g/L for men and ≥120 g/L for women) and recorded ferritin measurements.

Primary and secondary outcome measures

The primary outcome was the prediction of low ferritin levels (

Data from 211 486 Mount Sinai Health System patients with normal haemoglobin levels and ferritin testing were analysed. The model used demographic data, blood count indices and chemistry results to identify low ferritin levels (

Results

Of the 211 486 patients analysed, 19.56% (n=41 368) of the patients had low ferritin levels. In the low ferritin group, the mean age was 41.28 years with 89.64% females. In contrast, the normal ferritin group had a mean age of 50.14 years with 62.02% females. The model achieved an area under the curve (AUC) of 0.814. At a sensitivity threshold of 70%, the model had a specificity of 75.85%, with a positive predictive value of 37.6% and a negative predictive value of 92.41%. The model outperformed an alternative model based only on complete blood count indices (AUC 0.814 vs 0.741). Subgroup analysis showed that model accuracy varied by sex and age, with lower performance in premenopausal women (AUC 0.736) compared with postmenopausal women (AUC 0.793) and men (AUC of 0.832 in those under 60 years and 0.806 in those aged 60 and above).

Conclusions

The ML model provides an effective approach to screening for IDWA using readily available EHR data. Implementing this tool in clinical settings may facilitate early diagnosis of IDWA.

☐ ☆ ✇ Journal of Clinical Nursing

Evaluating Large Language Model‐Assisted Emergency Triage: A Comparison of Acuity Assessments by GPT‐4 and Medical Experts

Por: Gal Ben Haim · Mor Saban · Yiftach Barash · David Cirulnik · Amit Shaham · Ben Zion Eisenman · Livnat Burshtein · Orly Mymon · Eyal Klang — Noviembre 29th 2024 at 06:30

ABSTRACT

Aim

To evaluate the accuracy of the Emergency Severity Index (ESI) assignments by GPT-4, a large language model (LLM), compared to senior emergency department (ED) nurses and physicians.

Method

An observational study of 100 consecutive adult ED patients was conducted. ESI scores assigned by GPT-4, triage nurses, and by a senior clinician. Both model and human experts were provided the same patient data.

Results

GPT-4 assigned a lower median ESI score (2.0) compared to human evaluators (median 3.0; p < 0.001), suggesting a potential overestimation of patient severity by the LLM. The results showed differences in the triage assessment approaches between GPT-4 and the human evaluators, including variations in how patient age and vital signs were considered in the ESI assignments.

Conclusion

While GPT-4 offers a novel methodology for patient triage, its propensity to overestimate patient severity highlights the necessity for further development and calibration of LLM tools in clinical environments. The findings underscore the potential and limitations of LLM in clinical decision-making, advocating for cautious integration of LLMs in healthcare settings.

Reporting Method

This study adhered to relevant EQUATOR guidelines for reporting observational studies.

❌