Anglia Ruskin Research Online (ARRO)
PIIS0300957223008134.pdf (270.56 kB)

Testing ChatGPT ability to answer laypeople questions about cardiac arrest and cardiopulmonary resuscitation

Download (270.56 kB)
journal contribution
posted on 2024-06-06, 11:17 authored by Tommaso Scquizzato, Federico Semeraro, Paul Swindell, Rupert Simpson, Matteo Angelini, Arianna Gazzato, Uzma Sajjad, Elena G Bignami, Giovanni Landoni, Thomas R Keeble, Marco Mion

Introduction: Cardiac arrest leaves witnesses, survivors, and their relatives with a multitude of questions. When a young or a public figure is affected, interest around cardiac arrest and cardiopulmonary resuscitation (CPR) increases. ChatGPT allows everyone to obtain human-like responses on any topic. Due to the risks of accessing incorrect information, we assessed ChatGPT accuracy in answering laypeople questions about cardiac arrest and CPR. Methods: We co-produced a list of 40 questions with members of Sudden Cardiac Arrest UK covering all aspects of cardiac arrest and CPR. Answers provided by ChatGPT to each question were evaluated by professionals for their accuracy, by professionals and laypeople for their relevance, clarity, comprehensiveness, and overall value on a scale from 1 (poor) to 5 (excellent), and for readability. Results: ChatGPT answers received an overall positive evaluation (4.3 ± 0.7) by 14 professionals and 16 laypeople. Also, clarity (4.4 ± 0.6), relevance (4.3 ± 0.6), accuracy (4.0 ± 0.6), and comprehensiveness (4.2 ± 0.7) of answers was rated high. Professionals, however, rated overall value (4.0 ± 0.5 vs 4.6 ± 0.7; p = 0.02) and comprehensiveness (3.9 ± 0.6 vs 4.5 ± 0.7; p = 0.02) lower compared to laypeople. CPR-related answers consistently received a lower score across all parameters by professionals and laypeople. Readability was ‘difficult’ (median Flesch reading ease score of 34 [IQR 26–42]). Conclusions: ChatGPT provided largely accurate, relevant, and comprehensive answers to questions about cardiac arrest commonly asked by survivors, their relatives, and lay rescuers, except CPR-related answers that received the lowest scores. Large language model will play a significant role in the future and healthcare-related content generated should be monitored.



  • Yes



Page range


Publication title





Elsevier BV



File version

  • Published version


  • eng

Item sub-type

Journal Article

Media of output


Affiliated with

  • School of Medicine Outputs