Abstract 2162P
Background
There is limited data on the quality of cancer information provided by ChatGPT and other artificial intelligence systems. In this study, we aimed to compare the accuracy of information about cancer pain provided by chatbots (chatGPT, perplexity, and chatsonic) based on the questions and answers contained in the the European Medical Oncology Association (ESMO) Patient Guide about cancer pain.
Methods
Twenty questions were selected from the questions available in the ESMO Patient Guide Cancer Pain. Medical oncologists with more than 10 years of experience compared responses from chatbots (ChatGPT, Perplexity, and Chatsonic) with the ESMO patient guide. The primary evaluation criteria for the quality of the responses were accuracy, patient readability, and stability of response. The accuracy of responses was evaluated using a three-point scale: 1 for accuracy, 2 for a mixture of accurate and incorrect or outdated data, and 3 for wholly inaccurate. The Flesch-Kincaid readability (FKr) grade was used to measure readability. Stability of responses was evaluated whether the model’s accuracy is consistent across different answers to the same question.
Results
Chatbots were more difficult to read than the ESMO patient guideline (FKr= 9.6 vs. 12.8, p= 0.072). Among the chatbots, perplexity had the easiest readability (FKr= 11.2). In the accuracy evaluation, the percentage of overall agreement for accuracy was 100% for ESMO answers and 96% for ChatGPT outputs for questions (k= 0.03, standard error= 0.08). Among the chatbots, the most accurate information was obtained with chatGPT.
Table: 2162P
Comparison of ESMO and chatbots in terms of readibility and accuracy
ESMO | Chatbots | ||||
ChatGPT | Perplexity | Chatsonic | p-value | ||
Readibility (FKr grade) | 9.6 Easily understood | 13.4 Difficult to read | 11.2 Fairly difficult to read | 13.9 Difficult to read | 0.072 |
Accuracy | %100 | %96 | %86 | % 90 | 0.037 |
Conclusions
The results suggest that ChatGPT provides more accurate information about cancer pain compared with other chatbots.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
Gazi University Ethic Commitee.
Funding
Has not received any funding.
Disclosure
All authors have declared no conflicts of interest.
Resources from the same session
2130P - ATTITUDE - ATTrition In longiTUDinal studiEs of cancer survivors (CS): Can we improve the experience of patients (pts)?
Presenter: Camila Chiodi
Session: Poster session 07
2131P - Real-life experiences from a late effects clinic: An investigation of health-related quality of life in Danish cancer survivors
Presenter: Lærke Tolstrup
Session: Poster session 07
2132P - Unmet needs, quality of life, and financial toxicity in survivors of lung cancer
Presenter: Josephine Feliciano
Session: Poster session 07
2133P - Sleep disorders: Evolution in time in early breast cancer (EBC)
Presenter: Blanca Cantos
Session: Poster session 07
2134P - Self-reported late effects, need for information and follow-up in long-term Hodgkin lymphoma survivors
Presenter: Lise Willumsen
Session: Poster session 07
2135P - Impact of geriatric assessment (GA) and geriatric 8(G8)-based targeted interventions on the quality of life (QoL) in older Asian adults with cancer
Presenter: Jia Li Low
Session: Poster session 07
2136P - Prostate cancer supportive care (PCSC) program for patients and partners: A model for meeting an unmet need for PC patients
Presenter: Celestia Higano
Session: Poster session 07
2137P - Impact of supportive care on the quality of life (QoL) of hospitalized cancer patients (pts)
Presenter: Judit Sanz Beltran
Session: Poster session 07
2138P - Time toxicity of palliative chemotherapy in a geriatric oncology population
Presenter: Christopher Cronin
Session: Poster session 07
2139P - Quality of life in adult cancer survivors (QLACS) in Spain: Study of its clinical characteristics and use of social media for oncological information
Presenter: Maria Cornide Santos
Session: Poster session 07