Oops, you're using an old version of your browser so some of the features on this page may not be displaying properly.

MINIMAL Requirements: Google Chrome 24+Mozilla Firefox 20+Internet Explorer 11Opera 15–18Apple Safari 7SeaMonkey 2.15-2.23

Cocktail and Poster Display session

3P - Evaluating conversational generative pre-trained transformer (ChatGPT) as a tool in early breast cancer (eBC) cases

Date

26 Feb 2024

Session

Cocktail and Poster Display session

Topics

Translational Research

Tumour Site

Breast Cancer

Presenters

Francesca Patanè

Citation

Annals of Oncology (2024) 9 (suppl_1): 1-2. 10.1016/esmoop/esmoop102255

Authors

F. Patanè1, F. Palumbo2, G. Lorenzini3, I. Bargagna4, P. Cinacchi5, I. Albanese6, D. Bilancio7, F. Pantaleo8, G. Acconci8, G. Bianchini9, E. Baldacci10, M. La Commare11, B. Fratini12, A. Fontana13

Author affiliations

  • 1 Medical Oncology Dept., AOU Pisana - Stabilimento di Santa Chiara, 56126 - Pisa/IT
  • 2 Engineering, Ict, Energy And Transport Technologies Department, Institute of Information Science and Technologies “A. Faedo”, National Research Council (ISTI-CNR), 56124 - Pisa/IT
  • 3 Dipartimento Di Oncologia Medica, Azienda Ospedaliera Universitaria S.Chiara, 56100 - Pisa/IT
  • 4 Dipartimento Di Oncologia Medica, AOU Pisana - Stabilimento di Santa Chiara, 56126 - Pisa/IT
  • 5 Oncology Dept., UniPi - Università di Pisa - Direzione Area di Medicina, 56126 - Pisa/IT
  • 6 Oncologist, AOU Pisana - Stabilimento di Santa Chiara, 56126 - Pisa/IT
  • 7 Oncology Department, AOU Pisana - Stabilimento di Santa Chiara, 56126 - Pisa/IT
  • 8 Oncology, Azienda Ospedaliero-Universitaria Pisana, 56124 - Pisa/IT
  • 9 Medical Oncology, Azienda Ospedaliero-Universitaria Pisana, 56124 - Pisa/IT
  • 10 Medical, AOU Pisana - Stabilimento di Santa Chiara, 56126 - Pisa/IT
  • 11 Oncologia Medica, AOU Pisana - Stabilimento di Santa Chiara, 56126 - Pisa/IT
  • 12 Oncologia Medica, UniPi - Università di Pisa - Direzione Area di Medicina, 56126 - Pisa/IT
  • 13 Oncology Dept., Azienda Ospedaliero Universitaria Pisana - Stabilimento di Santa Chiara, 56100 - Pisa/IT

Resources

This content is available to ESMO members and event participants.

Abstract 3P

Background

ChatGPT is a web interface chatbot based on a large language model with the aim to mimic human conversation tuned with machine learning and supervised techniques, that have gained scientific attention wondering if it can be a tool in medical decision.

Methods

We tasked ChatGPT 4.0 with creating a multidisciplinary team (MDT) chat and provided it with clinical data from patients (pts) diagnosed with hormone receptor (HR)-positive, human epidermal growth factor receptor 2-negative eBC with an intermediate clinico-pathological risk. These pts were candidates for the Oncotype DX® genomic test. Our goal was to compare our MDT recommendations with those generated by ChatGPT’s and assess the consistency of its responses.

Results

We gathered data from 100 consecutive pts: median age 57, evenly split between stages I and II, 35 premenopausal. By supplying clinical details (age, stage, menopausal status, HR expression, grading, ki67, comorbidity), we asked ChatGPT to assess the need for Oncotype DX®. Each case was presented 9 times in varied chats to test repeatability, yielding a modal vector with a mean variation ratio of 0.181. Only in 31 pts it always recommended a genomic test. Summarizing ChatGPT's most frequent advices for each patient, it recommended genomic test for 61 pts. Next, we provided Recurrence Scores of the 61 pts, asking for chemotherapy (CT) recommendations. The mean variation ratio in responses was 0.069. The Cohen's kappa coefficient for inter-rater agreement between ChatGPT's and actual CT recommendations was 0.62. ChatGPT did not consider clinical risk but only menopausal status for endocrine therapy: tamoxifen if premenopausal, aromatase inhibitor if postmenopausal. When asked for concurrent CT and genomic test advice, its responses were inconsistent, offering CT for almost all pts regardless of genomic testing recommendation.

Conclusions

ChatGPT is a generative model capable of producing data that attempts to capture the statistical distribution of its training dataset, but without reasoning abilities. Its low repeatability, along with suboptimal inter-rater agreement, mean it cannot yet replace an MDT. Effective clinical integration requires identifying areas where ChatGPT's knowledge is beneficial.

Clinical trial identification

Editorial acknowledgement

Legal entity responsible for the study

The authors.

Funding

Has not received any funding.

Disclosure

A. Fontana: Non-Financial Interests, Institutional, Invited Speaker: MSD. All other authors have declared no conflicts of interest.

This site uses cookies. Some of these cookies are essential, while others help us improve your experience by providing insights into how the site is being used.

For more detailed information on the cookies we use, please check our Privacy Policy.

Customise settings
  • Necessary cookies enable core functionality. The website cannot function properly without these cookies, and you can only disable them by changing your browser preferences.