Abstract 1171MO
Background
Detecting lung cancer in its early stages poses a significant challenge due to the lack of specific diagnostic markers. The digitisation of healthcare records has generated vast amounts of untapped data, presenting opportunities for improvements in early diagnoses. This study presents a novel approach to mine and extract data to automatically form diagnostic criteria for lung cancer from primary care appointment data.
Methods
Patients aged 40 years or older, referred for a chest x-ray between 2016 and 2022, were included. Lung cancer status was determined using the primary and secondary care electronic health records, and the Somerset Cancer Registry. Patients with pre-existing cancer diagnoses were excluded. Coded diagnoses and symptoms were extracted from primary care records to establish diagnostic criteria. A custom feature selection algorithm was used to identify the most relevant historical diagnoses and symptoms, whilst minimising redundancy among predictors. The selected features were used to train a Random Forest classifier using 75% of the training data, validating on the remaining 25%. The resulting model was applied longitudinally to a test set of patients to determine the first time a patient is flagged for screening.
Results
The diagnostic model was trained on data from 78,457 patients, using age, sex, ethnicity, and the presence of any of the following predictors, within 6 months of the appointment: bronchiolar disease, chronic lung disease, dyspnoea, ex-smoker, cough and altered bowel function. An AUROC of 0.80 was achieved, with a sensitivity and specificity of 87% and 57%, respectively. When longitudinally applied to the test set, the model would have detected 77% of lung cancer patients, offering a median “advanced notice” of 36 months before diagnosis (IQR = 19 - 52 months).
Conclusions
This study demonstrated success in leveraging primary care data to establish diagnostic criteria for lung cancer. The performance of the model to identify lung cancer years earlier requires further validation, but such an approach demonstrates promise for enhancing early detection. Future efforts will concentrate on prospectively evaluating the developed criteria and determining their health and economic value.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
Barts Health NHS Trust.
Funding
Has not received any funding.
Disclosure
All authors have declared no conflicts of interest.
Resources from the same session
69MO - Organ-specific performance of a multi-analyte, multi-cancer early detection (MCED) blood test in a prospectively-collected cohort
Presenter: Christopher Douville
Session: Mini oral session: Basic science & Translational research
Resources:
Abstract
Slides
Webcast
70MO - Cell-free DNA indicates potential preclinical detectability of cancer signals up to 30 months prior to diagnosis
Presenter: Celine Vachon
Session: Mini oral session: Basic science & Translational research
Resources:
Abstract
Webcast
Invited Discussant 69MO, 70MO and 1171MO
Presenter: Angela George
Session: Mini oral session: Basic science & Translational research
Resources:
Slides
Webcast
71MO - Clinical and biomarker analyses of intratumoral CD40 agonist sotigalimab in combination with pembrolizumab in metastatic melanoma: A phase I/II trial
Presenter: Adi Diab
Session: Mini oral session: Basic science & Translational research
Resources:
Abstract
Webcast
72MO - Concurrent BRAF targeted therapy (TT) with dabrafenib and trametinib and anti-PD-1 agent pembrolizumab (PD1) increased B cell signalling and inflammatory pathways more effectively than when given sequentially or with anti-PD-1 alone
Presenter: Jorja Braden
Session: Mini oral session: Basic science & Translational research
Resources:
Abstract
Slides
Webcast
73MO - Longitudinal probing of combinatorial immunotherapy (anti-VEGF/anti-PD1) response in advanced nasopharyngeal carcinoma, at single cell resolution
Presenter: Shu-Mei Chia
Session: Mini oral session: Basic science & Translational research
Resources:
Abstract
Webcast
Invited Discussant 71MO, 72MO and 73MO
Presenter: Göran Jönsson
Session: Mini oral session: Basic science & Translational research
Resources:
Slides
Webcast
74MO - Impact of MTAP loss on prognosis and efficacy of immune check point inhibitors across advanced solid tumors: SCRUM-Japan MONSTAR-SCREEN-1
Presenter: Kyosuke Seguchi
Session: Mini oral session: Basic science & Translational research
Resources:
Abstract
Slides
Webcast
75MO - Atlas of tertiary lymphoid structures in solid tumors: Genomic features and prediction of response to immunotherapy
Presenter: Antoine Italiano
Session: Mini oral session: Basic science & Translational research
Resources:
Abstract
Slides
Webcast
Invited Discussant 74MO and 75MO
Presenter: Daniela Thommen
Session: Mini oral session: Basic science & Translational research
Resources:
Slides
Webcast