Abstract 934P
Background
Almost half of patients with early/locally advanced (LA) head and neck squamous cell carcinoma (HNSCC) progress or relapse within 2 years. We applied artificial intelligence (AI) techniques to extract clinical variables from electronic health records (EHR) and to predict treatment failure.
Methods
This is an international, observational, retrospective study using free-text and structured data from HNSCC patients’ EHRs. A total of 203 clinical variables were extracted running EHRead®, a multilingual natural language processing (NLP) engine employing SNOMED-CT, from the EHR’s of HNSCC patients from 6 hospitals from Spain, France, and Colombia (2014-2021). Only patients with incident early/LA HNSCC were included. We utilized machine learning to train and 10-fold-cross validate several algorithms to predict progression, recurrence, or death within 2 years after radical treatment, selecting the model with the best performance metrics.
Results
Among 2,034,994 patients screened, 1,209 individuals were identified with early (44%) and LA (56%) HNSCC: 39.7% oral cavity, 32.8% larynx, 23.1% oropharynx, and 4.5% hypopharynx. Median age was 65 years (69.2% male). Main comorbidities were hypertension (50.1%), diabetes (33.3%). Most frequent symptoms at diagnosis were pain (60.7%), dysphagia (25.6%). A total of 484 (40%) patients experienced progression, recurrence, or death within 2 years. The algorithm with the best performance to predict radical treatment failure employed a random forest model, with an area under the ROC curve of 0.709. The main predictors of this model were: Age, stage, alcohol consumption, involved nodes, pain at diagnosis and surgical treatment.
Conclusions
This first phase of the HNC-TACTIC study shows the ability of NLP to extract accurate clinical data from HNSCC patients’ EHRs, and for machine-learning models to predict risk of treatment failure, demonstrating the potential capabilities of AI-powered real-world studies. The next phase will validate and retrain this model on data from 9 countries.
Clinical trial identification
NCT05117775.
Editorial acknowledgement
Legal entity responsible for the study
Savana S.L.
Funding
Savana S. L.
Disclosure
M. López, E. Castillo: Financial Interests, Institutional, Full or part-time Employment: Savana S. L. L. Cabal Hierro: Non-Financial Interests, Institutional, Full or part-time Employment: Savana S. L. H.T. Study Group: Financial Interests, Institutional, Full or part-time Employment, Judith Marín-Corral, Bianca de León, Ignacio Salcedo, David Casadevall, Victor Fanjul, Sebastian Menke, Almudena Chapa, Adrián Ceja: Savana S. L. ; Non-Financial Interests, Institutional, Affiliate, John de Almeida (UHN Canada), Andreas Dietz (Leipzig, Germany), Robert Ferris (UPMC, USA, Raul Giglio (Hospital Roffo, Argentina), Chris Holsinger (Stanford University School of Medicine, USA), Kate Hutcheson (MD Anderson Cancer Center, USA), Sandro Porceddu (Queensland Health, Australia), Christian Simon (Lausanne University Hospital, Switzerland), Ana López Alfonso (HUIL, Spain), Gloria María Serrano Montero (HUIL, Spain), Laura Rodrigáñez Riesco (H La Paz, Spain), Beatriz Castelo Fernández (H La Paz, Spain), Cristina Urbano (H de Granollers, Spain), Jordi Serra Carreras (H Granollers, Spain), Carlos García (H Alma Matter de Antioquia, Colombia), Yair Monsalve (H Alma Matter de Antioquia, Colombia), Javier Alonso Ortega (H de Móstoles, Spain), Gabriela Morales Medina (H de Móstoles, Spain), Renaud Schiappa (Centre Antoine Lacassagne, France,: Savana S. L. M. Taberna: Financial Interests, Institutional, Full or part-time Employment: Savana S. L. All other authors have declared no conflicts of interest.
Resources from the same session
927P - Preliminary results of the BROADEN study: Burden of human papillomavirus-related head and neck cancers
Presenter: Laia Alemany
Session: Poster session 03
928P - Radiomic analysis based on machine learning of multi-MR sequences to assess early treatment response in locally advanced nasopharyngeal carcinoma
Presenter: Lei Qiu
Session: Poster session 03
929P - Advanced laryngeal squamous cell carcinoma prognosis and machine learning insights
Presenter: Tala Alshwayyat
Session: Poster session 03
Resources:
Abstract
930P - Real-world data analysis of oncological outcomes in patients with pathological extranodal extension (ENE) in OSCC: A proposal to refine the pathological nodal staging system
Presenter: Abhinav Thaduri
Session: Poster session 03
931P - Deep learning models for predicting short-term efficacy in locally advanced nasopharyngeal carcinoma
Presenter: Kexin Shi
Session: Poster session 03
932P - Accuracy and prognostic implications of extranodal extension on radiologic imaging in HPV-positive oropharyngeal cancer (HNCIG-ENE): A multinational, real-world study
Presenter: Hisham Mehanna
Session: Poster session 03
933P - Prediction of survival in patients with head and neck merkel cell carcinoma: Statistical and machine-learning approaches
Presenter: Jehad Yasin
Session: Poster session 03
936P - Chronic pain in cancer survivors: Head and neck versus other cancers
Presenter: Rong Jiang
Session: Poster session 03
937P - Pain, fatigue and depression symptom cluster in head and neck cancer survivors
Presenter: Iakov Bolnykh
Session: Poster session 03