Abstract 182P
Background
Mutations in the MAPK/ERK pathway are frequently found across cancer entities, including colorectal cancer (CRC) where the accurate diagnosis of KRAS and BRAF mutational status is pivotal for treatment decisions. While the mutation analysis is usually done via genomic sequencing, the prediction of mutations from histological images using artificial intelligence (AI) could present a faster alternative with broad potential for diagnostic routine, research applications, and trial recruitment. However, to date, such algorithms typically do not meet the required accuracy criteria for real-world application in different institutions.
Methods
Since the frequency of both BRAFmut and KRASmut is associated with easily available clinical patient parameters, we developed a multi-modal predictive AI model on n = 455 CRC cases from the TCGA database and UPenn. Besides patient data, the AI model uses the BRAFmut/KRASmut information and hematoxylin & eosin (H&E)-stained tissue images. We evaluated the model on an independent hold-out TCGA cohort of n = 114 samples and an additional external cohort of n = 104 CRC samples from the CPTAC database.
Results
With our multi-modal approach the AI model achieved an AUROC of 0.84 ± 0.02 and 0.67 ± 0.01 for BRAF/KRAS respectively on the TCGA hold-out test set. Accuracy levels were similar on the second external testing dataset (CPTAC) (AUROC of 0.82 ± 0.02 and 0.72 ± 0.01) indicating the model’s ability to generalize across different cohorts. Notably, accuracy values obtained with the multi-modal training setup were significantly higher than those from models that were trained with image data only (AUROCs (CPTAC): BRAF 0.73 +- 0.02, KRAS: 0.64 +- 0.03).
Conclusions
By analyzing mutations in two of the most frequently mutated genes in CRC in two separate cohorts, we demonstrate that the inclusion of patient parameters in AI training can provide added value for diagnostic accuracy of AI models that predict mutations from H&E images. Our results also support previous findings that some driver mutations can be more accurately predicted from tissue than others. Altogether, these results show the potential of multi-modal deep learning to bring predictive AI towards real-world application in pathology.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
Mindpeak GmbH.
Funding
Mindpeak GmbH.
Disclosure
M. Päpper: Financial Interests, Personal, Full or part-time Employment: Mindpeak GmbH; Financial Interests, Personal, Stocks/Shares: Mindpeak GmbH; Financial Interests, Personal, Leadership Role: Mindpeak GmbH. P. Frey: Financial Interests, Personal, Full or part-time Employment: Mindpeak GmbH. T. Lang: Financial Interests, Personal, Full or part-time Employment: Mindpeak GmbH; Financial Interests, Personal, Leadership Role: Mindpeak GmbH; Financial Interests, Personal, Stocks or ownership: Mindpeak GmbH. All other authors have declared no conflicts of interest.
Resources from the same session
102P - Dynamic profiling of ctDNA in HER2-alterated advanced NSCLC treated with pyrotinib and apatinib: Exploratory biomarker analysis from a phase II trial
Presenter: Yucheng Dong
Session: Poster session 08
103P - Epigenetic regulated genes enhanced fragmentomics-based model for early-stage lung cancer detection
Presenter: Yadong Wang
Session: Poster session 08
104P - The development of a classifier of somatic copy number alteration burden in liquid biopsy with potential clinical impact in advanced non-small cell lung cancer (NSCLC)
Presenter: Laura Bonanno
Session: Poster session 08
105P - Plasma ctDNA dynamics as clinical response biomarker for NSCLC: A systematic review and meta-analysis
Presenter: Luís Leite
Session: Poster session 08
106P - Longitudinal molecular characterization in plasma of EGFR mutant non-small cell lung cancer (NSCLC) experiencing early progression (EPD) on first-line osimertinib (Osi)
Presenter: Laura Bonanno
Session: Poster session 08
107P - Germline pathogenic variants identified in tissue- and blood-based whole exome sequencing in advanced solid tumors
Presenter: Takeshi Kuwata
Session: Poster session 08
108P - Assessing molecular characteristics in a large cohort of anal squamous cell carcinoma patients
Presenter: Cristina Smolenschi
Session: Poster session 08
109P - Development and validation of a digital PCR assay for detection and monitoring of universally methylated circulating tumor DNA in patients with high-grade sarcoma
Presenter: Elisabeth Ashton
Session: Poster session 08
110P - 13-plex non-invasive genotyping in HPV related cancers in the MOSCATO trial
Presenter: Elise Rupp
Session: Poster session 08
111P - Leveraging circulating tumor DNA sequencing for first-line cancer treatment: Insights from two prospective precision medicine studies
Presenter: Veronique Debien
Session: Poster session 08