Abstract 1178P
Background
Non-small cell lung cancer (NSCLC) requires multiple genomics testing modalities for optimizing patient outcomes. The foremost of NSCLC biomarkers is EGFR sequencing. Sequencing comes with many challenges, including long turnaround time, high tissue requirements from small biopsies, and cost. An AI model using only digital whole slide images (WSI) can act as a rapid screening test to prioritize tissue for proper sequencing without expending tissue.
Methods
A vision transformer (ViT) base architecture is trained for classification of acinar, solid, lepidic, papillary, and micropapillary morphologies, using 1 million 2242 pixel patches extracted from 3475 WSIs. The training utilizes cross-entropy loss with the Adam optimizer with learning rate of 1e-4 and cosine weight decay scheduler. The pretrained encoder allows for extraction of 768-dimensional feature vectors from the last hidden layer for downstream tasks. For EGFR prediction, each of the 1558 training WSIs are decomposed to 2242 pixel patches and feature embeddings are extracted for each patch. Using a gated attention-based multiple instance learning model, EGFR WSI labels are predicted. The model was optimized using 260 WSIs to obtain best AUC. The best model was evaluated on a held-out set of 6300 WSIs before integration into a mock clinical workflow, enabling in real-time (IRT) EGFR prediction for 7 slides. The informatic backbone identifies WSI at time of scanning and transfers the slide for inference, complted within 30 minutes of scanning.
Results
On the validation dataset of 260 cases, our model exhibited an area under the curve (AUC) of 0.93 with a specificity of 0.90 and sensitivity of 0.88. The model, assessed on an independent validation set of 6300 cases, maintained a high AUC of 0.89 with negative/positive predictive value (NPV/PPV): NPV = 0.90; PPV = 0.71. On IRT cohort, using same threshold: NPV = 1.0; PPV = 0.66.
Conclusions
Implementing such a model that can be ran IRT with clinical WSIs can provide rapid insight and inform ongoing testing protocols (e.g. prioritize tissue for EGFR confirmation when positive or full genomics when negative). Continuous refinement and integration of IRT data will enhance performance to align with clinical process requirements.
Clinical trial identification
Editorial acknowledgement
During the preparation of this work the author(s) used ChatGPT in order to construct the abstract title. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.
Legal entity responsible for the study
The Warren Alpert Center for Digital and Computational Pathology, Memorial Sloan Kettering Cancer Center.
Funding
The Warren Alpert Foundation, The Warren Alpert Center for Digital and Computational Pathology, Memorial Sloan Kettering Cancer Center.
Disclosure
C.M. Vanderbilt: Financial Interests, Personal, Stocks or ownership: Paige AI. T. Fuchs: Financial Interests, Personal, Advisory Board, Founder, Equity holder, etc: Paige AI. M. Hameed: Financial Interests, Personal, Other, Fiduciary Role/Position: USCAP. A. Dogan: Financial Interests, Personal, Other, Professional Services and Activities: Incyte. All other authors have declared no conflicts of interest.
Resources from the same session
202P - eIF4E inhibition exhibits anti-tumor activity and re-sensitizes acquired resistant KRAS G12C NSCLC to KRAS inhibitors
Presenter: Andrew Truong
Session: Poster session 09
203P - An innovative evidence-based laboratory medicine (EBLM) test to help doctors in multi-cancer early detection (MCED)
Presenter: Jose D Santotoribio
Session: Poster session 09
204P - Assessing biomarker testing awareness among patients and caregivers in NSCLC through an interdisciplinary global survey
Presenter: Rodrigo Paredes
Session: Poster session 09
205P - Detection and diagnosis of lung cancer by electronic nose analysis of exhaled breath: A multi-center prospective observational study
Presenter: Alessandra Buma
Session: Poster session 09
206P - Unveiling the link: How metabolic syndrome drives endometrial cancer progression
Presenter: Lirong Zhai
Session: Poster session 09
Resources:
Abstract
207P - Associations of diabetic background retinopathy and ER+ breast cancer risk: A Mendelian randomization study
Presenter: Shu Wang
Session: Poster session 09
208P - Role of plasma exosomes in crosstalk between immune system and hereditary ovarian cancer: Opportunity or challenge?
Presenter: Daniele Fanale
Session: Poster session 09
209P - A novel method for early evaluation of drug-specific predictive biomarker
Presenter: Gal Dinstag
Session: Poster session 09
210P - Therapeutic implications of phosphoproteomics in molecular cancer diagnostics
Presenter: Annika Schneider
Session: Poster session 09
211P - GynePDX: A new platform of preclinical models for endometrial and ovarian cancers
Presenter: Melek Denizli
Session: Poster session 09