Abstract 5820
Background
DNA sequencing to identify variants is becoming increasingly valuable in clinical settings; including matching patients to approved targeted therapies, immunotherapies, and/or clinical trials. However, accurate calling of genetic variants from sequencing still remains challenging. With little corroboration between the different tools available, patients are at risk of being treated with therapies that are unsuitable for their cancer.
Methods
Here we present a novel machine learning based method for the accurate identification of somatic variants in cancer patient tumour samples, with a neural network architecture from encoded raw sequencing read information of tumour/normal sample pairings into an image, enabling it to classify whether a variant is germline, somatic, or sequencing error. The model was trained and tested on in-silico spike-in data using bam-surgeon, and then validated on a multi-cancer and multi-center dataset and benchmarked against industry standard variant callers.
Results
The approach, called somaticNET, outperforms existing industry standard tools in sensitivity and specificity, achieving an AUROC of ∼1.00 on the bam-surgeon dataset and an AUROC of ∼0.99 on the multi-cancer multicenter dataset. The model also works faster than other variant callers, in minutes compared to hours.
Conclusions
Using the power of machine learning for accurate somatic variant calling can improve patient matching to approved therapies and clinical trials, thus ensuring patients are given the right therapy at the right time to treat their cancer.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
The authors.
Funding
Cambridge Cancer Genomics.
Disclosure
G. Dubourg-Felonneau: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. D. Rebergen: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. C. Parsons: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H. Thompson: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. J.W. Cassidy: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment, Officer / Board of Directors: Cambridge Cancer Genomics. N. Patel: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H.W. Clifford: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics.
Resources from the same session
5218 - Elevated driver mutational burden or number of perturbed pathways and poor response to abiraterone or enzalutamide in metastatic castration-resistant prostate cancer
Presenter: Bram De Laere
Session: Poster Display session 3
Resources:
Abstract
2452 - High proportion of multiple KRAS mutations in circulating tumor DNA and tumor tissue of pancreatic ductal adenocarcinoma
Presenter: Min Kyeong Kim
Session: Poster Display session 3
Resources:
Abstract
3328 - Biological difference of tumor mutational burden (TMB) and microsatellite instability (MSI) status in patients (pts) with somatic vs. germline BRCA1/2-mutated advanced gastrointestinal (GI) cancers using cell-free DNA (cfDNA) sequencing analysis in the GOZILA study
Presenter: Yasuyuki Kawamoto
Session: Poster Display session 3
Resources:
Abstract
3022 - Cell-Free DNA to Detect Focal Versus Non-Focal MET Amplification in Metastatic Colorectal Cancer Patients: Combined Analysis from Japan and the United States
Presenter: Mishima Saori
Session: Poster Display session 3
Resources:
Abstract
2833 - Presence of circulating tumor DNA in surgically resected renal cell carcinoma is associated with advanced disease and poor patient prognosis
Presenter: Andres Correa
Session: Poster Display session 3
Resources:
Abstract
1376 - Combined genomic and epigenomic assessment of cell-free circulating tumor DNA (cfDNA) for cancer diagnosis and recurrence-risk assessment in early-stage lung cancer
Presenter: Junghee Lee
Session: Poster Display session 3
Resources:
Abstract
4050 - DEMo: a prospective evaluation of a prognostic clinico-molecular composite score in NSCLC patients treated with immunotherapy.
Presenter: Arsela Prelaj
Session: Poster Display session 3
Resources:
Abstract
4727 - Bespoke circulating tumor DNA (ctDNA) analysis as a predictive biomarker in solid tumor patients (pts) treated with single agent pembrolizumab (P)
Presenter: Cindy Yang
Session: Poster Display session 3
Resources:
Abstract
3662 - Dynamic changes in whole-genome cell-free DNA (cfDNA) to identify disease progression prior to imaging in advanced solid tumors
Presenter: Andrew Davis
Session: Poster Display session 3
Resources:
Abstract
3817 - Evaluation of Microsatellite Instability Testing Through cell-free DNA sequencing
Presenter: Shile Zhang
Session: Poster Display session 3
Resources:
Abstract