Abstract 5820
Background
DNA sequencing to identify variants is becoming increasingly valuable in clinical settings; including matching patients to approved targeted therapies, immunotherapies, and/or clinical trials. However, accurate calling of genetic variants from sequencing still remains challenging. With little corroboration between the different tools available, patients are at risk of being treated with therapies that are unsuitable for their cancer.
Methods
Here we present a novel machine learning based method for the accurate identification of somatic variants in cancer patient tumour samples, with a neural network architecture from encoded raw sequencing read information of tumour/normal sample pairings into an image, enabling it to classify whether a variant is germline, somatic, or sequencing error. The model was trained and tested on in-silico spike-in data using bam-surgeon, and then validated on a multi-cancer and multi-center dataset and benchmarked against industry standard variant callers.
Results
The approach, called somaticNET, outperforms existing industry standard tools in sensitivity and specificity, achieving an AUROC of ∼1.00 on the bam-surgeon dataset and an AUROC of ∼0.99 on the multi-cancer multicenter dataset. The model also works faster than other variant callers, in minutes compared to hours.
Conclusions
Using the power of machine learning for accurate somatic variant calling can improve patient matching to approved therapies and clinical trials, thus ensuring patients are given the right therapy at the right time to treat their cancer.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
The authors.
Funding
Cambridge Cancer Genomics.
Disclosure
G. Dubourg-Felonneau: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. D. Rebergen: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. C. Parsons: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H. Thompson: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. J.W. Cassidy: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment, Officer / Board of Directors: Cambridge Cancer Genomics. N. Patel: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H.W. Clifford: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics.
Resources from the same session
3842 - Effect of docetaxel-resistance on the reactivity of prostate cancer cells to metformin
Presenter: Jessica Catapano
Session: Poster Display session 3
Resources:
Abstract
5198 - Cell plasticity and taxanes resistance in metastatic prostate cancer: ESRP1 as a predictive biomarker of taxane response
Presenter: Natalia Jimenez
Session: Poster Display session 3
Resources:
Abstract
2981 - Effect of Selumetinib plus AZD8186 treatment on Cabazitaxel sensitivity in docetaxel-acquired resistant metastatic prostate cancer cell lines
Presenter: Vicenc Ruiz de Porras
Session: Poster Display session 3
Resources:
Abstract
2779 - Anti-tumor activity of cediranib, a pan-inhibitor of vascular endothelial growth factor receptors, in pancreatic ductal adenocarcinoma cells
Presenter: Majid Momeny
Session: Poster Display session 3
Resources:
Abstract
1782 - The molecular mechanisms of EpCAM in regulating tumor progression and development of anti-EpCAM antibodies for colon cancer diagnosis and therapy
Presenter: Han-chung Wu
Session: Poster Display session 3
Resources:
Abstract
1322 - Detection of microRNAs as biomarker for anti-EGFR antibody resistance in colon cancer patients
Presenter: Jens Hahne
Session: Poster Display session 3
Resources:
Abstract
1579 - Serum exosomal microRNA-199b-5p as a novel circulating biomarker to predict response of preoperative chemoradiotherapy for locally advanced rectal cancer
Presenter: Dong Won Baek
Session: Poster Display session 3
Resources:
Abstract
1761 - Live biobank of patient-derived organoids from Thai colorectal cancer patients enables clinical outcome prediction
Presenter: Pariyada Tanjak
Session: Poster Display session 3
Resources:
Abstract
3542 - The biological implications of PDCD6 dysregulation in colorectal cancer
Presenter: Romina Briffa
Session: Poster Display session 3
Resources:
Abstract
4634 - Comparative molecular analyses between microsatellite stable BRAFV600E mutant colorectal cancers and BRAFV600E mutant melanomas.
Presenter: Mohamed Salem
Session: Poster Display session 3
Resources:
Abstract