Abstract 5820
Background
DNA sequencing to identify variants is becoming increasingly valuable in clinical settings; including matching patients to approved targeted therapies, immunotherapies, and/or clinical trials. However, accurate calling of genetic variants from sequencing still remains challenging. With little corroboration between the different tools available, patients are at risk of being treated with therapies that are unsuitable for their cancer.
Methods
Here we present a novel machine learning based method for the accurate identification of somatic variants in cancer patient tumour samples, with a neural network architecture from encoded raw sequencing read information of tumour/normal sample pairings into an image, enabling it to classify whether a variant is germline, somatic, or sequencing error. The model was trained and tested on in-silico spike-in data using bam-surgeon, and then validated on a multi-cancer and multi-center dataset and benchmarked against industry standard variant callers.
Results
The approach, called somaticNET, outperforms existing industry standard tools in sensitivity and specificity, achieving an AUROC of ∼1.00 on the bam-surgeon dataset and an AUROC of ∼0.99 on the multi-cancer multicenter dataset. The model also works faster than other variant callers, in minutes compared to hours.
Conclusions
Using the power of machine learning for accurate somatic variant calling can improve patient matching to approved therapies and clinical trials, thus ensuring patients are given the right therapy at the right time to treat their cancer.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
The authors.
Funding
Cambridge Cancer Genomics.
Disclosure
G. Dubourg-Felonneau: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. D. Rebergen: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. C. Parsons: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H. Thompson: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. J.W. Cassidy: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment, Officer / Board of Directors: Cambridge Cancer Genomics. N. Patel: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H.W. Clifford: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics.
Resources from the same session
2477 - Antecedent of cancer and mortality after the first ST segment elevation acute myocardial infarction treated with primary coronary angioplasty. A prospective cohort study
Presenter: Irene Sillero
Session: Poster Display session 3
Resources:
Abstract
1894 - Genomic characterisation of locally advanced pancreatic adenocarcinoma
Presenter: Sarah Picardo
Session: Poster Display session 3
Resources:
Abstract
3280 - Comparison of freshly prepared and frozen cells from colorectal cancer surgical samples for phenotyping experiments- a pilot study
Presenter: Sandra Mersakova
Session: Poster Display session 3
Resources:
Abstract
3419 - Hyaluronan (HA) Accumulation in the Tumor Microenvironment (TME) is Increased in Colorectal Cancer (CRC) and Associated with Consensus Molecular Subtypes (CMS) 4 Molecular Subtype
Presenter: Barbara Blouw
Session: Poster Display session 3
Resources:
Abstract
1833 - Evaluation of CT-based radiomics in patients with renal cell carcinoma
Presenter: An Zhao
Session: Poster Display session 3
Resources:
Abstract
5883 - Detection of Double Protein Expression in Diffuse Large B Cell Lymphoma
Presenter: Mohamed Gouda
Session: Poster Display session 3
Resources:
Abstract
5415 - Encyclopedic Tumor Analysis for organ agnostic treatment with Axitinib in combination regimens for advanced cancers
Presenter: Tim Crook
Session: Poster Display session 3
Resources:
Abstract
3297 - Computational model to predict response rate of clinical trials
Presenter: Orsolya Lorincz
Session: Poster Display session 3
Resources:
Abstract
4355 - Analysis of BRCA genes and homologous recombination deficiency (HRD) scores in tumours from patients (pts) with metastatic breast cancer (mBC) in the OlympiAD trial
Presenter: Mark Robson
Session: Poster Display session 3
Resources:
Abstract
2316 - A 3D co-culture platform of breast cancer and patient derived immune cells to analyse the response to chemotherapy and immunotherapies
Presenter: Diana Saraiva
Session: Poster Display session 3
Resources:
Abstract