Abstract 117P
Background
The shift toward precision oncology requires the identification of novel, highly specific drug targets. Publicly available transcriptomic data offer a rich resource for identifying such targets, yet they remain largely underutilized. To address this, we present a scalable, data-driven platform for pan-cancer antigen target discovery leveraging the untapped potential of public transcriptomic data, along with extensive biological and pharmaceutical knowledge.
Methods
We integrated 299 microarray datasets using our AI-augmented, human-supervised clinical data curation and transcriptomic data normalization pipeline. We then used our open-source batch effects correction tool, PyComBat, to aggregate them into 15 indication-specific cohorts. The resulting cohorts, profiling 20,347 genes, breadth with 45 curated clinical data elements, exhibit exceptional size, encompassing 15,500 tumor and healthy tissue samples, surpassing TCGA projects by 2.1 times. We also increased patient population representativity with an average of 3.2 histological subtypes included in cohorts, compared to only 1.2 in datasets taken individually.
Results
To handle cancer heterogeneity, we stratified our cohorts into patient subpopulations based on transcriptomic profiles using consensus clustering analysis, interpreted with clinical data and pathway analysis. We then used our target discovery pipeline, starting with differential gene expression analysis, followed by proteomic filters to limit anticipated cytotoxicity and focus on cell surface-bound proteins. An average of 35 and 48 relevant antigen targets were identified at the indication and cluster level, respectively. These included targets already described in the literature, e.g. CD19 in acute lymphoblastic leukemia and BCMA in multiple myeloma. Finally, we characterized the hundreds of candidate targets using bulk and single cell transcriptomic data, proteomic data, and biological knowledge to evaluate their safety, efficacy, and robustness.
Conclusions
Encompassing data integration and target identification, our platform is scalable for the use with any cancer type and antigen-targeting modality, exemplifying its potential to accelerate oncology drug discovery.
Editorial acknowledgement
Clinical trial identification
Legal entity responsible for the study
Epigene Labs.
Funding
Epigene Labs.
Disclosure
All authors have declared no conflicts of interest.
Resources from the same session
94P - Clinical utility of long read sequencing for comprehensive analysis of cancer patient genomes
Presenter: Rowan Howell
Session: Cocktail & Poster Display session
Resources:
Abstract
95P - Implementation of TDM and PGx in mRCC patients treated with sunitinib to personalize drug dosing
Presenter: Giorgia Bortolus
Session: Cocktail & Poster Display session
Resources:
Abstract
96P - Physician perceptions and the evolving landscape of next generation sequencing (NGS) use across EU4+UK
Presenter: Keerun Khela
Session: Cocktail & Poster Display session
Resources:
Abstract
97P - Population-based assessment of outcomes in pancreatic ductal adenocarcinoma (PDAC) patients (pts) with pathogenic germline variants (PGVs)
Presenter: Phoebe Cheng
Session: Cocktail & Poster Display session
Resources:
Abstract
98P - Clinical implementation of NGS and ESCAT for optimized cancer therapy
Presenter: AI. Martin-Quesada
Session: Cocktail & Poster Display session
Resources:
Abstract
99P - Prevalence of homologous recombination repair deficiency-associated variants in non-selected Portuguese patients with metastatic prostate cancer
Presenter: Tiago Barroso
Session: Cocktail & Poster Display session
Resources:
Abstract
100P - Real-world insights in precision oncology: First data of the comprehensive clinico-genomics database network AIO-BNHO-CONNECT (AIO-TF-0122) in Germany
Presenter: Maike Collienne
Session: Cocktail & Poster Display session
Resources:
Abstract
101P - Molecular tumor board for advanced cancer patients offers new therapeutic opportunities beyond level 1 ESCAT scale
Presenter: Victor Segui-Manzaneque
Session: Cocktail & Poster Display session
Resources:
Abstract
102P - Prognostic model of pediatric AML patients with RUNX1-RUNX1T1 fusion gene
Presenter: Yang Xun
Session: Cocktail & Poster Display session
Resources:
Abstract
103P - NICE recommendations and ESMO-MCBS/ESCAT scores for solid tumour drugs
Presenter: Eunice Xing
Session: Cocktail & Poster Display session
Resources:
Abstract