Abstract 117P
Background
The shift toward precision oncology requires the identification of novel, highly specific drug targets. Publicly available transcriptomic data offer a rich resource for identifying such targets, yet they remain largely underutilized. To address this, we present a scalable, data-driven platform for pan-cancer antigen target discovery leveraging the untapped potential of public transcriptomic data, along with extensive biological and pharmaceutical knowledge.
Methods
We integrated 299 microarray datasets using our AI-augmented, human-supervised clinical data curation and transcriptomic data normalization pipeline. We then used our open-source batch effects correction tool, PyComBat, to aggregate them into 15 indication-specific cohorts. The resulting cohorts, profiling 20,347 genes, breadth with 45 curated clinical data elements, exhibit exceptional size, encompassing 15,500 tumor and healthy tissue samples, surpassing TCGA projects by 2.1 times. We also increased patient population representativity with an average of 3.2 histological subtypes included in cohorts, compared to only 1.2 in datasets taken individually.
Results
To handle cancer heterogeneity, we stratified our cohorts into patient subpopulations based on transcriptomic profiles using consensus clustering analysis, interpreted with clinical data and pathway analysis. We then used our target discovery pipeline, starting with differential gene expression analysis, followed by proteomic filters to limit anticipated cytotoxicity and focus on cell surface-bound proteins. An average of 35 and 48 relevant antigen targets were identified at the indication and cluster level, respectively. These included targets already described in the literature, e.g. CD19 in acute lymphoblastic leukemia and BCMA in multiple myeloma. Finally, we characterized the hundreds of candidate targets using bulk and single cell transcriptomic data, proteomic data, and biological knowledge to evaluate their safety, efficacy, and robustness.
Conclusions
Encompassing data integration and target identification, our platform is scalable for the use with any cancer type and antigen-targeting modality, exemplifying its potential to accelerate oncology drug discovery.
Editorial acknowledgement
Clinical trial identification
Legal entity responsible for the study
Epigene Labs.
Funding
Epigene Labs.
Disclosure
All authors have declared no conflicts of interest.
Resources from the same session
121P - DNA methylation co-operates with genomic alterations during non-small cell lung cancer evolution
Presenter: Nnenna Kanu
Session: Cocktail & Poster Display session
Resources:
Abstract
122P - Comprehensive multi-omics profiling identifies prognostic and predictive subtypes in renal cell carcinoma
Presenter: Sanha Park
Session: Cocktail & Poster Display session
Resources:
Abstract
123P - Copy number from ulcWGS to predict TNBC molecular subtypes in the IBCSG 22-00 trial
Presenter: Andrea Joaquin Garcia
Session: Cocktail & Poster Display session
Resources:
Abstract
124P - Targeting neoantigens in chronic lymphocytic leukemia (CLL) for personalized T cell therapy
Presenter: Gurvinder Kaur
Session: Cocktail & Poster Display session
Resources:
Abstract
125P - Detection and analysis of medulloblastoma subtype-specific copy number variations from RNA-seq data for improved risk-based subtype classification
Presenter: Ivan Martinez de Estibariz Royuela
Session: Cocktail & Poster Display session
Resources:
Abstract
126P - Genomic and transcriptomic profiles define smokers and non-smokers lung squamous cell carcinoma patients
Presenter: Matteo Canale
Session: Cocktail & Poster Display session
Resources:
Abstract
128P - Metastatic migrations in lung cancer: Insights from the PEACE autopsy programme
Presenter: Sonya Hessey
Session: Cocktail & Poster Display session
Resources:
Abstract
129P - NGS prescreening program for refractory solid tumors outside standard indications in a public network of cancer centers
Presenter: Paula Sàbat Viltró
Session: Cocktail & Poster Display session
Resources:
Abstract
130P - Transcriptomic analysis of patients with metastatic hormone-sensitive prostate cancer to identify genomic signatures involved in the transition from androgen-dependent to androgen-independent phenotype
Presenter: Giovanna Pecoraro
Session: Cocktail & Poster Display session
Resources:
Abstract
131P - Benchmarking whole exome sequencing in the German network for personalized medicine
Presenter: Michael Menzel
Session: Cocktail & Poster Display session
Resources:
Abstract