Abstract 117P
Background
The shift toward precision oncology requires the identification of novel, highly specific drug targets. Publicly available transcriptomic data offer a rich resource for identifying such targets, yet they remain largely underutilized. To address this, we present a scalable, data-driven platform for pan-cancer antigen target discovery leveraging the untapped potential of public transcriptomic data, along with extensive biological and pharmaceutical knowledge.
Methods
We integrated 299 microarray datasets using our AI-augmented, human-supervised clinical data curation and transcriptomic data normalization pipeline. We then used our open-source batch effects correction tool, PyComBat, to aggregate them into 15 indication-specific cohorts. The resulting cohorts, profiling 20,347 genes, breadth with 45 curated clinical data elements, exhibit exceptional size, encompassing 15,500 tumor and healthy tissue samples, surpassing TCGA projects by 2.1 times. We also increased patient population representativity with an average of 3.2 histological subtypes included in cohorts, compared to only 1.2 in datasets taken individually.
Results
To handle cancer heterogeneity, we stratified our cohorts into patient subpopulations based on transcriptomic profiles using consensus clustering analysis, interpreted with clinical data and pathway analysis. We then used our target discovery pipeline, starting with differential gene expression analysis, followed by proteomic filters to limit anticipated cytotoxicity and focus on cell surface-bound proteins. An average of 35 and 48 relevant antigen targets were identified at the indication and cluster level, respectively. These included targets already described in the literature, e.g. CD19 in acute lymphoblastic leukemia and BCMA in multiple myeloma. Finally, we characterized the hundreds of candidate targets using bulk and single cell transcriptomic data, proteomic data, and biological knowledge to evaluate their safety, efficacy, and robustness.
Conclusions
Encompassing data integration and target identification, our platform is scalable for the use with any cancer type and antigen-targeting modality, exemplifying its potential to accelerate oncology drug discovery.
Editorial acknowledgement
Clinical trial identification
Legal entity responsible for the study
Epigene Labs.
Funding
Epigene Labs.
Disclosure
All authors have declared no conflicts of interest.
Resources from the same session
142P - Lipidomic signature in response to omega-3 fatty acids and γ-linolenic acid supplementation in breast cancer patients receiving aromatase inhibitors
Presenter: Vesna Vucic
Session: Cocktail & Poster Display session
Resources:
Abstract
143P - A tailored histology-driven molecular profiling algorithm proposal for salivary gland cancers
Presenter: Simone Rota
Session: Cocktail & Poster Display session
Resources:
Abstract
144P - Is it time to incorporate next generation sequencing of body fluids for detection of circulating tumor DNA (ctDNA) alterations?
Presenter: Aditya Shreenivas
Session: Cocktail & Poster Display session
Resources:
Abstract
145P - Unveiling the molecular landscape of head and neck cancer: Pathway dysregulations and potential therapeutic targets
Presenter: Rajeev Vijayakumar
Session: Cocktail & Poster Display session
Resources:
Abstract
146P - ESR1 fusions as potential mechanism of resistance to endocrine therapy in metastatic breast cancer
Presenter: Sewanti Limaye
Session: Cocktail & Poster Display session
Resources:
Abstract
147P - Clinical characteristics and outcomes in non-small cell lung cancer patients harboring rare mutations: A single center real-world data
Presenter: Ana Rita Freitas
Session: Cocktail & Poster Display session
Resources:
Abstract
148P - Diversity of genomic mechanisms of resistance to endocrine therapy in ER+ breast cancer
Presenter: Prithika Sritharan
Session: Cocktail & Poster Display session
Resources:
Abstract
149P - Assessing treatment options for gynaecological cancers (GC) using next-generation sequencing (NGS): A real-world analysis
Presenter: Álvaro García
Session: Cocktail & Poster Display session
Resources:
Abstract
150P - Prevalence of DPYD variants in 1478 cancer patients receiving fluoropyrimidine chemotherapy: A real-world data analysis
Presenter: Bahaaeldin Baraka
Session: Cocktail & Poster Display session
Resources:
Abstract
151P - Unravelling the limitations of next-generation sequencing (NGS)-based liquid biopsy (LB) across solid tumors: The PREICO-LB project
Presenter: Cinta Hierro
Session: Cocktail & Poster Display session
Resources:
Abstract