Abstract 117P
Background
The shift toward precision oncology requires the identification of novel, highly specific drug targets. Publicly available transcriptomic data offer a rich resource for identifying such targets, yet they remain largely underutilized. To address this, we present a scalable, data-driven platform for pan-cancer antigen target discovery leveraging the untapped potential of public transcriptomic data, along with extensive biological and pharmaceutical knowledge.
Methods
We integrated 299 microarray datasets using our AI-augmented, human-supervised clinical data curation and transcriptomic data normalization pipeline. We then used our open-source batch effects correction tool, PyComBat, to aggregate them into 15 indication-specific cohorts. The resulting cohorts, profiling 20,347 genes, breadth with 45 curated clinical data elements, exhibit exceptional size, encompassing 15,500 tumor and healthy tissue samples, surpassing TCGA projects by 2.1 times. We also increased patient population representativity with an average of 3.2 histological subtypes included in cohorts, compared to only 1.2 in datasets taken individually.
Results
To handle cancer heterogeneity, we stratified our cohorts into patient subpopulations based on transcriptomic profiles using consensus clustering analysis, interpreted with clinical data and pathway analysis. We then used our target discovery pipeline, starting with differential gene expression analysis, followed by proteomic filters to limit anticipated cytotoxicity and focus on cell surface-bound proteins. An average of 35 and 48 relevant antigen targets were identified at the indication and cluster level, respectively. These included targets already described in the literature, e.g. CD19 in acute lymphoblastic leukemia and BCMA in multiple myeloma. Finally, we characterized the hundreds of candidate targets using bulk and single cell transcriptomic data, proteomic data, and biological knowledge to evaluate their safety, efficacy, and robustness.
Conclusions
Encompassing data integration and target identification, our platform is scalable for the use with any cancer type and antigen-targeting modality, exemplifying its potential to accelerate oncology drug discovery.
Editorial acknowledgement
Clinical trial identification
Legal entity responsible for the study
Epigene Labs.
Funding
Epigene Labs.
Disclosure
All authors have declared no conflicts of interest.
Resources from the same session
104P - Comprehensive analysis of clinical characteristics and germline status among colorectal cancer patients in a tertiary care center in Thailand
Presenter: NUTDANAI ROILA
Session: Cocktail & Poster Display session
Resources:
Abstract
105P - Subsequent treatments after progression on cyclin-dependent kinase 4/6 inhibitors: A multicentric real-world data study
Presenter: Ana Rita Freitas
Session: Cocktail & Poster Display session
Resources:
Abstract
106P - Toxicity profile antibody-drug conjugates (ADCs) in metastatic breast cancer patients: A systematic review and meta-analysis based on studies’ design
Presenter: Silvia Belloni
Session: Cocktail & Poster Display session
Resources:
Abstract
107P - Receptor change on residual disease following neoadjuvant therapies for locally advanced breast cancer fails to impact oncological and survival outcomes
Presenter: Rionagh Lynch
Session: Cocktail & Poster Display session
Resources:
Abstract
114P - Comprehensive genomic profiling by liquid biopsy captures tumor heterogeneity and identifies cancer vulnerabilities in patients with RAS/BRAFV600E wild type metastatic colorectal cancer in the CAPRI 2-GOIM trial
Presenter: Davide Ciardiello
Session: Cocktail & Poster Display session
Resources:
Abstract
115P - Impact of tissue factor on clinical and biological characteristics in patients with advanced pancreatic cancer
Presenter: Taro Shibuki
Session: Cocktail & Poster Display session
Resources:
Abstract
116P - Multiomic profiling based on <italic>Akkermansia muciniphila</italic> in advanced non-small cell lung cancer
Presenter: Lorenzo Belluomini
Session: Cocktail & Poster Display session
Resources:
Abstract
118P - Whole transcriptome sequencing of lung tissue to combine disease classification and identification of actionable targets
Presenter: Alejandro Pallares Robles
Session: Cocktail & Poster Display session
Resources:
Abstract
119P - Genetic profiling of breast cancer in a developing country: Towards the establishment of oncogenetics in Cameroon
Presenter: Kenn Chi Ndi
Session: Cocktail & Poster Display session
Resources:
Abstract
120P - Uncovering the prognostic potential of FGFR2c isoform expression in advanced gastroesophageal cancer through MONSTAR-SCREEN-2 analysis
Presenter: Tadayoshi Hashimoto
Session: Cocktail & Poster Display session
Resources:
Abstract