Fall Research Expo 2023

Exploration of Feature Selection Strategies in DNA Methylome-based Cancer Classification

DNA cytosine modification at CpG dinucleotides are rich encoders of a cancer cell’s mitotic history and cell-of-origin information, establishing the DNA methylome as a powerful molecular analyte for cancer diagnosis. However, classifiers trained on one methylation assay platform may not translate effectively to other assay platforms due to probe selection changes and platform-specific technical artifacts such as signal background and amplification bias. To derive a pan-platform classifier, we explored various feature transformation strategies, guided by biological knowledge of chromatin states and other knowledge base sets. Targeting brain cancers and 33 cancer types from The Cancer Genome Atlas (TCGA), we investigated diverse methods of feature selection, including aggregating CpG methylation using tissue signature databases and nonparametric rank transformation. After evaluating various tissue signature databases, including transcription factor binding sites (TFBS), chromatin states (chromHMM), histone modifications (HM), and more, we trained individual models for each of these databases and compared their respective feature importances and class accuracies.  

PRESENTED BY
PURM - Penn Undergraduate Research Mentoring Program
College of Arts & Sciences 2026
Advised By
Wanding Zhou
Assistant Professor of Pathology and Laboratory Medicine
PRESENTED BY
PURM - Penn Undergraduate Research Mentoring Program
College of Arts & Sciences 2026
Advised By
Wanding Zhou
Assistant Professor of Pathology and Laboratory Medicine

Comments