In the world of novel cancer therapeutics, the low-hanging fruit is long gone. New targets and new drugs will only come from the long slog of in-depth characterization, analysis, and evaluation. To achieve this, drug discovery and development scientists must extract deeper insights into disease biology from the limited and precious samples available to them.
Tissue samples are challenging to acquire, particularly in sufficient numbers and volumes. Plasma samples, on the other hand, can be obtained longitudinally in a minimally invasive manner. However, many well-established analysis methods, such as RNA-seq, do not translate well to liquid samples. Epigenetic analysis provides clear advantages as a stable analyte that performs reliably in a liquid biopsy setting and delivers information highly relevant to disease state and underlying biology.
Epigenetic profiling is not new to pharma scientists focused on oncology; however, conventional approaches typically focus on 5-methylcytosine (5mC), a widespread regulatory mechanism associated with gene silencing. More recently, scientists have begun to recognize the value of studying a different type of epigenetic marker: 5-hydroxymethylcytosine (5hmC), which indicates genes that are actively turned on. Emerging studies have now demonstrated the advantages of targeting 5hmC over 5mC across a range of applications in drug discovery and development.
5hmC for cancer drug development
5mC has been recognized for decades as a fundamental component of the epigenetic landscape. This modification is distributed throughout the human genome and plays a central role in transcriptional repression, contributing to the silencing of a substantial portion of the genome in any given cell. As the most abundant form of DNA methylation, 5mC has been the subject of extensive study.
However, the near-ubiquitous nature of 5mC can also be a disadvantage, in particular for applications in drug development. Identifying actionable disease signals in 5mC patterns often requires sifting through vast amounts of data, making analysis cumbersome and increasing the risk that meaningful signals are lost in the noise.
5hmC, on the other hand, has emerged as a robust diagnostic and functional marker for cancer biology. This marker is associated with gene activation, therefore marking oncogenes and other genes that are expressed in cancer cells. It provides more targeted data because it is localized to the small fraction of transcriptionally active regions of the genome. That makes 5hmC analysis far more efficient and cost-effective, requiring substantially less sequencing and input DNA to cover relevant areas of the genome. Moreover, detecting 5hmC may be one of the most reliable and sensitive ways to spot the earliest signs of cancer — even before a tumor is large enough to appear on an imaging scan. Specialized assays can enrich for the cells and genomic regions that matter most, providing a focused window into disease biology.
In addition, commonly utilized analysis methods for 5mC markers require a bisulfite conversion process that inherently damages the DNA, reducing strands to short fragments and preventing future reanalysis. By comparison, 5hmC can be studied using methods that avoid this step, enabling a more flexible workflow and making it feasible to work with small or precious samples.
These properties allow 5hmC to provide an unbiased, direct, genome-wide view of active disease biology, offering a real-time snapshot of what is happening in tumor cells via liquid biopsy.
Key applications of 5hmC for biopharma
5hmC analysis provides a high-resolution window into cancer biology, making it invaluable for clinical drug development. Its ability to detect markers across cancer subtypes and varying tumor burdens — even in tumor-naïve samples — supports drug evaluation, patient stratification, and real-time monitoring of treatment response or resistance.
As a dynamic marker that tracks with gene expression, one of the ways 5hmC data can help inform drug development programs is by elucidating disease biology in liquid biopsy samples that can be used to characterize novel targets and to pinpoint the biological mechanisms associated with cancer. In particular, with the ability to obtain serial samples in a clinical setting, 5hmC profiling of plasma samples can be used to generate data for use in reverse translation efforts. The genome-wide approach enables an unbiased investigation for disease targets and potential drug candidates.
For drug candidates in Phase 1 studies, 5hmC patterns can be analyzed to assess target levels or oncogenic signaling associated with targets that can be leveraged for investigating pharmacodynamics and establishing target engagement upon dose escalation in liquid biopsy samples. Gene activation data can reveal whether a drug candidate is targeting the intended oncogenic pathway and whether it is effectively inhibiting that pathway. This is particularly valuable for agents targeting oncogenic drivers, such as KRAS.
Additionally, 5hmC can provide insights into adverse events observed in early-stage clinical trials. In the context of FDA’s Project Optimus, which emphasizes thorough dose optimization in early-phase oncology trials, 5hmC offers a non-invasive way to monitor target engagement and pharmacodynamic response across multiple dose levels, supporting more informed dose selection and enhancing patient safety.
Phase 2 and Phase 3 studies can also benefit from 5hmC analysis of serial collections of blood samples. Epigenetic data can reveal how study participants are responding to a drug candidate, and potentially identify distinct signatures associated with response or resistance. The technique is valuable for informing combination therapy strategies and monitoring response to treatment. Ultimately, 5hmC biomarkers developed through this process may have clinical utility if the drug advances to market. By extending the use of 5hmC from early-phase trials into later phases, clinical teams can generate robust pharmacodynamic and biomarker data that support regulatory interactions and align with the FDA’s emphasis on evidence-driven dose selection and translational endpoints.
Importantly, all of this information is based on easy-to-access blood samples, with standard volumes sufficient even for genome-wide analysis of 5hmC data. This makes 5hmC an ideal analyte for deriving extensive insights from clinical studies without placing undue burden on participants through large blood draws.
5hmC in action
A few key publications demonstrate the potential for 5hmC analysis. In a paper published in the Journal for Immunotherapy of Cancer, researchers describe the use of genome-wide 5hmC profiles to identify likely responders to treatment with immune checkpoint inhibitors (ICIs). The team analyzed over 150 blood samples collected longitudinally from 31 participants with non-small cell lung cancer. The results indicated that the ICI induced specific changes in plasma 5hmC profiles, with responders exhibiting markedly different changes compared to non-responders. In responders, 5hmC levels were increased near genes associated with immune activation, consistent with previously published tissue-based ICI response signatures. Notably, 5hmC-based molecular response signatures could be detected after just one treatment cycle, highlighting their potential as early biomarkers to predict which participants are most likely to benefit from treatment very early in future studies.
In a separate study published in Blood Cancer Discovery, Nakauchi and colleagues reported that 5hmC profiles are highly cell-type specific, with distinct signatures linked to key hematopoietic regulators and unique patterns associated with TET2-mutant phenotypes, modeling preleukemic states and highlighting the critical role of 5hmC in normal hematopoiesis and early disease biology.
Similarly, researchers at the University of Chicago mapped 5hmC across 19 human tissues, showing enrichment on gene bodies and enhancers of actively transcribed genes and tissue-specific regulatory patterns. Together, these studies establish 5hmC as a biologically informative epigenetic marker with relevance to both normal developmental processes and disease states, and as a promising candidate for biomarker development.
Lastly, a study from the University of California, San Francisco, mapped the 5hmC landscape across prostate cancer, revealing how this epigenetic marker tracks tumor progression and therapy resistance. Researchers found that 5hmC patterns highlight key androgen receptor regulatory regions and transcriptional programs, capturing shifts linked to advanced, treatment-resistant disease. Importantly, 5hmC profiling in blood samples could identify aggressive tumor subtypes — including those with high activity of drivers like TOP2A and EZH2 — suggesting potential utility for noninvasive biomarkers to predict treatment response and disease progression beyond what is possible with genomic mutation analysis alone.
Genome-wide analysis of 5hmC signals has already proven successful for identifying cancer activity and cellular subtypes, characterizing core biological processes associated with the onset and progression of cancer, and mapping immune cells based on transcription levels and chromatin accessibility. This advanced form of methylation analysis has the potential to transform drug discovery and development by providing deeper, more actionable insights from clinical trials.












