Morning Overview

New method exposes Epstein-Barr virus in blood with routine DNA tests

Researchers have found a way to detect persistent Epstein-Barr virus directly from routine blood-based genome sequencing, bypassing the antibody tests that have defined EBV diagnostics for decades. The work, published in Nature, analyzed hundreds of thousands of whole-genome sequences from two of the largest biobank programs in the world and identified EBV DNA fragments hiding among human genetic data. Because more than 90 percent of adults carry EBV for life, and the virus is tied to cancers, autoimmune conditions, and neurological disease, the ability to measure its persistence at population scale could reshape how clinicians assess long-term health risks.

Mining Routine Genome Data for a Hidden Virus

Standard EBV testing has long depended on antibody-based blood panels that target proteins such as viral capsid antigen (VCA), early antigen (EA), and Epstein-Barr nuclear antigen (EBNA), as described by the CDC laboratory guidance. The agency also notes that the Monospot test is not recommended for general diagnostic use. These serological tools can confirm whether someone was infected at some point, but they do a poor job of measuring how much virus persists in the body years or decades later. That gap matters because growing evidence links sustained EBV activity to diseases ranging from nasopharyngeal carcinoma to multiple sclerosis.

The new approach sidesteps serology entirely. Instead of looking for the immune system’s response to EBV, researchers extracted EBV DNA read-pairs that were incidentally captured during standard whole-genome sequencing of human blood samples. According to the primary Nature analysis, the team mapped EBV read-pairs from 486,315 participants in the UK Biobank and 336,123 participants in the All of Us Research Program. The detection of these viral fragments in sequencing data that was originally collected for human genetics research, not virology, is what makes the method so distinctive. It repurposes existing infrastructure rather than requiring new sample collection or specialized viral assays.

Two Biobanks, One Viral Signal

The scale of the datasets is central to the study’s significance. A closely related Nature paper used UK Biobank data from approximately 490,560 participants alongside approximately 245,394 whole-genome sequences plus linked health records from All of Us. The slight difference in reported UK Biobank participant counts between the two papers (486,315 versus approximately 490,560) likely reflects different filtering criteria or data freezes, but both figures describe the same underlying cohort. The All of Us genomics release itself encompasses roughly 245,388 clinical-grade whole-genome sequences, complete with data harmonization, quality control, and diversity metrics that make the program one of the most demographically varied genomic resources available. Researchers access these data through the program’s Researcher Workbench, which was also used to derive EBV read counts from participants’ blood-derived DNA.

By scanning these massive libraries for EBV-specific sequences, the investigators treated viral read detection as a surrogate for increased EBV persistence, according to the Nature study. To reduce false signals, the related paper describes masking repetitive EBV regions so that only high-confidence viral reads contributed to the analysis. This computational filtering step is important because the EBV genome contains stretches of repetitive DNA that could produce spurious matches against human sequences. The result is a quantitative estimate of circulating EBV burden drawn from data that already exists in biobank repositories worldwide, a resource that had been sitting untapped for viral surveillance and now offers a new lens on chronic infection.

Why Conventional Screening Falls Short

The limitations of antibody-only testing become clearer when measured against what DNA-based methods have already achieved in cancer screening. A large prospective trial enrolling approximately 20,174 adults in a high-incidence region demonstrated the clinical utility of measuring circulating cell-free EBV DNA in plasma to screen asymptomatic individuals for nasopharyngeal carcinoma. That study reported sensitivity and specificity metrics for its EBV DNA screening protocol and used a repeat testing algorithm with confirmatory endoscopy to reduce false positives. It showed that a blood-based DNA test could catch early-stage cancer in people who had no symptoms, something antibody panels alone could not reliably do.

An independent evaluation published in the Journal of Clinical Oncology directly compared EBV antibody-based screening against EBV DNA-based algorithms for nasopharyngeal carcinoma detection, including operational feasibility modeling. Some of the DNA-based strategies used a two-stage approach: real-time PCR as an initial screen followed by next-generation sequencing triage to confirm positive results. This layered workflow reduced unnecessary follow-up procedures while maintaining diagnostic accuracy. The new biobank-derived method extends this logic from targeted cancer screening to broad population-level viral monitoring, though it has not yet been validated for individual clinical decisions or for use as a stand-alone diagnostic in routine practice.

From Viral Load to Disease Risk

EBV is an endemic herpesvirus implicated in autoimmunity, cancer, and neurological disorders, and a recent review in PubMed highlights how viral DNA persistence in blood may serve as a marker of long-term risk. The ability to quantify its persistence at scale opens a path toward understanding which host genetic factors determine whether the virus stays quiet or drives chronic disease. Researchers at Baylor College of Medicine described the use of existing whole-genome sequencing datasets to pinpoint 22 human genes associated with increased rates of chronic disease following common viral infections, including EBV. Their work suggests that subtle differences in immune regulation, cell-cycle control, and DNA repair can tilt the balance between benign latency and harmful viral persistence.

The new Nature study builds on this concept by embedding EBV DNA measurements directly into large-scale human genetic analyses. Instead of treating infection status as a simple yes-or-no variable, the researchers model EBV read depth as a quantitative trait that can be correlated with host genotypes, clinical diagnoses, and environmental exposures. This framework could help disentangle why some EBV-positive individuals remain healthy while others develop conditions such as lymphoma or multiple sclerosis. It also offers a template for studying other latent viruses that integrate or persist in blood cells, turning population genomics into a powerful tool for mapping the long arc of infection-related disease.

Translating Population Signals Into Practice

Although the biobank-based approach is not ready for front-line diagnostics, it points toward practical ways that clinical laboratories and public health agencies might adapt. One avenue is to incorporate low-cost viral read detection into future whole-genome sequencing pipelines used for rare disease workups or cancer risk assessment, effectively adding a viral “side panel” to every genome. Another is to use aggregated EBV read data for surveillance, tracking how viral persistence patterns shift across age groups, regions, or immunosuppressed populations. Because the method relies on data already being generated for other purposes, its marginal cost could be relatively low once standardized workflows are in place.

Translating these insights into routine care will require careful validation and clear guidance. Regulatory bodies and professional societies will need to define when EBV DNA detected incidentally in a genome sequence should trigger follow-up testing, and which confirmatory assays (such as quantitative PCR or targeted plasma sequencing) are appropriate. The Nature authors emphasize that their EBV read counts are not equivalent to clinical viral load measurements. More work is needed to calibrate thresholds that predict meaningful risk. Access to the full methodological details may require authentication through the Nature sign-in portal, which hosts the supplementary protocols and code used for read extraction and filtering.

Informing Patients and the Public

As EBV DNA testing moves from specialized oncology trials into broader genomic workflows, communication with patients will be crucial. Many people are unaware that EBV infection is nearly universal, or that most infections never lead to serious illness. Educational materials from public health agencies can help contextualize new findings, explaining that the presence of EBV DNA in a sequencing report does not automatically mean a person has cancer or will develop it. The CDC media library already aggregates technical resources on EBV laboratory testing that could be adapted into patient-facing explanations as genomic approaches evolve.

There is also an opportunity to use subscription-based outreach tools to keep clinicians and researchers updated as evidence accumulates. Services like the CDC featured updates can distribute new guidelines, assay performance data, and risk communication templates as they are developed. In parallel, laboratory professionals will need training on how to interpret incidental viral findings from genome sequencing, including when to reassure, when to repeat testing, and when to refer patients for specialist evaluation. By combining large-scale genomic surveillance, rigorous clinical validation, and clear public messaging, the emerging ability to detect persistent EBV from routine sequencing could shift from a research curiosity to a practical tool for managing long-term health risks.

More from Morning Overview

*This article was researched with the help of AI, with human editors creating the final content.