Researchers have trained an artificial intelligence model to extract warning signs of advanced heart failure from routine echocardiogram videos, flagging disease progression years before conventional diagnostic methods catch it. The work, validated across multiple hospital systems and ultrasound equipment vendors, points to a future where a standard cardiac ultrasound exam doubles as an early-detection screen for conditions that often go undiagnosed until patients are critically ill. The approach also processes measurements in roughly one second, fast enough to fit into a busy clinical workflow without adding delays.
Spotting Heart Failure Worsening in About One Second
One of the sharpest demonstrations of this technology comes from a study in a European Society of Cardiology journal showing that AI-derived echocardiographic metrics can predict heart failure worsening using a composite clinical endpoint. The model focused on left ventricular ejection fraction (LVEF) and the E/e′ ratio, two measurements cardiologists already use to assess how effectively the heart pumps and relaxes between beats. What set the AI apart was speed and consistency: the system produced its measurements in approximately 1,053 milliseconds and improved the C-index, a measure of how well a model's risk scores rank patients by their actual outcomes, for both LVEF and E/e′ compared with manual interpretation.
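For readers unfamiliar with the metric, the C-index can be illustrated with a minimal sketch. The follow-up times, event flags, and risk scores below are invented for illustration, not data from the study:

```python
def concordance_index(times, events, scores):
    """Harrell's C-index: the fraction of comparable patient pairs whose
    predicted risk ordering matches the observed order of events.

    times  -- observed follow-up time for each patient
    events -- 1 if the event (e.g. heart failure worsening) occurred,
              0 if the patient was censored (no event observed)
    scores -- model risk scores (higher = higher predicted risk)
    """
    concordant, comparable = 0.0, 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable when patient i is known to have had
            # the event before patient j's follow-up ended.
            if events[i] == 1 and times[i] < times[j]:
                comparable += 1
                if scores[i] > scores[j]:
                    concordant += 1       # model ranked the pair correctly
                elif scores[i] == scores[j]:
                    concordant += 0.5     # ties count as half-correct
    return concordant / comparable

# A model that ranks every earlier-event patient as higher risk
# scores a perfect 1.0; random scores hover around 0.5.
print(concordance_index([2, 4, 5, 7], [1, 1, 0, 1], [0.9, 0.7, 0.3, 0.2]))
```

A C-index of 0.5 is no better than chance, so even a few points of improvement over manual interpretation, as reported in the study, represents a meaningful gain in ranking accuracy.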
That level of automation matters because traditional echocardiogram reads can vary from one sonographer or cardiologist to another. Small differences in how the ventricle is traced or how Doppler signals are sampled can move a patient across a treatment threshold, influencing decisions about medications, device therapy, or referral to advanced heart failure programs. An AI system that standardizes these measurements and delivers them in about a second removes a bottleneck that has long frustrated clinics, where patients often wait days or weeks between imaging, interpretation, and follow-up decisions.
Beyond speed, the model’s performance remained stable across different ultrasound vendors and hospital systems. That kind of generalizability is crucial for any tool meant to support real-world heart failure management, where equipment, workflows, and patient populations differ widely. The study’s composite endpoint (capturing events such as hospitalization, transplant, or death) also anchors the algorithm’s predictions in outcomes that matter directly to patients and health systems.
Tracking Amyloid Cardiomyopathy Years Before Nuclear Testing
A separate line of evidence focuses on transthyretin amyloid cardiomyopathy, a form of heart failure caused by misfolded protein deposits that stiffen the heart muscle. This condition is notoriously underdiagnosed because its symptoms, such as fatigue, edema, and shortness of breath, overlap with more common types of cardiomyopathy. Definitive diagnosis typically requires nuclear scintigraphy or biopsy, tests that are expensive, not universally available, and rarely ordered early in the disease course.
In a retrospective analysis of cohorts from Yale–New Haven and Houston Methodist, investigators reported that AI-generated probabilities from both echocardiographic videos and electrocardiograms rose well before traditional testing confirmed disease. The European Heart Journal study found that these risk scores began increasing up to three years prior to nuclear amyloid imaging, suggesting that subtle changes in cardiac structure and electrical patterns are detectable long before clinicians would typically suspect amyloidosis.
The clinical implications are substantial. Patients who eventually receive a diagnosis of cardiac amyloidosis often experience years of progressive symptoms, repeated hospitalizations, and exposure to therapies that do little to address the underlying pathology. If an algorithm can raise suspicion during a routine echocardiogram or ECG, clinicians could prioritize confirmatory nuclear imaging and genetic testing for those at highest risk. Earlier identification would allow initiation of disease-modifying therapies at a stage when the myocardium is less damaged, potentially improving survival and quality of life while reducing avoidable hospital stays.
One Video Clip as a Screening Tool
Building on that concept, a multicenter, multivendor investigation showed that deep learning can detect cardiac amyloidosis from a single apical four-chamber echocardiographic clip. In this work, the model analyzed one standard view and still achieved robust discrimination between amyloid and non-amyloid hearts. The authors positioned this capability, described in an open-access amyloidosis screening study, as a way to reduce both missed early-stage disease and unnecessary downstream testing.
For high-volume echo labs, the single-clip approach is particularly attractive. Community hospitals and outpatient cardiology practices perform large numbers of echocardiograms, but most studies are ordered to evaluate common problems such as valve disease or reduced ejection fraction. An automated amyloidosis screen that runs in the background on a clip already being captured adds no extra imaging time and minimal workflow friction. The principal challenge is calibrating sensitivity and specificity so that the algorithm captures true disease without generating excessive false positives that could overwhelm nuclear imaging capacity or lead to patient anxiety.
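The calibration challenge described above comes down to where the probability cutoff is set: lowering it catches more true disease but floods nuclear imaging with false positives, while raising it does the reverse. A minimal sketch, using invented probabilities and a hypothetical 0.5 cutoff rather than any threshold from the study:

```python
def screen_metrics(probs, labels, threshold):
    """Sensitivity and specificity of a screening cutoff.

    probs     -- model probabilities for each echo clip
    labels    -- 1 = confirmed amyloidosis, 0 = not
    threshold -- probability above which the screen flags a patient
    """
    tp = sum(p >= threshold and y == 1 for p, y in zip(probs, labels))
    fn = sum(p < threshold and y == 1 for p, y in zip(probs, labels))
    tn = sum(p < threshold and y == 0 for p, y in zip(probs, labels))
    fp = sum(p >= threshold and y == 0 for p, y in zip(probs, labels))
    sensitivity = tp / (tp + fn)  # share of true disease the screen catches
    specificity = tn / (tn + fp)  # share of healthy patients correctly cleared
    return sensitivity, specificity

sens, spec = screen_metrics(
    probs=[0.9, 0.8, 0.4, 0.2, 0.1],
    labels=[1, 1, 1, 0, 0],
    threshold=0.5,
)
```

In practice a deployment team would sweep the threshold across a validation set and choose an operating point that matches local nuclear imaging capacity.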
Importantly, the multicenter design and inclusion of multiple ultrasound vendors help address concerns that a model might overfit to a single institution’s imaging style. By demonstrating performance across diverse settings, the study supports the idea that AI-based amyloid screening could be deployed broadly, not just in specialized referral centers.
From Hospital Carts to Handheld Devices
Another key question is whether models trained on hospital-grade echocardiography generalize to images acquired on smaller, portable systems. A peer-reviewed analysis showed that AI models originally developed for standard transthoracic echocardiograms can be applied to prospective handheld cardiac ultrasound videos with encouraging results. In that work, which evaluated multiple sites, the investigators reported that LVEF classification performance on handheld devices remained high, with area-under-the-curve metrics comparable to those seen on traditional machines.
This transferability is essential for scaling early heart failure detection. Handheld ultrasound units are less expensive, easier to maintain, and can be used at the bedside in primary care clinics, emergency departments, or rural health posts. If AI software running on or alongside these devices can reliably identify patients with reduced ejection fraction or other high-risk features, clinicians could initiate guideline-directed therapy or refer patients to specialists much earlier in the disease trajectory. That would broaden access well beyond academic medical centers, where most of the initial algorithm development has taken place.
Such deployment also raises practical considerations, including how non-expert users acquire adequate image quality and how AI outputs are integrated into electronic health records. The handheld study underscores that, with appropriate training and quality controls, portable imaging paired with robust algorithms can extend sophisticated cardiovascular assessment into settings that previously lacked it.
Predicting Exercise Capacity Without a Treadmill
While imaging-based tools advance, researchers are also exploring ways to infer exercise capacity from existing clinical data. A team affiliated with the Cardiovascular AI Initiative at Weill Cornell Medicine developed a model that estimates peak oxygen consumption (peak VO2) using information already present in the electronic health record. Peak VO2, typically measured during cardiopulmonary exercise testing, is the gold standard for assessing functional capacity and determining eligibility for advanced therapies such as ventricular assist devices or heart transplantation.
The treadmill-based test, however, is technically demanding, unavailable at many hospitals, and difficult for the sickest heart failure patients to complete. In a peer-reviewed analysis of EHR-derived predictions, investigators showed that AI models could approximate peak VO2 using variables like demographics, laboratory values, medications, and prior imaging results. The model’s estimates tracked observed exercise capacity closely enough to support risk stratification and to flag patients who might benefit from referral to advanced heart failure centers, even when formal exercise testing was not feasible.
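As an illustration of the general approach, not the Weill Cornell model itself, a simple linear fit over a handful of tabular EHR variables shows how such data can yield a peak VO2 estimate and a referral flag. The feature set, patient values, and coefficients here are entirely hypothetical; the 14 mL/kg/min referral threshold is a commonly cited advanced-therapy cutoff, but the study's actual criteria may differ:

```python
import numpy as np

# Hypothetical EHR features per patient: [age, BMI, hemoglobin, LVEF],
# paired with measured peak VO2 (mL/kg/min) from exercise testing.
X = np.array([
    [54, 29.1, 13.2, 35],
    [67, 31.4, 11.8, 25],
    [48, 24.0, 14.5, 55],
    [72, 27.3, 12.1, 30],
    [60, 33.0, 12.9, 40],
], dtype=float)
y = np.array([16.5, 11.2, 24.8, 12.9, 15.1])

# Least-squares linear fit as a stand-in for the study's (more
# sophisticated) EHR-based estimator.
A = np.hstack([X, np.ones((len(X), 1))])   # append an intercept column
coef, *_ = np.linalg.lstsq(A, y, rcond=None)

def predict_peak_vo2(age, bmi, hgb, lvef):
    """Estimate peak VO2 (mL/kg/min) from EHR variables."""
    return float(np.dot([age, bmi, hgb, lvef, 1.0], coef))

def needs_referral(est_vo2, cutoff=14.0):
    """Flag patients below a commonly cited advanced-therapy cutoff."""
    return est_vo2 < cutoff
```

The point of the sketch is the workflow, not the model: a risk estimate computed from data already in the record can triage patients toward formal exercise testing or advanced heart failure referral.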
This strategy complements imaging-based AI by leveraging data that clinicians already collect in routine care. It also illustrates a broader trend: combining structured clinical information with image-derived features to generate more comprehensive risk profiles. As algorithms mature, integrated models could simultaneously interpret echocardiograms, ECGs, laboratory trends, and clinical histories to provide a unified assessment of disease severity and trajectory.
Validation, Bias, and Prospective Testing
Despite promising results, experts emphasize the need for rigorous validation and careful implementation. A recent overview in a digital health journal highlighted both the opportunities and pitfalls of applying machine learning to cardiovascular imaging. The authors of this methodological review stressed the importance of external validation, transparent reporting, and continuous performance monitoring once tools are deployed in clinical practice.
Prospective trials are beginning to address these concerns. One registered investigation, listed as NCT04064450, is evaluating how AI-assisted echocardiography influences diagnostic accuracy, workflow efficiency, and treatment decisions in real-world settings. Such studies will be critical for understanding not just whether algorithms can predict outcomes, but whether acting on those predictions actually improves survival, reduces hospitalizations, or lowers costs.
Equity is another central issue. Training data often overrepresent patients from large academic centers and underrepresent those from rural areas or minority communities. Without deliberate efforts to diversify datasets and audit models for bias, AI tools could inadvertently widen existing disparities in heart failure care. Prospective studies and post-deployment surveillance will need to examine performance across demographic groups and care environments, adjusting models or thresholds where necessary.
The Road to Routine Clinical Use
Taken together, these studies outline a future in which a standard echocardiogram, a handheld scan at the bedside, or even a review of the electronic health record can reveal early signs of advanced heart failure and related conditions long before patients become critically ill. Rapid, standardized measurements of LVEF and diastolic function, early detection of amyloid cardiomyopathy from single video clips, and non-exercise estimates of peak VO2 all point toward a more proactive, data-driven approach to cardiovascular care.
Realizing that vision will require more than technical accuracy. Health systems will need to integrate AI outputs into existing workflows, train clinicians to interpret and act on algorithmic risk scores, and develop reimbursement models that reward earlier diagnosis and prevention. Regulators and professional societies will have to establish standards for validation, reporting, and ongoing oversight. With careful design and robust evidence, however, AI-enhanced echocardiography and data-driven risk prediction could shift heart failure management from reactive crisis response to earlier, more personalized intervention, potentially changing the trajectory of disease for millions of patients worldwide.
*This article was researched with the help of AI, with human editors creating the final content.*