search instagram arrow-down

Ana Gainaru

Computer Scientist at Oak Ridge National Laboratory

Data management strategies for large scale workflows coupling simulations to AI

Thursday 19 February 2026

Abstract

Computer Scientist (co-lead of the Data Understanding thrust in SciDAC RAPIDS and lead of the Self-improving AI models thrust in The Transformational AI Models Consortium (ModCon))

The next generation of HPC application is represented by hybrid approaches that weave together traditional simulations and modern AI. However, a critical bottleneck in integrating HPC with AI is the “lack of awareness” between workflow components. The outputs of HPC applications are often analyzed only sparingly before archival, effectively becoming inaccessible for future training codes due to the manual, time-consuming processes of finding, and processing datasets for each analysis purpose, frequently outweighing the cost of re-running simulations. This fragmentation results in complex, brittle workflows where data management is treated as an afterthought. In this presentation, we propose a unified framework for managing the complex lifecycle of data in hybrid AI-HPC systems. We will address the limitations of current domain-specific solutions by introducing abstractions that map the relationships between raw simulation outputs, processed training sets, and surrogate model inference. By creating a system where data provenance and transformation history are persistent, we enable workflows that “learn” from previous executions. Attendees will learn how to design workflows that minimize redundant processing, facilitate cross-domain optimization transfer, and ensure that the massive datasets required for AI training remain accessible, structured, and reusable.

Bio

Dr. Ana Gainaru is a Research Scientist at Oak Ridge National Laboratory’s Computer Science and Mathematics Division. Her research addresses performance optimization, data management, and the advancement of AI-driven workflows for large-scale scientific applications. She currently leads research thrusts for the RAPIDS SciDAC project and the Transformational AI Models Consortium in the Genesis project. Additionally, she serves on the leadership team for the UT-ORII initiative, where she sits on the ORNL hiring board and mentors research focused on the development of surrogate models for fusion.

MW26 Slides

MW26 Videos

MW26 Q & A