Talk Details

Fall 2025

Lei Li

Lei Li

Associate Professor at CMU

2025-10-03 11:00 AM CT

Generative AI for Functional Protein Design

Generative AI is not only revolutionizing language processing but is also transforming scientific fields such as biology and chemistry, as exemplified by the 2024 Nobel Prize in Chemistry. In this talk, we explore three paradigms for functional protein design: sequence-based, geometry-based, and sequence/structure co-design. We introduce recent advanced generative AI models following these paradigms—IsEMPro, SurfPro, EnzyGen, and PPDiff—along with their sample-efficient learning methodologies. Additionally, we highlight four impactful applications: designing enzymes with biocatalytic functions, optimizing proteins for specific fitness goals, developing protein-protein binders, and creating antimicrobial peptides. We will also introduce InstructPro, a large protein model that follows instructions in human language and generates proteins binding to ligands. By emphasizing the powerful synergies between AI and molecular sciences, this talk seeks to inspire collaboration and drive advancements at the forefront of drug discovery, material innovation, and sustainable chemical manufacturing.

Jean Kossaifi

Jean Kossaifi

Senior Research Scientist at NVIDIA

2025-10-10 11:00 AM CT

Neural Operators for Scientific Applications: Learning on Function Spaces

Applying AI to scientific problems such as weather forecasting and aerodynamics is an active research area, promising to help speed up scientific discovery and improved engineering design. Typically, these applications involve modeling complex spatiotemporal processes governed by partial differential equations (PDEs) defined on continuous domains and at multiple scales, essentially learning mappings between infinite-dimensional function spaces. Traditional deep learning methods, however, map between finite-dimensional vector spaces. Neural operators overcome this limitation by generalizing deep learning to learn mappings directly between function spaces, enabling them to effectively replace traditional PDE solvers while offering substantial speed improvements, often several orders of magnitude faster. In this talk, I will introduce the fundamental concepts behind neural operators, illustrate their effectiveness on practical problems such as weather forecasting, and briefly discuss how computational efficiency can be further enhanced using tensor algebraic methods. Finally, I will touch on practical implementation aspects in Python, demonstrating how these concepts can be applied in practice using open-source software.

Yihang Wang

Yihang Wang

Assistant Professor at Case Western Reserve University

2025-10-17 11:00 AM CT

Generative AI for Biomolecular Discovery: Overcoming Sampling Challenges in Molecular Simulations

Molecular dynamics (MD) simulations are emerging as essential tools for understanding biomolecular systems. However, challenges such as the rare event problem remain. I will showcase how generative AI methods can be carefully adapted to address this challenge. I will introduce how diffusion models, trained to generate molecular ensembles, improve molecular configuration sampling. Furthermore, by incorporating external physical or chemical data, these generative models can predict unseen or novel molecular structures, thus expanding the predictive capacity of MD beyond traditional simulation limits.

Jiachen Yao

Jiachen Yao

Ph.D. student at Caltech

2025-10-17 11:30 AM CT

Enhancing Diffusion-based Inverse Solver by Regularization

Inverse problems are fundamental in scientific computing, yet they pose significant challenges due to high dimensionality, ill-posedness, and uncertainty. Conventional approaches like optimization or Markov Chain Monte Carlo often encounter limitations in computational scalability and robustness to noise. Generative models offer promising alternatives for solving partial differential equation (PDE) inverse problems, but they often lack efficiency, stability, or physics consistency. In this talk, the speaker, Jiachen, explores how incorporating specialized biases and regularizations can significantly enhance the accuracy and efficiency of these models in scientific applications. Jiachen will present FunDPS, a novel framework that leverages the function-space inductive bias to deliver substantial gains in accuracy and speed. Complementing this, he introduces an equivariance regularization that exploits the problems’ intrinsic symmetries, yielding consistent improvements across diverse inverse problems. The discussion will explore both theoretical and practical sides of GenAI for sciences.

Rose Yu

Rose Yu

Associate Professor at UCSD

2025-10-24 11:00 AM CT

Grounding foundation models for physical sciences

Despite the huge success of foundation models across fields, they still suffer from hallucinations and can produce physically inconsistent outputs. To leverage foundation models for physical sciences, it is critical to integrate first principles and physical laws to the learning and reasoning of these models. In this talk, I will discuss our on-going effort to ground foundation models, including diffusion models and large language models for physical sciences. In particular, I will discuss dynamics-informed diffusion models for emulating complex fluids and an adaptive framework for LLM agents to use scientific tools. I will demonstrate the use cases of our methods on applications in climate science and epidemiology modeling.

Doug Downey

Doug Downey

Senior Director at Allen Institute for AI

2025-10-31 11:00 AM CT

Asta: Building and Evaluating Scientific AI Assistants

AI has tremendous potential for accelerating science---helping researchers access the literature, analyze data, execute experiments, and more. Asta, a project at the Allen Institute for AI, aims to deliver comprehensive scientific assistants that support all aspects of a scientific workflow. Rigorously evaluating such systems is a fundamental challenge, which led us to develop AstaBench, a holistic benchmark suite and leaderboard spanning the scientific discovery process. The work includes evaluations for scientific “deep research” systems, introducing methods for measuring coverage and precision of both answers and their attributions. Fully realizing AI's potential for scientists will require breakthroughs in scalable supervision, robust evaluation, and personalized and proactive assistance. To facilitate progress toward these goals, the Asta project provides open-source corpora, tools, and APIs.

Peichen Zhong

Peichen Zhong

Assistant Professor at NUS

2025-11-06 11:00 AM CT

Foundation potentials and generative modeling for materials sciences

Materials modeling with atomistic simulation has become an indispensable tool in computational materials science, enabling precise property predictions and mechanistic insights across a wide range of chemical and structural environments. While recent advancements in artificial intelligence (AI)-assisted techniques, such as machine learning interatomic potentials (MLIPs) trained on extensive databases (e.g., foundational potential), have significantly enhanced temporal and spatial simulation capabilities, exploring high-dimensional chemical spaces with quantum-level accuracy remains computationally demanding. Simultaneously, the emergence of generative modeling has demonstrated its potential to revolutionize computational materials and chemistry through generative approaches.

In this presentation, I will outline recent progress in universally augmenting foundational potentials with charge and electrical response prediction, referred to as Latent Ewald Summation (LES) for generalized learning schemes of atomic charges and long-range interactions. Beyond the advancements in foundational potential, I will discuss the diffusion-based deep generative model (CHGGen) that integrates inpainting generation and foundation potential optimization for crystal structure prediction. I will showcase how this model can be used to elucidate atomic configurations and Li transport properties within solid-electrolyte interphases.

Chao Huang

Chao Huang

Assistant Professor at HKU

2025-11-14 11:00 AM CT

From LLMs to Agents: Challenges and Opportunities

This talk examines the evolution from LLMs to autonomous AI agents, addressing key technical challenges including multi-step reasoning, tool integration, and multi-modal context understanding. We discuss transformative applications across different critical domains: Finance, Arts, Engineering, and Science. Through technical insights and case studies, we discuss how AI agents are reshaping traditional paradigms and creating unprecedented opportunities for human-AI collaboration.

Apostolos Filippas

Apostolos Filippas

Assistant Professor at Fordham University

2025-11-21 11:00 AM CT

Large Language Models as Simulated Economic Agents

This talk introduces a working paper on using LLMs as simulated economists. We argue that newly-developed LLMs, because of how they are trained and designed, are implicit computational models of humans—a Homo silicus. LLMs can be used like economists use Homo economicus: they can be given endowments, information, preferences, and so on, and then their behavior can be explored in scenarios via simulation. We discuss potential applications, conceptual issues, and why this approach can inform the study of humans. The talk will also introduce the Expected Parrot Domain-Specific Language (EDSL), a package that streamlines computational social science and market research using AI. EDSL enables researchers to design and run surveys and experiments with large numbers of AI agents or LLMs simultaneously, as well as perform complex data-labeling and related research tasks efficiently.

Arvind Ramanathan

Arvind Ramanathan

Computational Science Leader at Argonne National Laboratory

2025-12-12 11:00 AM CT

Towards Complex Biological Systems Design with Scalable Agentic Reasoning

Intrinsically disordered proteins (IDPs) represent challenging therapeutic targets due to their conformational heterogeneity yet play critical roles in numerous diseases. We present StructBioReasoner, a scalable multi-agent system for autonomous IDP-targeting biologics design that addresses this challenge through a tournament-based reasoning framework. Specialized agents focused on structural stability, evolutionary conservation, energetic optimization, and rational design principles compete to generate and refine engineering hypotheses, enabling parallel evaluation and natural distribution of computational work. Each agent integrates domain knowledge with computational tools including AlphaFold-based structure prediction, molecular dynamics simulation, and physics-based stability prediction, autonomously reasoning about tool selection and coordinating execution on HPC infrastructure. We demonstrate three key contributions: a multi-agent architecture specifically designed for IDP ensemble properties, a scalable tournament framework for efficient parallel hypothesis generation, and validation through case studies showing autonomous identification of stabilizing mutations matching literature-validated strategies. This work establishes a foundation for autonomous discovery of IDP-targeting therapeutics on emerging exascale platforms.