CS Colloquium Speaker
Speaker: Yilun Du, Massachusetts Institute of Technology
Date: Thursday, April 4
Time: 12:30pm ET
Location: CS 105
Host: Felix Heide
Event page: https://www.cs.princeton.edu/events/26595
Register for live-stream online here: https://princeton.zoom.us/webinar/register/WN_6ikSJRvFQb-ywv0jmzKgxA

Title: Generalizing Beyond the Training Distribution through Compositional Generation

Abstract: Generative AI has led to stunning successes in recent years but is fundamentally limited by the amount of data available. This is especially limiting in the embodied setting, where an agent must solve new tasks in new environments. In this talk, I’ll introduce the idea of compositional generative modeling, which enables generalization beyond the training data by building complex generative models from smaller constituents. I’ll first introduce energy-based models and illustrate how they enable compositional generative modeling. I’ll then show how such compositional models enable us to synthesize complex plans for unseen tasks at inference time. Finally, I’ll show how such compositionality can be applied to multiple foundation models trained on various forms of Internet data, enabling us to construct decision-making systems that can hierarchically plan and solve long-horizon problems in a zero-shot manner.

Bio: Yilun Du is a final-year PhD student at MIT CSAIL advised by Leslie Kaelbling, Tomas Lozano-Perez and Joshua Tenenbaum. His research spans the fields of machine learning and robotics, with a focus on generative models. He is supported by the NSF Graduate Research Fellowship and was previously a research fellow at OpenAI, a visiting researcher at FAIR and a student researcher at Google DeepMind.