Letian (Zac) Chen

Research Scientist at Waymo

ABOUT

Hi! I am Letian (Zac) Chen, a research scientist at Waymo.

I love researching and implementing cool stuff that works in the real world. I am interested in all kinds of intelligence problems, spanning both artificial and human intelligence. Specifically, I approach the intelligence problem from a reinforcement learning perspective. My research enables robots to infer humans' intent in Learning from Demonstration settings, teasing out heterogeneity and suboptimality from the demonstrations. I envision that humans and machines share certain sources of intelligence, including but not limited to reinforcement learning (the dopamine system), hierarchical learning (the hippocampus), and meta learning (the prefrontal cortex). I hope to unravel the mystery of intelligence through my work and make robot assistance accessible to everyone!

If you want to learn more about me, please feel free to contact me.

EDUCATION

Doctor of Philosophy: Computer Science
Georgia Institute of Technology

GRADUATED IN MAY 2025

Master of Science: Computer Science
Georgia Institute of Technology

GRADUATED IN MAY 2020

Bachelor of Science: Computer Science and Technology
Peking University

GRADUATED IN JUN 2018

Bachelor of Science: Psychology
Peking University

GRADUATED IN JUN 2018



WORK

Research Scientist
Waymo

JUNE 2025 - PRESENT

Research Intern
Waymo

  • Designed input-output representations for fine-tuning large Vision-Language Models (VLMs) for the vehicle planning task.
  • Proposed a novel Reinforcement Learning algorithm for fine-tuning VLMs towards planning metrics, replacing the usual token-matching objectives in LM training.
  • Developed training and evaluation pipeline infrastructure for the large VLM; experiments show significant improvements in target behavior metrics via the proposed method.

MAY 2024 - DEC 2024

Research Intern
Toyota Research Institute

  • Implemented DIAYN to generate diverse driving policies for autonomous racing.
  • Proposed a novel algorithm, Learn Thy Enemy, to model and leverage opponent information in multi-car racing.
  • Deployed DIAYN and LTE policies on motion-simulator hardware and demonstrated high performance both qualitatively and quantitatively.

MAY 2023 - AUG 2023

Reinforcement Learning Intern
iRobot Corporation

  • Identified real-world challenges of Offline Policy Evaluation (OPE) methods.
  • Created an easy-to-use benchmark dataset in which those real-world challenges are present.
  • Proposed an ad hoc OPE algorithm selection method via validation mechanisms.

MAY 2021 - AUG 2021



RESEARCH

Safe Learning from Demonstration

  • Created a new modality for users to specify safe vs. unsafe states for robots via demonstrations.
  • Proposed a novel shielding algorithm, SECURE, that can be applied to policies to enforce customized, user-defined safety bounds via a combination of a data-driven control barrier function and task-aware safe action search.
  • Tested SECURE on two simulated robotic control tasks and a real robot kitchen cutting task in which the robot is equipped with a knife; showed that SECURE successfully prevents all unsafe executions, such as a human hand entering the robot's cutting space.

2022-2023

Paper

Learning Interpretable Tree-based Control Policies for Autonomous Driving

  • Developed interpretable, tree-based continuous-control models that allow gradient updates.
  • Demonstrated the strong qualitative and quantitative performance of the proposed model in comparison with black-box neural networks in 10+ driving scenarios.
  • Verified interpretability with user studies showing that the proposed model is easier and faster to interpret than neural networks and other interpretable models.

2022-2023

Paper

Learning from Offline Heterogeneous Demonstrations

  • Analyzed real Mars rover driving data and identified heterogeneity among rover drivers.
  • Proposed a novel IRL framework, DROID, to accommodate the offline learning required by the application while allowing learning from heterogeneous demonstrations via dual reward and policy distillation.
  • Applied DROID on two simulated robotic control tasks and the real Mars rover path-planning problem; achieved better learning and generalization to unseen conditions in all three domains.

2022-2023

Paper

Fast Lifelong Adaptive Learning from Demonstrations

  • Analyzed the personalization problem in the lifelong learning-from-demonstration process, where a large number of heterogeneous demonstrations arrive sequentially via federation among users.
  • Proposed a novel IRL framework, FLAIR, to provide efficient personalization and scalability by constructing policy mixtures with a concise set of prototypical strategy policies.
  • Applied FLAIR on three virtual robotic control tasks and a real robot table-tennis task; achieved better personalization with significantly higher sample efficiency.

2021

Paper

Learning from Suboptimal Demonstration via Self-Supervised Reward Regression

  • Characterized policy performance degradation from noise injection by a sigmoid function.
  • Proposed a novel IRL framework, SSRR, to learn policies that are better than suboptimal demonstrations by inferring the idealized reward function (i.e., the latent intent of the demonstrator).
  • Applied the algorithm to three virtual robotic tasks and a real robot table-tennis task; achieved accurate recovery of the demonstrator's intention and a better-than-best-demonstration policy.

2020

Paper

Learning from Heterogeneous Demonstrations

  • Modeled humans' latent objectives via a shared task reward and individual strategy rewards.
  • Proposed a novel IRL framework, MSRD, to jointly infer task reward and strategy reward to gain a better estimation of both.
  • Applied the algorithm to two virtual robot control tasks and one real robot table-tennis task; achieved better task-reward learning than SOTA AIRL, extracted precise strategic rewards, and optimized versatile policies that resemble the heterogeneous demonstrations.


AWARDS

Amazon Science Scholarship for AAAI 2022

Best paper finalist at the Conference on Robot Learning (CoRL 2020)

First place in Brainhack ATL 2019 Track 2

Graduate of Merit in Beijing

Excellent Graduate of Peking University

Zhang Wenjin Scholarship

Scholarship for Undergraduate Research

First Prize, National Olympiad in Informatics in Provinces (Advanced Group)



SKILLS

Python


TensorFlow


MATLAB


C/C++


Linux


Java


iOS Development (Swift)


Data Analysis (SQL, R, SAS)


Web Front End (HTML, CSS, JavaScript)



CONTACT

SOCIAL LINKS

Created based on BLACKTIE.CO