portal-cornell.github.io

2025

Multi-Turn Code Generation Through Single-Step Rewards

Jain, Arnav Kumar, Gonzalez-Pumariega, Gonzalo, Chen, Wayne, Rush, Alexander M, Zhao, Wenting, and Choudhury, Sanjiban

2025

Process Reward Models for LLM Agents: Practical Framework and Directions

Choudhury, Sanjiban

2025

Imitation Learning from a Single Temporally Misaligned Video

Huey, William, Wang, Huaxiaoyue, Wu, Anne, Artzi, Yoav, and Choudhury, Sanjiban

2025

Robotouille: An Asynchronous Planning Benchmark for LLM Agents

Gonzalez-Pumariega, Gonzalo, Yean, Leong Su, Sunkara, Neha, and Choudhury, Sanjiban

In The Thirteenth International Conference on Learning Representations 2025

Motion Tracks: A Unified Representation for Human-Robot Transfer in Few-Shot Imitation Learning

Ren, Juntao, Sundaresan, Priya, Sadigh, Dorsa, Choudhury, Sanjiban, and Bohg, Jeannette

2025

Aligning LLMs with Domain Invariant Reward Models

Wu, David, and Choudhury, Sanjiban

2025

Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching

Jain, Arnav Kumar, Wiltzer, Harley, Farebrother, Jesse, Rish, Irina, Berseth, Glen, and Choudhury, Sanjiban

In The Thirteenth International Conference on Learning Representations 2025

Better than Your Teacher: LLM Agents that learn from Privileged AI Feedback

Choudhury, Sanjiban, and Sodhi, Paloma

In The Thirteenth International Conference on Learning Representations 2025

One-Shot Imitation under Mismatched Execution

Kedia, Kushal, Dan, Prithwish, Chao, Angela, Pace, Maximus Adrian, and Choudhury, Sanjiban

2025


2024

Query-Efficient Planning with Language Models

Gonzalez-Pumariega, Gonzalo, Chen, Wayne, Kedia, Kushal, and Choudhury, Sanjiban

2024

Learning to Move Like Professional Counter-Strike Players

Durst, D., Xie, F., Sarukkai, V., Shacklett, B., Frosio, I., Tessler, C., Kim, J., Taylor, C., Bernstein, G., Choudhury, S., Hanrahan, P., and Fatahalian, K.

Computer Graphics Forum 2024

MOSAIC: A Modular System for Assistive and Interactive Cooking

Wang, Huaxiaoyue, Kedia, Kushal, Ren, Juntao, Abdullah, Rahma, Bhardwaj, Atiksh, Chao, Angela, Chen, Kelly Y, Chin, Nathaniel, Dan, Prithwish, Fan, Xinyi, Gonzalez-Pumariega, Gonzalo, Kompella, Aditya, Pace, Maximus Adrian, Sharma, Yash, Sun, Xiangwan, Sunkara, Neha, and Choudhury, Sanjiban

arXiv 2024

Hybrid Inverse Reinforcement Learning

Ren, Juntao, Swamy, Gokul, Wu, Zhiwei Steven, Bagnell, J. Andrew, and Choudhury, Sanjiban

arXiv 2024

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Zhao, Wenting, Chiu, Justin T, Hwang, Jena D, Brahman, Faeze, Hessel, Jack, Choudhury, Sanjiban, Choi, Yejin, Li, Xiang Lorraine, and Suhr, Alane

In North American Chapter of the Association for Computational Linguistics 2024

InteRACT: Transformer Models for Human Intent Prediction Conditioned on Robot Actions

Kedia, Kushal, Bhardwaj, Atiksh, Dan, Prithwish, and Choudhury, Sanjiban

In IEEE International Conference on Robotics and Automation 2024


2023

Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought

Wang, Huaxiaoyue, Gonzalez-Pumariega, Gonzalo, Sharma, Yash, and Choudhury, Sanjiban

In Advances in Neural Information Processing Systems 2023

ManiCast: Collaborative Manipulation with Cost-Aware Human Forecasting

Kedia, Kushal, Dan, Prithwish, Bhardwaj, Atiksh, and Choudhury, Sanjiban

In Conference on Robot Learning 2023

Learning Shared Safety Constraints from Multi-task Demonstrations

Kim, Konwoo, Swamy, Gokul, Liu, Zuxin, Zhao, Ding, Choudhury, Sanjiban, and Wu, Zhiwei Steven

In Advances in Neural Information Processing Systems 2023

The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms

Vemula, Anirudh, Song, Yuda, Singh, Aarti, Bagnell, J. Andrew, and Choudhury, Sanjiban

In International Conference on Machine Learning 2023

Inverse Reinforcement Learning without Reinforcement Learning

Swamy, Gokul, Choudhury, Sanjiban, Bagnell, J. Andrew, and Wu, Zhiwei Steven

In International Conference on Machine Learning 2023

A Game-Theoretic Framework for Joint Forecasting and Planning

Kedia, Kushal, Dan, Prithwish, and Choudhury, Sanjiban

In IEEE/RSJ International Conference on Intelligent Robots and Systems 2023

Impossibly Good Experts and How to Follow Them

Walsman, Aaron, Zhang, Muru, Choudhury, Sanjiban, Fox, Dieter, and Farhadi, Ali

In International Conference on Learning Representations 2023

Guided Incremental Local Densification for Accelerated Sampling-based Motion Planning

Mandalika, Aditya, Scalise, Rosario, Hou, Brian, Choudhury, Sanjiban, and Srinivasa, Siddhartha S

In IEEE International Conference on Robotics and Automation 2023

Complementing a Policy with a Different Observation Space

Swamy, Gokul, Choudhury, Sanjiban, Bagnell, Drew, and Wu, Steven

2023


2022

The Blindfolded Traveler’s Problem: A Search Framework for Motion Planning with Contact Estimates

Saund, B., Choudhury, S., Srinivasa, S., and Berenson, D.

In The International Journal of Robotics Research 2022

Minimax Optimal Online Imitation Learning via Replay Estimation

Swamy, Gokul, Rajaraman, Nived, Peng, Matthew, Choudhury, Sanjiban, Bagnell, J Andrew, Wu, Zhiwei Steven, Jiao, Jiantao, and Ramchandran, Kannan

In Advances in Neural Information Processing Systems 2022

Sequence Model Imitation Learning with Unobserved Contexts

Swamy, Gokul, Choudhury, Sanjiban, Bagnell, J Andrew, and Wu, Zhiwei Steven

In Advances in Neural Information Processing Systems 2022

Causal Imitation Learning under Temporally Correlated Noise

Swamy, Gokul, Choudhury, Sanjiban, Bagnell, Drew, and Wu, Steven

In International Conference on Machine Learning 2022

Towards Uniformly Superhuman Autonomy via Subdominance Minimization

Ziebart, Brian, Choudhury, Sanjiban, Yan, Xinyan, and Vernaza, Paul

In International Conference on Machine Learning 2022


2021

Leveraging Experience in Lazy Search

Bhardwaj, Mohak, Choudhury, Sanjiban, Boots, Byron, and Srinivasa, Siddhartha

Autonomous Robots 2021

Expert Intervention Learning

Spencer, Jonathan, Choudhury, Sanjiban, Barnes, Matthew, Schmittle, Matthew, Chiang, Mung, Ramadge, Peter, and Srinivasa, Sidd

Autonomous Robots 2021

A Critique of Strictly Batch Imitation Learning

Swamy, Gokul, Choudhury, Sanjiban, Bagnell, J. Andrew, and Wu, Zhiwei Steven

arXiv preprint arXiv:2110.02063 2021

Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap

Swamy, Gokul, Choudhury, Sanjiban, Bagnell, J Andrew, and Wu, Steven

In International Conference on Machine Learning 2021

Learning Online from Corrective Feedback: A Meta-Algorithm for Robotics

Schmittle, Matthew, Choudhury, Sanjiban, and Srinivasa, Siddhartha S

arXiv preprint arXiv:2104.01021 2021

Feedback in Imitation Learning: The Three Regimes of Covariate Shift

Spencer, Jonathan, Choudhury, Sanjiban, Venkatraman, Arun, Ziebart, Brian, and Bagnell, J Andrew

arXiv preprint arXiv:2102.02872 2021

Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

Bhardwaj, Mohak, Choudhury, Sanjiban, and Boots, Byron

In International Conference on Learning Representations 2021

Bayesian Residual Policy Optimization: Scalable Bayesian Reinforcement Learning with Clairvoyant Experts

Lee, G., Hou, B., Choudhury, S., and Srinivasa, S. S.

In IEEE/RSJ International Conference on Intelligent Robots and Systems 2021


2020

Toward Fieldable Human-Scale Mobile Manipulation Using RoMan

In Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications II 2020

Learning from Interventions: Human-robot interaction as both explicit and implicit feedback

Spencer, J., Choudhury, S., Barnes, M., and Srinivasa, S.

In Robotics: Science and Systems 2020

Imitation Learning as f-Divergence Minimization

Ke, L., Choudhury, S., Barnes, M., Sun, W., Lee, G., and Srinivasa, S.

In Workshop on the Algorithmic Foundations of Robotics 2020

ICS: Incremental Constrained Smoothing for State Estimation

Sodhi, P., Choudhury, S., Mangelson, J. G., and Kaess, M.

In IEEE International Conference on Robotics and Automation 2020

Posterior Sampling for Anytime Motion Planning on Graphs with Expensive-to-Evaluate Edges

Hou, B., Choudhury, S., Lee, G., Mandalika, A., and Srinivasa, S.

In IEEE International Conference on Robotics and Automation 2020
