Zhuo Xu

I'm a research scientist at Google DeepMind, where I work on robotics and foundation models. I received my Ph.D. from UC Berkeley, advised by Prof. Masayoshi Tomizuka, and B.E. from Tsinghua University.

Email     Google Scholar     LinkedIn     Twitter

Selected Publications

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

Hao-Tien Lewis Chiang*, Zhuo Xu*, Zipeng Fu*, Mithun George Jacob, Tingnan Zhang, Tsang-Wei Edward Lee, Wenhao Yu, Connor Schenck, David Rendleman, Dhruv Shah, Fei Xia, Jasmine Hsu, Jonathan Hoech, Pete Florence, Sean Kirmani, Sumeet Singh, Vikas Sindhwani, Carolina Parada*, Chelsea Finn*, Peng Xu*, Sergey Levine*, Jie Tan*
Conference on Robot Learning (CoRL) 2024

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Soroush Nasiriany, Fei Xia, Wenhao Yu, Ted Xiao, Jacky Liang, Ishita Dasgupta, Annie Xie, Danny Driess, Ayzaan Wahid, Zhuo Xu, Quan Vuong, Tingnan Zhang, Tsang-Wei Edward Lee, Kuang-Huei Lee, Peng Xu, Sean Kirmani, Yuke Zhu, Andy Zeng, Karol Hausman, Nicolas Heess, Chelsea Finn, Sergey Levine, Brian Ichter
International Conference on Machine Learning (ICML) 2024

Spatial VLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities

Boyuan Chen*, Zhuo Xu*, Sean Kirmani, Brian Ichter, Danny Driess, Pete Florence, Dorsa Sadigh, Leonidas Guibas, Fei Xia
Conference on Computer Vision and Pattern Recognition (CVPR) 2024

Multi-Agent Trajectory Generation with Diverse Contexts

Zhuo Xu, Rui Zhou, Yida Yin, Huidong Gao, Masayoshi Tomizuka, Jiachen Li
International Conference on Robotics and Automation (ICRA) 2024


Open x-embodiment: Robotic learning datasets and rt-x models

Open X-Embodiment Collaboration led by Google DeepMind
International Conference on Robotics and Automation (ICRA) 2024
★ Best Paper Award ★

Distributed Multi-agent Interaction Generation with Imagined Potential Games

Lingfeng Sun, Pin-Yun Hung, Changhao Wang, Masayoshi Tomizuka, Zhuo Xu
Amerizan Control Conference (ACC) 2024


Generative Expressive Robot Behaviors using Large Language Models

Karthik Mahadevan, Jonathan Chien, Noah Brown, Zhuo Xu, Carolina Parada, Fei Xia, Andy Zeng, Leila Takayama, Dorsa Sadigh
International Conference on Human Robot Interaction (HRI) 2024
★ Best Paper Award ★

Rt-trajectory: Robotic task generalization via hindsight trajectory sketches

Jiayuan Gu, Sean Kirmani, Paul Wohlhart, Yao Lu, Montserrat Gonzalez Arenas, Kanishka Rao, Wenhao Yu, Chuyuan Fu, Keerthana Gopalakrishnan, Zhuo Xu, Priya Sundaresan, Peng Xu, Hao Su, Karol Hausman, Chelsea Finn, Quan Vuong, Ted Xiao
International Conference on Learning Representations (ICLR) 2024