I am a Research Scientist at Google Research. My long-term research goal is to architect Native Omni Models as the cognitive engine for ASI. I focus on building multimodal foundation models to create unified generalist agents capable of mastering complex digital and physical environments.
I am currently exploring the intersection of World Models, Efficient AI, and System 2 Reasoning, specifically:
I am open to discuss the future of Multimodal LLMs, Multimodal Generative AI, World Models, and Agentic Foundation Models. I welcome collaborations across industry and academia. Authorized to work in US and Australia.
To accelerate the advent of machine intelligence and sustainable future
PhD in Computer Science
University of Technology Sydney
Master of Engineering
Shanghai Jiao Tong University
Visiting Student
Karlsruher Institut für Technologie
Bachelor of Engineering
Shanghai Jiao Tong University