We've just announced IQ AI.
Haotian Tang is a computer scientist specializing in systems and machine learning (SysML). His work focuses on efficient deep learning, particularly for 3D perception and large-scale foundation models. His career includes academic research at the Massachusetts Institute of Technology (MIT) and industry positions at companies including Waymo, NVIDIA, Google DeepMind, and Meta Superintelligence Labs. [1] [2]
Tang attended Shanghai Jiao Tong University (SJTU), where he graduated in 2020 with a Bachelor of Engineering degree in Computer Science and Technology. Following his graduation from SJTU, Tang enrolled at the Massachusetts Institute of Technology (MIT). He earned a Master of Science in Electrical Engineering and Computer Science in 2022 and is a Ph.D. candidate in the same department, with an expected graduation in 2025. At MIT, he is a member of the Han Lab, advised by Professor Song Han. [1] [3]
Tang began his career with a software engineering internship at Agora.io in 2017. While studying at Shanghai Jiao Tong University in 2019, he undertook a research internship at Tencent, where he worked on computer vision and machine learning, and also served as a research assistant in the university's Department of Computer Science. From 2019 to 2020, he worked as a remote research intern under the guidance of MIT Professor Song Han, focusing on efficient 3D deep learning. [1]
In 2020, Tang commenced his Ph.D. studies at MIT, where his research centered on systems and machine learning. His work during this period led to multiple publications on topics such as 3D neural networks, hardware efficiency for sparse data, and multi-sensor fusion for autonomous systems. Alongside his academic research, Tang completed several industry internships. In 2022, he interned at OmniML, which was later acquired by NVIDIA. In 2023, he was a research intern at Waymo, where he worked on multimodal behavior prediction. He followed this with an internship at NVIDIA in 2024, where his work focused on developing efficient visual generation models. [1]
In early 2025, Tang joined Google DeepMind as a research scientist, contributing to large-scale pretraining for world simulation projects. Later that year, he moved to Meta as a research scientist on the Superintelligence team, where he worked on multimodal foundation models. [1]
Tang's research addresses efficiency and performance challenges in deep learning systems. His work spans the co-design of algorithms and systems for large language models (LLMs), 3D point cloud processing for autonomous driving, and multi-sensor fusion. [1]
A significant portion of Tang's research has been dedicated to making large language models more efficient for both inference and fine-tuning.
Tang has also worked extensively on optimizing deep learning models for sparse and irregular 3D point cloud data, which is crucial for applications such as autonomous driving and augmented reality.