Weize Li

TARS Robotics

AIR, Tsinghua University

Weize Li

About

I am currently a Research Assistant (RA) at Tsinghua University and a Research Engineer at TARS Robotics, working with Prof. Wenchao Ding and Prof. Yilun Chen. Previously, I was fortunate to work closely with Prof. Xiaoxiao Long from NJU, Prof. Ping Tan from HKUST, and Prof. Hao Zhao from DISCOVER Lab. I was a visiting student at the Institute of Automation, Chinese Academy of Sciences in my senior year.

My long-term research goal is to build Embodied Intelligence with Dexterous Manipulation and Flexible Mobility. It should be able to perceive and imagine the real world, then derive insights through understanding to guide the action. I am currently focusing on {Robotics, 3D Vision, Graphics} for Embodied AI:

  • 🤖 Robotics
    Manipulation; Foundation Models for Robotics; Autonomous Driving.
  • 👀 3D Vision
    3D Scene Understanding (e.g.visual grounding, dense captioning, VQA); Robot Perception; Anomaly Detection.
  • 🪅 Graphics
    3D Reconstruction and Editing for Simulation; Digital Twins; World Models.

News

2025

August

🎉 Our work 'RoboGEM' is accepted by ACM MM 2025, RoboSoft'25 workshop as an Oral Presentation!

2025

August

🏆 We have released tech report of champion solution for WBCD Challenge.

2025

May

🇺🇸 I will participate the IEEE ICRA 2025 in person, see you in Atlanta!

2024

2024

October

🇮🇹 I will participate the ECCV 2024 in person to present our work "TOD3Cap", see you in Milano!

Publications

paper image
Taming VR Teleoperation and Learning from Demonstration for Multi-Task Bimanual Table Service Manipulation

Weize Li, Zhengxiao Han, Lixin Xu, Xiangyu Chen, Harrison Bounds, Chenrui Zhang, Yifan Xu.
Technical Report, 2025

We won the ICRA 2025 WBCD Table Service Track with a champion solution that combines VR teleoperation and Learning from Demonstrations for efficient and reliable bimanual robot manipulation.


paper image
RoboGEM: Learning Language-guided Robotic Manipulation via Generalizable and Efficient Feature Distillation

Chunzheng Wang, Yuhang Zheng, Xiangyu Chen, Weize Li, Songen Gu, Yupeng Zheng.
ACM International Conference on Multimedia (ACM MM), 2025
RoboSoft'25 Workshop - Oral - Best Paper Finalist

RoboGEM is a generalizable and efficient 3D representation that empowers robots to perform diverse manipulation tasks with speed and robustness.


paper image
PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth

Bu Jin*, Weize Li*, Baihan Yang, Zhenxin Zhu, Junpeng Jiang, Huan-ang Gao, Haiyang Sun, Kun Zhan, Hengtong Hu, Xueyang Zhang, Peng Jia, Hao Zhao.
International Conference on Intelligent Robots and Systems (IROS), 2025

PosePilot is a lightweight framework that enhances camera pose controllability in generative world models, enabling precise, consistent, and adaptable viewpoint synthesis for autonomous driving and beyond.


paper image
Radiance Field-Based 3D Editing: A Survey

Weize Li*, Tianshu Kuai*, Huan-ang Gao, Xiangyue Liu, Yuhang Zheng, Yupeng Zheng, etc.
In Submission

This survey reviews recent advances, methods, datasets, and applications in Radiance Field-based 3D Editing, highlighting challenges and future directions.


paper image
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes

Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao.
European Conference on Computer Vision (ECCV), 2024 - Poster

We introduce TOD3Cap, a benchmark dataset and network for outdoor 3D dense captioning, enabling accurate object localization and rich natural language descriptions from LiDAR and panoramic images.


paper image
PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection

Qiang Zhou*, Weize Li*, Lihan Jiang, Guoliang Wang, Guyue Zhou, Shanghang Zhang, Hao Zhao.
Neural Information Processing Systems (NeurIPS), 2023 - Poster
Dataset & Benchmark Track

We introduce the MAD dataset, PAD benchmark, and OmniposeAD method to tackle pose-agnostic object anomaly detection with diverse 3D anomalies and standardized evaluation.


paper image
IRFLMDNN: Hybrid Model for PMU Data Anomaly Detection and Re-filling with Improved Random Forest and Levenberg Marquardt Algorithm Optimized Dynamic Neural Network

Miao Yu, Chenyu Yang*, Weize Li*, Weijie Du, Jinglin Li.
Neural Computing and Application, 2023

We propose IRFLMDNN, a hybrid model combining random forests and dynamic neural networks for accurate anomaly detection and adaptive data refilling in PMU time series.


Other Projects

paper image
Awesome-What-Bimanual-Can-Do

Core Contributor.

Affiliated with the ICRA 2025 What Bimanual Teleoperation and Learning from Demonstration Can Do (WBCD)


ICRA 2025 What Bimanuals Can Do (WBCD) Challenge

Weize Li, Zhengxiao Han, Lixin Xu, Xiangyu Chen, Harrison Bounds, Chenrui Zhang, Yifan Xu.
🏆The 1st Place in Table Services Track

In the Table Services track, we tackled a series of demanding tasks under strict requirements for speed, precision, and reliability: unfolding a tablecloth (deformable-object manipulation), placing a pizza into the container (pick-and-place), and opening and closing a food storage box.

Research Experience

TARS Robotics

Jan 2025 - Present · Research Engineer

Research Area: Human-centric Embodied AI; Foundation Models for Robotics; Manipulation.
Supervisors: Prof. Wenchao Ding & Prof. Yilun Chen

AIR Innovation Center, Tsinghua University

Jan 2025 - Present · Research Assistant

Research Area: Robotic Manipulation; Visual Representation for Robotics; Zero-shot 3D Reasoning.
Supervisors: Prof. Yilun Chen

LightIllusions

April 2024 - Oct 2024 · Research Intern

Research Area: Robotic Manipulation
Supervisors: Prof. Xiaoxiao Long & Prof. Ping Tan

Institute for AI Industry Research (AIR), Tsinghua University

Aug 2022 - Dec 2024 · Research Intern & Research Assistant

Research Area: 3D Scene Understanding; Visual Reasoning; Anomaly Detection
Supervisor: Prof. Hao Zhao & Prof. Shanghang Zhang

Education

Institute of Automation, Chinese Academy of Sciences

Feb 2022 - Aug 2022 · Visiting Student

Research Area: Computer Vision & Machine Learning
Final Year: Complete Bachelor Thesis and conduct a research project.

Beijing University of Civil Engineering and Architecture

Sep 2018 - Jun 2022 · Undergraduate

B.Eng. in Mechatronics Engineering
Graduated with Best Bachelor Thesis Award

Honors & Awards

[2025] Champion Award (1st place), IEEE ICRA 2025 What Bimanuals Can Do (WBCD) Challenge in Table Services Track.

[2022] Best Bachelor Thesis Award, Beijing Education Commission (top 1% in 130,000 students).

[2022] Silver Award, Beijing Challenge Cup: Entrepreneurial Plan Competition in AI System Track (Rank.2).

Academic Services

Conference Review: NeurIPS’23, CVPR’24, ICRA'24, ICLR'25.

Journal Review: IJCV.