Weize Li (李维泽)

I am currently spending my gap year as a research intern student at Institute for AI Industry Research (AIR), Tsinghua University. I am advised by Prof. Hao Zhao from DISCOVER Lab. Previously, I was a visiting student at Institute of Automation, Chinese Academy of Sciences in my senior year. And I obtained the Bachelor degree in Mechatronics Engineering from Beijing University of Civil Engineering and Architecture, advised by Prof. Miao Yu.

I am a active member of AnySyn3D, a non-profit research interest group comprising individuals with a strong interest in exploring research problems and cutting-edge technologies in any topics of 3D.

I am actively looking for the PhD opportunity in Fall 2025.

Email  /  CV  /  Google Scholar  /  GitHub  /  Twitter

profile photo
News

  • 2023-09:🎉PAD was accepted to NeurIPS 2023 as Poster!
  • 2023-08:😎Welcome to awesome-nerf-editing, find resource for exploring 3D editing!
  • 2023-06:👨‍🎓PKU CoRe Lab Visiting! Many thanks to Prof. Yixin Zhu's advices on my future study.
  • 2023-02:🎉IRFLMDNN was accepted to Neural Computing & Applications!
  • 2022-10:🎉My graduation dissertation was selected as 2022 Beijing Outstanding Undergraduate Dissertation Award (top1%)!
  • 2022-10:🎉I successfully defended Summer Research: McADTR. Thanks to my advisors and Prof. Li Yi's insightful comments.
  • 2022-08:😮‍💨I finished my visiting at CASIA and it was a pleasure to work with Lei, Yu, Xiaomeng and Xiaomin!
  • Research

    My long-term research goal is to build embodied agents that can effectively understand 3D worlds based on vision and be able to combine visual concepts and external knowledge for interpretable reasoning. Further, being able to apply the learned knowledge from reconstruction to generation and editing the digital world, towards the ultimate goal from understanding to changing the world. So I am up for anything related research and currently focusing on:

  • 👀Vision: 3D scene understanding with multi-modal; Visual-grounded reasoning; Anomaly Detection.
  • 🪅Graphics: Scene representations (NeRFs, 3DGS); 3D scene/object reconstruction, generation and editing.
  • 🤖Robotics: Robot manipulation; Action planning; Autonomous driving.
  • (: corresponding author; *: equal contribution)
    Publications
    dise TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes.
    Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, et al., Xiaoxiao Long, Yilun Chen, Hao Zhao.
    Under Review, 2024
    [arXiv] [Code]
    [Special Ack. to Dave Zhenyu Chen@TUM for valuable proofreading and insightful suggestions.]

    We introduce the new task of outdoor 3D dense captioning with TOD3Cap dataset; We propose TOD3Cap network, leveraging the BEV representation to encode sparse outdoor scenes, and combine Relation Q-Former with LLaMA-Adapter to dense captioning in the open-world.

    dise PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection
    Qiang Zhou*, Weize Li*, Lihan Jiang, Guoliang Wang, Guyue Zhou, Shanghang Zhang, Hao Zhao.
    NeurIPS 2023 Dataset and Benchmark Track (Poster)
    [arXiv] [Code] [Dataset]

    We introduce pose-agnostic setting to 3D-aware object anomaly detection problem with MAD dataset, propose the NeRF-based anomaly detection framework: OmniposeAD.

    dise IRFLMDNN: Hybrid Model for PMU Data Anomaly Detection and Re-filling with Improved Random Forest and Levenberg Marquardt Algorithm Optimized Dynamic Neural Network.
    Miao Yu, Chenyu Yang*, Weize Li*, Weijie Du, Jinglin Li.
    Neural Computing and Application 2023
    [paper]

    We propose a hybrid model of improved RF + DNN to extract time-series features and classify anomalies in power system data.

    Patents

  • Power low frequency oscillation data anomaly monitoring system v1.0[s], CN Software Patent No.2022SR0277090
  • Power low frequency oscillation data acquisition system v1.0[s], CN Software Patent No.2022SR0281546
  • Selected Projects
    dise Advances in Radiance Fields: A Survey on 3D Editing
    [On-going] Free research with Tianshu Kuai, Huan-ang Gao, etc. 2024
    [Project Page]

    This is an ongoing survey paper and we are conducting a systematic classification and in-depth analysis for 3D content editing based on radiance field representations (including NeRFs, 3DGS, etc.).

    dise Zero1-to-Car: Finetuning Zero1-to-3 for Single Vehicle Image to 3D.
    AIR Research Project. 2023
    [Coming soon]

    Investigated one‑image‑to‑3D methods like Zero1‑to‑3 to enhance the quality of multi‑view car instances in simulation scenes, with a focus on fine‑tuning models using the ”Car” labeled 3D assets from the Objaverse dataset.

    dise McADTR: Multi-class Anomaly Detection TRansformer with Heterogenous Knowledge Distillation.
    AIR Summer Research 2022
    [Code]

    This project proposes a unified framework for visual anomaly detection based on heterogeneous knowledge distillation. The One-model-all class framework merges CNN and ViT with class-specific learnable query to enable mutually facilitated learning of anomalous features across multi-class samples.

    Service

    I served / was delegated as Reviewer for NeurIPS’23, CVPR’24.

    Awards

  • Outstanding Undergraduate Dissertation Award, Beijing Education Commission (top 1% in 130,000 students, 2022)
  • Silver Award, Beijing Challenge Cup: Entrepreneurial Plan Competition in AI System Track (Rank.2, 2022)
  • Misc.

    Outside of research, I enjoy playing football⚽, fitness💪 and photography📷. I am a member of the Certified Referee🗣️ Crew of the Chinese Football Association. I am also a big fan of Taylor Swift🦋.


    Website template from Jon Barron.


    © Weize Li | Last update: April.30, 2024