Skip to content
@hustvl

HUST Vision Lab

HUST Vision Lab of the School of EIC in HUST. Lab Lead @xinggangw

Welcome to the Vision Lab @ HUST!

🙋‍♀️ Introduction

Hello! This is the GitHub space for the Vision Lab led by Professor Xinggang Wang. We are based at the Artificial Intelligence Institute, School of Electronic Information and Communications, Huazhong University of Science and Technology (HUST).

Our research focuses on computer vision and deep learning. We are particularly interested in:

  • Multimodal Foundation Models
  • Visual Representation Learning
  • Object Detection, Segmentation, and Tracking
  • End-to-end Autonomous Driving
  • Novel Neural Architectures

Our group strives to push the boundaries of visual intelligence and has produced highly influential works in the field, including CCNet, Mask Scoring R-CNN, FairMOT, ByteTrack, EVA, MapTR, Vectorized Autonomous Driving (VAD), DiffusionDrive, Vision Mamba (Vim), 4D Gaussian Splatting (4DGS), YOLOS, YOLO-World, and LightningDiT & VA-VAE.

🌈 Contribution Guidelines & Collaboration

We actively contribute to the research community through publications and open-source projects.

  • Research Collaboration: We are open to collaborations in our areas of interest. Please feel free to reach out to Prof. Xinggang Wang (xgwang # hust.edu.cn).
  • Prospective Students: Our group has a strong track record of mentoring Ph.D. and Master's students who lead impactful publications. Interested students can find more information on Prof. Wang's faculty page.
  • Using Our Code: You are welcome to explore and use the code in our repositories. Please ensure you cite the corresponding publications appropriately. Specific details can usually be found in the README files of individual repositories.
  • Contributing to Projects: For guidelines on contributing to specific projects (e.g., bug reports, pull requests), please check the individual repositories.

👩‍💻 Useful Resources

Pinned Loading

  1. Vim Vim Public

    [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

    Python 3.5k 243

  2. LightningDiT LightningDiT Public

    [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

    Python 997 29

  3. 4DGaussians 4DGaussians Public

    [CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

    Jupyter Notebook 2.9k 259

  4. VAD VAD Public

    [ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

    Python 997 119

  5. MapTR MapTR Public

    [ICLR'23 Spotlight & ECCV'24 & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

    Python 1.3k 211

  6. SparseInst SparseInst Public

    [CVPR 2022] SparseInst: Sparse Instance Activation for Real-Time Instance Segmentation

    Python 606 74

Repositories

Showing 10 of 105 repositories
  • GaussTR Public

    [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding

    hustvl/GaussTR’s past year of commit activity
    Python 154 MIT 7 1 0 Updated Jul 10, 2025
  • MaskAdapter Public

    [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"

    hustvl/MaskAdapter’s past year of commit activity
    Python 104 Apache-2.0 1 4 0 Updated Jul 7, 2025
  • Dynamic-2DGS Public

    [ACMMM 2025] Dynamic 2D Gaussians: Geometrically Accurate Radiance Fields for Dynamic Objects

    hustvl/Dynamic-2DGS’s past year of commit activity
    Python 135 Apache-2.0 5 3 0 Updated Jul 6, 2025
  • .github Public
    hustvl/.github’s past year of commit activity
    0 0 0 0 Updated Jul 4, 2025
  • GroundingSuite Public

    [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

    hustvl/GroundingSuite’s past year of commit activity
    Python 65 1 2 0 Updated Jun 26, 2025
  • DiffusionDrive Public

    [CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

    hustvl/DiffusionDrive’s past year of commit activity
    Python 848 MIT 58 10 0 Updated Jun 17, 2025
  • LightningDiT Public

    [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

    hustvl/LightningDiT’s past year of commit activity
    Python 997 MIT 29 3 0 Updated Jun 12, 2025
  • PersonViT Public

    PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification

    hustvl/PersonViT’s past year of commit activity
    Python 27 Apache-2.0 4 2 0 Updated Jun 11, 2025
  • MIM4D Public

    [IJCV 2025] MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning

    hustvl/MIM4D’s past year of commit activity
    Python 68 Apache-2.0 1 2 0 Updated May 30, 2025
  • PixelHacker Public

    PixelHacker: Image Inpainting with Structural and Semantic Consistency

    hustvl/PixelHacker’s past year of commit activity
    Python 434 Apache-2.0 16 10 0 Updated May 20, 2025
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy