Change the repository type filter
All
Repositories list
83 repositories
LSDBench
PublicA benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency of long-video VLMs. (ICCV2025)Seg-Zero
PublicProject Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"VisionThink
PublicVisionZip
PublicOfficial repository for VisionZip (CVPR 2025)- Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)
VisionReasoner
PublicThe official implement of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"TGDPO
Public[ICML 2025] TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference OptimizationVideo-P2P
PublicVideo-P2P: Video Editing with Cross-attention ControlRL-GPT
PublicJenga
PublicMagicMirror
PublicLogits-Based-Finetuning
PublicLLMGA
PublicThis project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 OralARPO
PublicMoTCoder
PublicThis is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.Open-Code-Zero
PublicLISA
PublicProject Page for "LISA: Reasoning Segmentation via Large Language Model"Step-DPO
PublicLyra
PublicOfficial Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition" (ICCV 2025)LBGAT
PublicLearnable Boundary Guided Adversarial Training (ICCV2021)Mr-Ben
PublicControlNeXt
PublicTagCLIP
PublicLongLoRA
PublicCode and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)DiffComplete
PublicOfficial Codebase of "DiffComplete: Diffusion-based Generative 3D Shape Completion"PFENet
PublicLLaMA-VID
PublicPointGroup
PublicPrompt-Highlighter
Public[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMsQ-LLM
PublicThis is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"