
UCSC ERIC Lab · GitHub
@eric-ai-lab

UCSC ERIC Lab

UCSC Embodied and Responsible Interaction and Communication (ERIC) Lab

Pinned

  1. MiniGPT-5 Public

    Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

    Python 858 52

  2. photoswap Public

    Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"

    Jupyter Notebook 345 24

  3. awesome-vision-language-navigation Public

    A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

    405 21

  4. PEViT Public

    Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"

    Python 98 5

  5. VLMbench Public

    NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"

    Python 83 8

  6. Aerial-Vision-and-Dialog-Navigation Public

    Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"

    Python 49 6

Repositories

Showing 10 of 27 repositories
  • MSSBench Public

    Official codebase for the paper "Multimodal Situational Safety"

    Python 7 MIT 1 0 0 Updated Dec 18, 2024
  • MiniGPT-5 Public

    Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

    Python 858 Apache-2.0 52 6 0 Updated Dec 12, 2024
  • Aerial-Vision-and-Dialog-Navigation Public

    Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"

    Python 49 6 3 0 Updated Nov 4, 2024
  • JavaScript 0 0 0 0 Updated Oct 18, 2024
  • llm_coordination Public

    Code repository for the paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models"

    Python 27 MIT 2 0 0 Updated Oct 13, 2024
  • swap-anything Public

    Official implementation of the ECCV paper "SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"

    Python 237 MIT 10 4 0 Updated Oct 10, 2024
  • MMWorld Public

    Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"

    Python 23 MIT 1 0 0 Updated Sep 21, 2024
  • ComCLIP Public

    Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"

    Python 35 MIT 3 0 1 Updated Aug 18, 2024
  • Screen-Point-and-Read Public

    Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

    Python 24 2 0 0 Updated Jul 31, 2024
  • ProbMed Public

    "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"

    Python 15 1 1 0 Updated Jun 24, 2024











Fetched URL: http://github.com/eric-ai-lab
