SRBench

Welcome to our project! This repository contains all the source code, tests, and documentation required to understand and run the project. Below is an overview of the repository structure, installation, usage instructions, and contribution guidelines.

Overview

This repository is divided into several modules that cover various aspects of the project, including:

Data Processing: Scripts for loading, processing, and analyzing data.
Analysis Tools: Modules that perform computations and run experiments.
Visualization Components: Code for rendering results and generating reports.

Repository Structure

The project is organized as follows:

SRBench/
├── bin/                                  # Storage for model binaries
├── scripts/                              # Bash scripts for running the project
│   └── run.sh                            # Main execution script
├── src/                                  # Source code of the project
│   ├── data_creation/                    # Scripts for data creation
│       ├── __init__.py                	  # Initialization file
│       ├── create_data.py                # Script for data creation
│       ├── create_images.py              # Script to create images
│       └── create_prompts.py             # Script to generate prompts
│   ├── utils/                            # Utility functions
│       ├── __init__.py                   # Initialization file
│       ├── vlm_helpers.py                # Helper functions for the VLM models
│   ├── eval.py                           # Evaluation script
│   ├── eval_intern.py 				      # Evaluation script for InternVL
│   ├── eval_openai.py 				      # Evaluation script for OpenAI models
│   ├── eval_mini.py 				      # Evaluation script for MiniCPM-V
├── .gitignore                            # Files and directories to ignore
├── requirements.txt                      # Required packages
├── LICENSE                               # MIT License file
└── README.md                             # Project documentation

Installation

Clone the repository:

git clone https://github.com/stogiannidis/srbench.git
cd srbench

Create a virtual environment:

python3 -m venv venv
source venv/bin/activate

or using conda:

conda create -n srbench python=3.12
conda activate srbench

Install the required packages:
```
pip install -r requirements.txt
```

Usage

To run the project, follow these steps:

Fetch the dataset from Hugging Face:

huggingface-cli login
huggingface-cli download stogiannidis/srbench

Run the script:
```
bash scripts/run.sh
```

Citation

@misc{stogiannidis2025mindgapbenchmarkingspatial,
      title={Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models}, 
      author={Ilias Stogiannidis and Steven McDonagh and Sotirios A. Tsaftaris},
      year={2025},
      eprint={2503.19707},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.19707}, 
}

Contributing

Contributions are welcome! Please follow these steps:

Fork the repository.
Create a new branch (git checkout -b feature/your_feature).
Commit your changes (git commit -am 'Add new feature').
Push to the branch (git push origin feature/your_feature).
Open a Pull Request.

License

This project is licensed under the MIT License. See the LICENSE file for more information.

Contact

For questions or feedback, please open an issue or contact me via email.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SRBench

Overview

Repository Structure

Installation

Usage

Citation

Contributing

License

Contact

About

Uh oh!

Uh oh!

Languages

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

License

stogiannidis/srbench

Folders and files

Latest commit

History

Repository files navigation

SRBench

Overview

Repository Structure

Installation

Usage

Citation

Contributing

License

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.