Welcome to our project! This repository contains all the source code, tests, and documentation required to understand and run the project. Below is an overview of the repository structure, installation, usage instructions, and contribution guidelines.
This repository is divided into several modules that cover various aspects of the project, including:
- Data Processing: Scripts for loading, processing, and analyzing data.
- Analysis Tools: Modules that perform computations and run experiments.
- Visualization Components: Code for rendering results and generating reports.
The project is organized as follows:
SRBench/
├── bin/                          # Storage for model binaries
├── scripts/                      # Bash scripts for running the project
│   └── run.sh                    # Main execution script
├── src/                          # Source code of the project
│   ├── data_creation/            # Scripts for data creation
│   │   ├── __init__.py           # Initialization file
│   │   ├── create_data.py        # Script for data creation
│   │   ├── create_images.py      # Script to create images
│   │   └── create_prompts.py     # Script to generate prompts
│   ├── utils/                    # Utility functions
│   │   ├── __init__.py           # Initialization file
│   │   └── vlm_helpers.py        # Helper functions for the VLM models
│   ├── eval.py                   # Evaluation script
│   ├── eval_intern.py            # Evaluation script for InternVL
│   ├── eval_openai.py            # Evaluation script for OpenAI models
│   └── eval_mini.py              # Evaluation script for MiniCPM-V
├── .gitignore                    # Files and directories to ignore
├── requirements.txt              # Required packages
├── LICENSE                       # MIT License file
└── README.md                     # Project documentation
- Clone the repository:
git clone https://github.com/stogiannidis/srbench.git
cd srbench
- Create a virtual environment:
python3 -m venv venv
source venv/bin/activate
or using conda:
conda create -n srbench python=3.12
conda activate srbench
- Install the required packages:
pip install -r requirements.txt
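After installing, it can be useful to confirm that everything listed in requirements.txt actually landed in the active environment. The following is a minimal sketch of such a check, not part of the repository: the function names are hypothetical, it uses only the Python standard library, and it compares package names only (version pins are ignored).

```python
import re
from importlib import metadata
from pathlib import Path


def parse_requirement_names(text: str) -> list[str]:
    """Extract bare package names from requirements.txt-style text."""
    names = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        # Skip blank lines, pip options (-r, -e, --index-url), and direct URLs.
        if not line or line.startswith("-") or "://" in line:
            continue
        # The name is everything before extras, version specifiers, or markers.
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.append(match.group(0))
    return names


def missing_packages(names: list[str]) -> list[str]:
    """Return the names that have no installed distribution."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


def report(path: str = "requirements.txt") -> list[str]:
    """Hypothetical helper: print and return packages that are not installed."""
    names = parse_requirement_names(Path(path).read_text(encoding="utf-8"))
    missing = missing_packages(names)
    for name in missing:
        print(f"missing: {name}")
    return missing
```

Calling report() from the repository root, with the virtual environment active, returns an empty list when every listed package is installed.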
To run the project, follow these steps:
- Fetch the dataset from Hugging Face:
huggingface-cli login
huggingface-cli download stogiannidis/srbench
- Run the script:
bash scripts/run.sh
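The steps above can also be driven from Python, which helps when the run is part of a larger pipeline. This is a sketch under assumptions rather than anything shipped with the repository: STEPS and run_steps are hypothetical names, the commands are copied from the steps above, and the interactive login is left out because it needs a terminal. By default the function only prints the commands (a dry run).

```python
import shlex
import subprocess

# Commands taken from the usage steps; run them from the repository root.
STEPS = [
    ["huggingface-cli", "download", "stogiannidis/srbench"],
    ["bash", "scripts/run.sh"],
]


def run_steps(steps: list[list[str]], dry_run: bool = True) -> list[str]:
    """Print each command and, unless dry_run is set, execute it in order.

    Execution stops at the first failing command, because check=True makes
    subprocess.run raise CalledProcessError on a non-zero exit code.
    Returns the command lines that were printed (and run, if not a dry run).
    """
    done = []
    for cmd in steps:
        line = shlex.join(cmd)
        print(f"$ {line}")
        if not dry_run:
            subprocess.run(cmd, check=True)
        done.append(line)
    return done
```

Use run_steps(STEPS, dry_run=False) to execute for real. If the download step fails with a repository-not-found error, one possible cause is that the repository is registered as a dataset; huggingface-cli download accepts --repo-type dataset for that case, though whether it is needed here is an assumption.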
If you use this project in your research, please cite:
@misc{stogiannidis2025mindgapbenchmarkingspatial,
  title={Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models},
  author={Ilias Stogiannidis and Steven McDonagh and Sotirios A. Tsaftaris},
  year={2025},
  eprint={2503.19707},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2503.19707},
}
Contributions are welcome! Please follow these steps:
- Fork the repository.
- Create a new branch (git checkout -b feature/your_feature).
- Commit your changes (git commit -am 'Add new feature').
- Push to the branch (git push origin feature/your_feature).
- Open a Pull Request.
This project is licensed under the MIT License. See the LICENSE file for more information.
For questions or feedback, please open an issue or contact me via email.