Brick-by-Brick ML pipeline

This repository was developed as part of the Brick by Brick 2024 Challenge, a global competition aimed at automating building data classification to enhance the intelligence and energy efficiency of buildings. By tackling this challenge, we contribute to the broader effort of enabling sustainable and smart building management through data-driven solutions.

Our machine learning pipeline achieves a 0.5767 macro F1 score on the public test leaderboard. The approach leverages advanced feature extraction and ensemble learning to optimize performance. For a detailed explanation of the methodology, feature extraction process, and model optimization, please refer to the accompanying report.pdf and our published paper.

Brick by Brick 2024 Challenge

The Brick by Brick 2024 Challenge is a global initiative designed to advance automation in building data classification. The challenge's primary goal is to enable intelligent, energy-efficient buildings by leveraging machine learning and data science methodologies. Our contribution focuses on developing a robust, high-performing model that effectively processes and classifies building data.

Provided Solution

The solution is built around feature extraction and ensemble learning, leveraging a structured approach to feature selection and hyperparameter optimization. The pipeline follows a stratified k-fold cross-validation scheme to ensure robustness.

Repository Structure:

model.py: Contains the code to train the model based on extracted features. It also includes the feature selection paradigm and hyperparameter optimization using a stratified k-fold approach.
brick_feature_extractor.py: Implements feature extraction, generating time-based, interval-based, and full dataset features, which are stored in the features/ folder for reference.
dataloader.py: Merges extracted features for both training and test datasets.
Utility Files: Contains schema definitions and available classes relevant to the brick classification problem.
submission/: Includes all solution submission files related to the paper.

Citation

If you use this code or refer to our work, please cite the following paper:

@inproceedings{Steenwinckel2025,
  author    = {Bram Steenwinckel and Sofie Van Hoecke and Femke Ongenae},
  title     = {Another Brick in the Wall: Leveraging Feature Extraction and Ensemble Learning for Building Data Classification},
  booktitle = {Companion Proceedings of the ACM Web Conference 2025 (WWW Companion '25)},
  year      = {2025},
  publisher = {ACM},
  doi       = {10.1145/3701716.3718480},
  isbn      = {979-8-4007-1331-6/2025/04},
}

Requirements

This code was executed using Python 3.11 with the following necessary packages:

pandas (latest version)
scikit-learn (latest version)

License

This code is licensed under the Apache License 2.0. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit Cannot retrieve latest commit at this time. History 4 Commits
features		features
inference		inference
submissions		submissions
utils		utils
LICENSE.txt		LICENSE.txt
README.md		README.md
brick_feature_extractor.py		brick_feature_extractor.py
dataloader.py		dataloader.py
model.py		model.py
report.pdf		report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Brick-by-Brick ML pipeline

Brick by Brick 2024 Challenge

Provided Solution

Repository Structure:

Citation

Requirements

License

About

Releases

Packages

Languages

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!

License

predict-idlab/brick-by-brick-pipeline

Folders and files

Latest commit

History

Repository files navigation

Brick-by-Brick ML pipeline

Brick by Brick 2024 Challenge

Provided Solution

Repository Structure:

Citation

Requirements

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!

Packages