
DAP-MAE: Domain-Adaptive Point Cloud Masked Autoencoder for Effective Cross-Domain Learning


The code has been released, but it may still contain issues, possibly caused by mismatched parameter names during checkpoint loading. We will fix these as soon as possible.

  • 🎉 Jun 2025: DAP-MAE is accepted by ICCV 2025 🎉.

Abstract

In this work, we propose the Domain-Adaptive Point Cloud Masked Autoencoder (DAP-MAE), an MAE pre-training method that adaptively integrates the knowledge of cross-domain datasets for general point cloud analysis. In DAP-MAE, we design a heterogeneous domain adapter that uses an adaptation mode during pre-training, enabling the model to comprehensively learn information from point clouds across different domains, and a fusion mode during fine-tuning to enhance point cloud features. Meanwhile, DAP-MAE incorporates a domain feature generator to guide the adaptation of point cloud features to various downstream tasks. With only one pre-training, DAP-MAE achieves excellent performance across four different point cloud analysis tasks, reaching 95.18% in object classification on ScanObjectNN and 88.45% in facial expression recognition on Bosphorus.
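The group-and-mask step at the heart of MAE-style point cloud pre-training can be sketched as below. This is a simplified illustration, not the authors' implementation: the group count, group size, and masking ratio are placeholder values, and real pipelines typically pick group centers with farthest point sampling rather than random choice.

```python
import numpy as np

def random_mask_groups(points, num_groups=64, group_size=32, mask_ratio=0.6, seed=0):
    """Split a point cloud (N, 3) into local groups and mask a random subset.

    Returns (visible_groups, masked_groups, mask); only the visible groups
    would be fed to the encoder, while masked groups are reconstruction targets.
    """
    rng = np.random.default_rng(seed)
    # Pick group centers (random here; FPS is the usual choice in practice).
    centers = points[rng.choice(len(points), num_groups, replace=False)]
    # Gather the group_size nearest neighbors of each center.
    dists = np.linalg.norm(points[None, :, :] - centers[:, None, :], axis=-1)
    idx = np.argsort(dists, axis=1)[:, :group_size]   # (num_groups, group_size)
    groups = points[idx]                              # (num_groups, group_size, 3)
    # Mask a random subset of whole groups.
    num_masked = int(mask_ratio * num_groups)
    mask = np.zeros(num_groups, dtype=bool)
    mask[rng.choice(num_groups, num_masked, replace=False)] = True
    return groups[~mask], groups[mask], mask
```

With the defaults above, 38 of the 64 groups are masked and 26 remain visible to the encoder.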

Requirements

# Chamfer Distance
cd ./extensions/chamfer_dist && python setup.py install --user
# PointNet++
pip install "git+https://github.com/erikwijmans/Pointnet2_PyTorch.git#egg=pointnet2_ops&subdirectory=pointnet2_ops_lib"
# GPU kNN
pip install --upgrade https://github.com/unlimblue/KNN_CUDA/releases/download/0.2/KNN_CUDA-0.2-py3-none-any.whl

Datasets

We use FRGCv2, Bosphorus, BU3DFE, ShapeNet, ScanObjectNN, ModelNet40, S3DIS and ShapeNetPart in this work. See DATASET.md for details.

Pre-trained model and fine-tuned checkpoints

You can find the pre-trained model and fine-tuned checkpoints for the downstream tasks here: Google Drive

PS: At the moment we have only uploaded the pre-trained model and the ScanObjectNN OBJ-BG checkpoint; the remaining checkpoints will be added later.

Training and Inference

Pre-training on cross-domain datasets

CUDA_VISIBLE_DEVICES=<GPU> python main.py --config cfgs/pretrain/pretrain.yaml --exp_name <choose your name>

Fine-tuning on ScanObjectNN

CUDA_VISIBLE_DEVICES=<GPU> python main.py --config cfgs/finetune_classification/full/finetune_scan_objbg.yaml --finetune_model --exp_name <choose your name> --ckpts <checkpoints_path>
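If loading a checkpoint fails because of mismatched parameter names (the known issue noted at the top of this README), a generic key remap along these lines can work around it until the fix lands. The prefixes in the example are hypothetical, not DAP-MAE's actual module names; take the old/new names from the error message that `load_state_dict` prints.

```python
def remap_keys(state_dict, rename_rules):
    """Rename checkpoint keys by prefix so they match the current model's names.

    rename_rules maps an old prefix to a new one; the first matching rule wins.
    """
    remapped = {}
    for key, value in state_dict.items():
        for old, new in rename_rules.items():
            if key.startswith(old):
                key = new + key[len(old):]
                break
        remapped[key] = value
    return remapped

# Typical usage (prefixes illustrative only):
#   state = torch.load("ckpt.pth", map_location="cpu")
#   state = state.get("base_model", state)  # some checkpoints nest the weights
#   model.load_state_dict(
#       remap_keys(state, {"module.": "", "MAE_encoder.": "encoder."}),
#       strict=False,  # report, rather than fail on, remaining mismatches
#   )
```

`strict=False` returns the lists of missing and unexpected keys, which is a quick way to see which rename rules are still needed.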

License

DAP-MAE is released under the MIT License. See the LICENSE file for more details. In addition, the licensing information for the pointnet2 modules is available here.

Acknowledgements

This codebase is built upon ReCon, Pointnet2_PyTorch, and ACT.
