GitHub - Jeremy-lf/RT-DATR: RT-DATR:Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature Learning

RT-DATR: Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature Learning

Abstract

Although domain-adaptive object detectors based on CNNs and transformers have made significant progress in cross-domain detection tasks, domain adaptation for real-time transformer-based detectors has not yet been explored. Directly applying existing domain adaptation algorithms has proven to be suboptimal. In this paper, we propose RT-DATR, a simple and efficient real-time domain adaptive detection transformer. Building on RT-DETR as our base detector, we first introduce a local object-level feature alignment module to significantly enhance the domain-invariant feature representation during object transfer. Additionally, we introduce a scene semantic feature alignment module designed to boost cross-domain detection performance by aligning scene semantic features. Finally, we introduce a domain query, decoupled from the object query, to further align the instance feature distribution within the decoder layer, reducing the domain gap while maintaining discriminative ability. Experimental results on various benchmarks demonstrate that our method outperforms current state-of-the-art approaches.
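The abstract does not spell out how the alignment modules are trained. One common way to implement adversarial feature alignment in general is a domain discriminator trained through a gradient reversal step; whether RT-DATR uses exactly this is an assumption here, not something this README confirms. The toy sketch below uses a single scalar feature with hand-written gradients, and every name in it (`domain_loss_and_grads`, `w`, `v`, `lam`) is hypothetical.

```python
import math


def sigmoid(z):
    """Logistic function."""
    return 1.0 / (1.0 + math.exp(-z))


def domain_loss_and_grads(w, v, x, d, lam=1.0):
    """Toy adversarial alignment step on one scalar sample.

    w   : feature-extractor weight (feature f = w * x)
    v   : domain-discriminator weight (domain probability p = sigmoid(v * f))
    d   : domain label, 0.0 for source and 1.0 for target
    lam : gradient reversal strength (hypothetical hyperparameter)
    """
    f = w * x
    p = sigmoid(v * f)
    # Binary cross-entropy of the discriminator's domain prediction.
    loss = -(d * math.log(p) + (1.0 - d) * math.log(1.0 - p))
    dloss_dz = p - d                  # gradient w.r.t. the logit z = v * f
    grad_v = dloss_dz * f             # discriminator descends on the loss
    grad_f = dloss_dz * v             # gradient arriving at the feature
    grad_w = -lam * grad_f * x        # reversal: the extractor gets the negated gradient
    return loss, grad_v, grad_w


loss, grad_v, grad_w = domain_loss_and_grads(w=1.0, v=1.0, x=1.0, d=1.0)
print(loss, grad_v, grad_w)
```

With gradient descent on both weights, the discriminator learns to tell the domains apart while the sign flip pushes the feature extractor toward features the discriminator cannot separate.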

Quick Start

## Environment
pip install -r requirement.txt

## Train
sh run.sh, or run the commands directly:

export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
export FLAGS_START_PORT=35100
nohup python3.7 \
-m paddle.distributed.launch --gpus=0,1,2,3 --log_dir=log_base tools/train.py \
-c configs/domain_adaption/da_r50_rtdetr_backbone_encoder_instance_dn_cmt_city2foggycity.yml \
--eval \
--enable_ce True \
> rt_datr_r50_cityscapes_to_foggycity_2e4_72e_backbozne_encoder_instance_dn_cmt_loss2.txt 2>&1 &
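When launching several runs with different configs or GPU sets, it can help to assemble the training command programmatically. The sketch below rebuilds the argument list shown above; the helper name `build_train_command` and its defaults are hypothetical, and the environment variables, `nohup`, and log redirection are deliberately left to the caller.

```python
import shlex


def build_train_command(config, gpus="0,1,2,3", log_dir="log_base",
                        python_bin="python3.7"):
    """Return the training command from this README as an argv list."""
    return [
        python_bin, "-m", "paddle.distributed.launch",
        f"--gpus={gpus}", f"--log_dir={log_dir}",
        "tools/train.py",
        "-c", config,
        "--eval",
        "--enable_ce", "True",
    ]


CONFIG = ("configs/domain_adaption/"
          "da_r50_rtdetr_backbone_encoder_instance_dn_cmt_city2foggycity.yml")
cmd = build_train_command(CONFIG)
# Print a shell-quoted version for inspection; pass `cmd` to subprocess.run to execute.
print(shlex.join(cmd))
```

Returning a list rather than a single string avoids shell-quoting problems if a config path ever contains spaces.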

## Eval
sh eval.sh, or run the commands directly:

export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
python3.7 tools/eval.py \
-c configs/domain_adaption/da_rtdetr_r34_backbone_encoder_instance_dn_cmt_city2foggycity.yml \
-o weights=output/cityscapes2foggycity/best_model.pdparams
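Evaluation fails late if the config or checkpoint path is wrong, so a small pre-flight check can save a wasted launch. This is a minimal sketch, assuming it runs from the repository root; `missing_eval_inputs` is a hypothetical helper, not part of this repository.

```python
from pathlib import Path


def missing_eval_inputs(config, weights, root="."):
    """Return the inputs (config, weights) that are not files under `root`."""
    root = Path(root)
    return [p for p in (config, weights) if not (root / p).is_file()]


CONFIG = ("configs/domain_adaption/"
          "da_rtdetr_r34_backbone_encoder_instance_dn_cmt_city2foggycity.yml")
WEIGHTS = "output/cityscapes2foggycity/best_model.pdparams"

missing = missing_eval_inputs(CONFIG, WEIGHTS)
if missing:
    print("Missing inputs:", ", ".join(missing))
else:
    print("All inputs found; safe to run tools/eval.py")
```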

Experimental Results

We evaluated our approach on multiple cross-domain benchmarks: weather adaptation (Cityscapes to Foggy Cityscapes), scene adaptation (Cityscapes to BDD100K), synthetic-to-real adaptation (Sim10K to Cityscapes), and cross-camera adaptation (KITTI to Cityscapes).

Cite

@article{lv2025rt,
  title={RT-DATR: Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature Learning},
  author={Lv, Feng and Xia, Chunlong and Wang, Shuo and Cao, Huo},
  journal={arXiv preprint arXiv:2504.09196},
  year={2025}
}
