Skip to content

ZZY816/VoxelSplat

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VoxelSplat Logo

[CVPR2025] VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction

Ziyue Zhu  ·  Shenlong Wang  ·  Jin Xie  ·  Jiang-jiang Liu  ·  Jingdong Wang  ·  Jian Yang

VoxelSplat is a novel regularization framework that leverages dynamic 3D Gaussian Splatting to improve the prediction of occupancy and scene flow.

Framework

Framework Diagram

Recent advances in camera-based occupancy prediction aim to jointly estimate 3D semantics and scene flow. We propose VoxelSplat, a novel regularization framework that leverages 3D Gaussian Splatting to improve learning in two key ways:
(i) 2D-Projected Semantic Supervision: During training, sparse semantic Gaussians decoded from 3D features are projected onto the 2D camera view, enabling camera-visible supervision to guide 3D semantic learning.
(ii) Enhanced Scene Flow Learning: Motion is modeled by propagating Gaussians with predicted scene flow, allowing enhanced flow learning from adjacent-frame labels.
VoxelSplat integrates easily into existing occupancy models, improving both semantic and motion predictions without increasing inference time.

News

  • [2025/7]: Code and pre-trained weights are released.
  • [2025/3]: Paper is accepted on CVPR 2025.

🕹️ Getting Started

Performance

Backbone Config Image Size Epochs Train Pretrain Memory RayIoU mAVE checkpoints
R50 FB-Occ (Baseline) 256 x 704 48 ImageNet 17 G 33.57 0.504 [model]
R50 voxelsplat-r50 256 x 704 48 ImageNet 19 G 37.14 0.312 [model]
EVA-VIT voxelsplat-eva 640 x 1600 24 ImageNet 28 G - - [model]
Intern-XL voxelsplat-intern 640 x 1600 24 Nus-Det 39 G - - [model]

Citation

If you find our paper and code useful for your research, please consider citing:

@inproceedings{zhu2025voxelsplat,
  title={Voxelsplat: Dynamic gaussian splatting as an effective loss for occupancy and flow prediction},
  author={Zhu, Ziyue and Wang, Shenlong and Xie, Jin and Liu, Jiang-jiang and Wang, Jingdong and Yang, Jian},
  booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
  pages={6761--6771},
  year={2025}
}

About

CVPR 2025: VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages