Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation (CVPR 2022)

Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar

[arXiv] [Project] [BibTeX]

Features

A single architecture for panoptic, instance and semantic segmentation.
Support major segmentation datasets: ADE20K, Cityscapes, COCO, Mapillary Vistas.

Updates

Add Google Colab demo.
Video instance segmentation is now supported! Please check our tech report for more details.

Installation

See installation instructions.

Getting Started

See Preparing Datasets for Mask2Former.

See Getting Started with Mask2Former.

Run our demo using Colab:

Integrated into Huggingface Spaces 🤗 using Gradio. Try out the Web Demo:

Replicate web demo and docker image is available here:

Advanced usage

See Advanced Usage of Mask2Former.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the Mask2Former Model Zoo.

License

Shield:

The majority of Mask2Former is licensed under a MIT License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license, Deformable-DETR is licensed under the Apache-2.0 License.

Citing Mask2Former

If you use Mask2Former in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@inproceedings{cheng2021mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar},
  journal={CVPR},
  year={2022}
}

If you find the code useful, please also consider the following BibTeX entry.

@inproceedings{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={NeurIPS},
  year={2021}
}

Acknowledgement

Code is largely based on MaskFormer (https://github.com/facebookresearch/MaskFormer).

Name	Name	Last commit message	Last commit date
Latest commit rohitgirdhar update license May 20, 2022 9b0651c · May 20, 2022 History 21 Commits
configs	configs	change yaml _BASE to panoptic segmentation instead of semantic segmen…	Jan 13, 2022
datasets	datasets	Mask2Former for video instance segmentation	Dec 20, 2021
demo	demo	Initial commit	Dec 3, 2021
demo_video	demo_video	Mask2Former for video instance segmentation	Dec 20, 2021
mask2former	mask2former	support inference on CPU	Feb 9, 2022
mask2former_video	mask2former_video	Mask2Former for video instance segmentation	Dec 20, 2021
tools	tools	nits	Dec 17, 2021
.gitignore	.gitignore	Initial commit	Dec 3, 2021
ADVANCED_USAGE.md	ADVANCED_USAGE.md	Initial commit	Dec 3, 2021
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	Initial commit	Dec 3, 2021
CONTRIBUTING.md	CONTRIBUTING.md	Initial commit	Dec 3, 2021
GETTING_STARTED.md	GETTING_STARTED.md	Mask2Former for video instance segmentation	Dec 20, 2021
INSTALL.md	INSTALL.md	nits	Dec 17, 2021
LICENSE	LICENSE	update license	May 20, 2022
MODEL_ZOO.md	MODEL_ZOO.md	fix coco instance model config link	Jan 19, 2022
README.md	README.md	update license	May 20, 2022
cog.yaml	cog.yaml	replicate demo (facebookresearch#51 )	Feb 21, 2022
predict.py	predict.py	replicate demo (facebookresearch#51 )	Feb 21, 2022
requirements.txt	requirements.txt	Initial commit	Dec 3, 2021
train_net.py	train_net.py	ignore ShapelyDeprecationWarning from fvcore	Jan 20, 2022
train_net_video.py	train_net_video.py	ignore ShapelyDeprecationWarning from fvcore	Jan 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation (CVPR 2022)

Features

Updates

Installation

Getting Started

Advanced usage

Model Zoo and Baselines

License

Citing Mask2Former

Acknowledgement

About

Releases

Packages

Languages

License

alexanderjaus/Mask2Former

Folders and files

Latest commit

History

Repository files navigation

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation (CVPR 2022)

Features

Updates

Installation

Getting Started

Advanced usage

Model Zoo and Baselines

License

Citing Mask2Former

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages