Concept bottleneck models (CBMs) are deep learning models designed to be interpretable and intervenable. Instead of mapping inputs directly to labels, a CBM consists of an encoder module that maps inputs to a set of human-interpretable concepts. These concepts are then fed into a predictor module that outputs the final labels. The concept layer can be used to explain which features of an input led to the final prediction, and it can be intervened on (i.e., corrected by a human) to improve predictions on a particular input.
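The encoder → concepts → predictor pipeline, including a concept intervention, can be sketched minimally as follows. This is an illustrative toy (randomly initialised linear layers, hypothetical dimensions), not the implementation in this repo:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: 8 input features, 3 concepts, 2 classes.
N_IN, N_CONCEPTS, N_CLASSES = 8, 3, 2

# Randomly initialised weights stand in for trained encoder/predictor modules.
W_enc = rng.normal(size=(N_IN, N_CONCEPTS))
W_pred = rng.normal(size=(N_CONCEPTS, N_CLASSES))

def encode(x):
    """Encoder: map inputs to concept probabilities in [0, 1]."""
    return 1.0 / (1.0 + np.exp(-x @ W_enc))

def predict(concepts):
    """Predictor: map concept activations to class logits."""
    return concepts @ W_pred

def intervene(concepts, index, value):
    """Overwrite one concept with a human-supplied corrected value."""
    fixed = concepts.copy()
    fixed[..., index] = value
    return fixed

x = rng.normal(size=(1, N_IN))
c = encode(x)                                  # interpretable bottleneck
logits = predict(c)                            # prediction from inferred concepts
logits_fixed = predict(intervene(c, 0, 1.0))   # prediction after correcting concept 0
```

Because the predictor only sees the concept vector, correcting a single concept propagates directly to the final prediction, which is what makes CBMs intervenable.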
This repo contains example CBM(s) and associated evaluation metrics.
Tools used to assess concept alignment:
- Saliency maps
- Masking relevant image locations for selected concepts