Experiments
1. Concept Prediction
- Our CB-AE and CC can provide interpretability with concept predictions for generated images (Fig. 3).
Figure 3. Qualitative evaluation of concept prediction
2. Steerability
- Our CB-AE enables concept interventions for controllable image generation (Fig. 4).
- Our optimization-based interventions further improve image quality and orthogonality of concept interventions (Fig. 5).
Figure 4. Qualitative evaluation of concept steerability with CB-AE
Figure 5. Qualitative evaluation of concept steerability with optimization-based interventions
- Steerability (%) is the success rate of concept interventions averaged over all concepts and computed over 5k generated images.
- The success rate is calculated w.r.t. a pretrained concept classifier (ViT-L-16-based, not used in CB-AE/CC training).
- From Table 2, we find improved steerability of CB-AE and CC over CBGM as well as reduced training time.
Table 2. Quantitative evaluation of concept steerability i.e. concept interventions (train time on 1 V100 GPU)
3. Generation Quality
- CBGM produces better image quality than CB-AE w.r.t. the base model generations since CBGM directly optimizes image generation objectives.
- However, our optimization-based interventions can obtain almost the same image quality as the base model.
Table 3. Generation quality evaluation using FID (train time in V100 GPU-hours)
Cite this work
A. Kulkarni, G. Yan, C. Sun, T. Oikarinen, and T.-W. Weng,
Interpretable Generative Models through Post-hoc Concept Bottlenecks, CVPR 2025.
@inproceedings{kulkarni2025interpretable
title={Interpretable Generative Models through Post-hoc Concept Bottlenecks},
author={Kulkarni, Akshay and Yan, Ge and Sun, Chung-En and Oikarinen, Tuomas and Weng, Tsui-Wei},
booktitle={IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year={2025},
}