EfficientPS (Cityscapes-fine) | 62.9 | EfficientPS: Efficient Panoptic Segmentation | |
Dynamically Instantiated Network | 55.4 | Pixelwise Instance Segmentation with a Dynamically Instantiated Network | |
Panoptic-DeepLab (SWideRNet [1, 1, 4.5], Mapillary, multi-scale) | 67.8 | Scaling Wide Residual Networks for Panoptic Segmentation | |
kMaX-DeepLab (single-scale) | 66.2 | kMaX-DeepLab: k-means Mask Transformer | |
OneFormer (ConvNeXt-L, single-scale, Mapillary Vistas-Pretrained) | 68.0 | OneFormer: One Transformer to Rule Universal Image Segmentation | |
Axial-DeepLab-XL (Mapillary Vistas, multi-scale) | 66.6 | Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation | |