| mIoU | C.-Dec. | |||||
|---|---|---|---|---|---|---|
| QO | QS | QT | Scd | Rcd | ||
| Arch. | Model | |||||
| CNN | Mask2Former RN101 | 0.8080 | 0.2114 | 0.3209 | 0.4801 | 0.3294 |
| DeepLabV3Plus RN101 | 0.8016 | 0.1795 | 0.3464 | 0.4208 | 0.3280 | |
| PSPNet RN50 | 0.7792 | 0.1542 | 0.3661 | 0.3712 | 0.3339 | |
| FCN RN101 | 0.7524 | 0.2052 | 0.3809 | 0.4303 | 0.3895 | |
| Vision Transf. | Mask2Former Swin-B | 0.8352 | 0.3974 | 0.4187 | 0.5709 | 0.4886 |
| SegFormer MiT-b4 | 0.8189 | 0.3919 | 0.3997 | 0.5789 | 0.4833 | |
| SETR ViT-L_mla | 0.7624 | 0.3912 | 0.2913 | 0.6531 | 0.4476 | |
| other | DNL RN101 | 0.7831 | 0.2089 | 0.3774 | 0.4369 | 0.3743 |
| OCRNet hr18 | 0.7787 | 0.1722 | 0.2826 | 0.4607 | 0.2920 | |