TransDSSL | 0.906 | 0.967 | 0.984 | O | 4.321 | 0.172 | 0.711 | 0.095 | TransDSSL: Transformer based Depth Estimation via Self-Supervised Learning | |
Manydepth2(M+640x192) | 0.909 | 0.968 | 0.984 | O | 4.232 | 0.170 | 0.649 | 0.091 | Manydepth2: Motion-Aware Self-Supervised Multi-Frame Monocular Depth Estimation in Dynamic Scenes | |
SCIPaD(M+640x192) | 0.897 | 0.964 | 0.983 | O | 4.391 | 0.175 | 0.732 | 0.098 | SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning | - |
DS-Depth | 0.905 | 0.966 | 0.984 | - | 4.329 | 0.173 | 0.698 | 0.095 | DS-Depth: Dynamic and Static Depth Estimation via a Fusion Cost Volume | |
VTDepthB2 (stereo supervision) | 0.904 | 0.965 | 0.983 | - | 4.439 | 0.178 | 0.743 | 0.099 | Exploring Efficiency of Vision Transformers for Self-Supervised Monocular Depth Estimation | - |
EPCDepth(S+1024x320) | 0.901 | 0.966 | 0.983 | X | 4.207 | 0.176 | 0.646 | 0.091 | Excavating the Potential Capacity of Self-Supervised Monocular Depth Estimation | |