An Efficient Monocular Depth Prediction Network Using Coordinate Attention and Feature Fusion


Huihui Xu, Fei Li, Journal of Information Processing Systems Vol. 18, No. 6, pp. 794-802, Dec. 2022  

10.3745/JIPS.02.0187
Keywords: Attention Mechanism, Depth Prediction, feature fusion, Multi-scale features
Fulltext:

Abstract

The recovery of reasonable depth information from different scenes is a popular topic in the field of computer vision. For generating depth maps with better details, we present an efficacious monocular depth prediction framework with coordinate attention and feature fusion. Specifically, the proposed framework contains attention, multi-scale and feature fusion modules. The attention module improves features based on coordinate attention to enhance the predicted effect, whereas the multi-scale module integrates useful low- and high-level contextual features with higher resolution. Moreover, we developed a feature fusion module to combine the heterogeneous features to generate high-quality depth outputs. We also designed a hybrid loss function that measures prediction errors from the perspective of depth and scale-invariant gradients, which contribute to preserving rich details. We conducted the experiments on public RGBD datasets, and the evaluation results show that the proposed scheme can considerably enhance the accuracy of depth prediction, achieving 0.051 for log10 and 0.992 for δ<1.253 on the NYUv2 dataset.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.




Cite this article
[APA Style]
Xu, H. & Li, F. (2022). An Efficient Monocular Depth Prediction Network Using Coordinate Attention and Feature Fusion. Journal of Information Processing Systems, 18(6), 794-802. DOI: 10.3745/JIPS.02.0187.

[IEEE Style]
H. Xu and F. Li, "An Efficient Monocular Depth Prediction Network Using Coordinate Attention and Feature Fusion," Journal of Information Processing Systems, vol. 18, no. 6, pp. 794-802, 2022. DOI: 10.3745/JIPS.02.0187.

[ACM Style]
Huihui Xu and Fei Li. 2022. An Efficient Monocular Depth Prediction Network Using Coordinate Attention and Feature Fusion. Journal of Information Processing Systems, 18, 6, (2022), 794-802. DOI: 10.3745/JIPS.02.0187.