Abstract: Benefiting from convolutional neural networks (CNNs), monocular depth estimation has achieved remarkable performance. However, most CNNs focus on learning local details of the scene, ...