Abstract: Real-time acquisition of airport scene information is crucial for airport safety and optimization of airport utilization efficiency. However, detecting airport objects is still a challenging ...
Abstract: This study proposes an image-text multimodal classification algorithm based on a combination of convolutional neural networks (CNN) and Transformer, aiming to solve the key problems in ...