Abstract: Video question answering (Video-QA) has emerged as a core task in the vision-language domain, which requires the models to understand a given video and answer textual questions related to ...
Abstract: Coastal wetland monitoring is essential for protecting marine and terrestrial ecosystems. However, the complex spatial, temporal, and spectral characteristics of these wetlands pose ...