This is the official PyTorch implementation of LLMDet. Recent open-vocabulary detectors achieve promising performance with abundant region-level annotated data. In this work, we show that an ...
Abstract: Visual grounding for remote sensing (RSVG) aims to detect objects in remote sensing scenes based on textual descriptions. While existing methods perform well on RSVG datasets, they are ...