Abstract: Based on analyzing the character of cascaded decoder architecture commonly adopted in existing DETR-like models, this paper proposes a new decoder architecture. The cascaded decoder ...
Abstract: This paper proposes a model-level fusion-based multi-modal object detection and recognition method. This method employs various modalities to process images, speech, videos, etc., and fuses ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results