Abstract: As the interest in event-based vision sensors for mobile and aerial applications grows, there is an increasing need for high-speed and highly robust algorithms for performing visual tasks ...
Abstract: A key issue with existing deep-learning-based object detection methods for remote sensing images is that they often struggle to distinguish small object regions from the background due to ...
In this work, we tackle the complex challenge of image denoising, especially in scenarios where high-resolution images are corrupted by noise, making it difficult to restore their quality. Our ...
GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture. It introduces Multi-Token Prediction (MTP) loss and stable full-task ...