arxiv.org

cs.CV updates on arXiv.org

Latest posts

Last updated about 21 hours ago

Uncertainty-Supervised Interpretable and Robust Evidential Segmentation

about 21 hours ago

arXiv:2509.17098v2 Announce Type: replace Abstract: Uncertainty estimation has been widely studied in...

StegOT: Trade-offs in Steganography via Optimal Transport

about 21 hours ago

arXiv:2509.11178v2 Announce Type: replace Abstract: Image hiding is often referred to as...

UrbanTwin: Building High-Fidelity Digital Twins for Sim2Real LiDAR Perception and Evaluation

about 21 hours ago

arXiv:2509.02903v2 Announce Type: replace Abstract: LiDAR-based perception in intelligent transportation systems (ITS)...

In the Eye of MLLM: Benchmarking Egocentric Video Intent Understanding with Gaze-Guided Prompting

about 21 hours ago

arXiv:2509.07447v2 Announce Type: replace Abstract: The emergence of advanced multimodal large language...

InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts

about 21 hours ago

arXiv:2509.10813v2 Announce Type: replace Abstract: The advancement of Embodied AI heavily relies...

Probabilistic Temporal Masked Attention for Cross-view Online Action Detection

about 21 hours ago

arXiv:2508.17025v2 Announce Type: replace Abstract: As a critical task in video sequence...

Contrast Sensitivity in Multimodal Large Language Models: A Psychophysics-Inspired Evaluation

about 21 hours ago

arXiv:2508.10367v2 Announce Type: replace Abstract: Understanding how Multimodal Large Language Models (MLLMs)...

KonfAI: A Modular and Fully Configurable Framework for Deep Learning in Medical Imaging

about 21 hours ago

arXiv:2508.09823v2 Announce Type: replace Abstract: KonfAI is a modular, extensible, and fully...

NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows

about 21 hours ago

arXiv:2508.16845v2 Announce Type: replace Abstract: Recent advances in Vision-Language-Action (VLA) models have...

Boosting Generic Semi-Supervised Medical Image Segmentation via Diverse Teaching and Label Propagation

about 21 hours ago

arXiv:2508.08549v2 Announce Type: replace Abstract: Both limited annotation and domain shift are...

STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes

about 21 hours ago

arXiv:2508.10427v2 Announce Type: replace Abstract: Vision-Language Models (VLMs) have been applied to...

Levarging Learning Bias for Noisy Anomaly Detection

about 21 hours ago

arXiv:2508.07441v2 Announce Type: replace Abstract: This paper addresses the challenge of fully...