Environmental Understanding Vision-Language Model for Embodied Agent

Published in IEEE/CVF Conference on Computer Vision and Pattern Recognition Findings (CVPRF), 2026

Recommended citation: Jinsik Bang, Jaeyeon Bae, Donggye Lee, Siyeol Jung, Taehwan Kim. "Environmental Understanding Vision-Language Model for Embodied Agent." CVPRF 2026.
Download Paper