R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models Paper • 2410.17885 • Published Oct 23, 2024
LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance Paper • 2507.06272 • Published Jul 8, 2025
HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration Paper • 2510.27266 • Published Oct 31, 2025 • 21