](https://deep-paper.org/en/paper/2402.08680/images/cover.png)
MARINE: A Training-Free Framework to Stop Vision-Language Models from Hallucinating
Introduction The rapid rise of Large Vision-Language Models (LVLMs) like LLaVA, mPLUG-Owl, and GPT-4V has revolutionized how machines understand the world. By aligning visual encoders with powerful Large Language Models (LLMs), these systems can look at an image and describe it, answer complex questions about it, or even reason through visual problems. However, despite their impressive capabilities, these models suffer from a critical and often embarrassing flaw: Object Hallucination. Object hallucination occurs when an LVLM confidently describes objects in an image that simply aren’t there. For a casual user, this might result in a funny caption. But in safety-critical domains—such as medical imaging analysis or autonomous navigation—a model “seeing” a tumor that doesn’t exist or a stop sign that isn’t present poses severe risks. ...
](https://deep-paper.org/en/paper/2504.16925/images/cover.png)
](https://deep-paper.org/en/paper/2504.08201/images/cover.png)
](https://deep-paper.org/en/paper/2502.05749/images/cover.png)
](https://deep-paper.org/en/paper/11875_gmail_generative_modalit-1762/images/cover.png)
](https://deep-paper.org/en/paper/4514_discovering_a_zero_zero_v-1761/images/cover.png)
](https://deep-paper.org/en/paper/5707_efficient_source_free_unl-1760/images/cover.png)
](https://deep-paper.org/en/paper/5576_visual_and_domain_knowled-1759/images/cover.png)
](https://deep-paper.org/en/paper/8491_large_language_model_driv-1758/images/cover.png)
](https://deep-paper.org/en/paper/8317_scaling_trends_in_languag-1757/images/cover.png)
](https://deep-paper.org/en/paper/2412.03719/images/cover.png)
](https://deep-paper.org/en/paper/2502.01925/images/cover.png)
](https://deep-paper.org/en/paper/2507.08285/images/cover.png)
](https://deep-paper.org/en/paper/2411.16829/images/cover.png)
](https://deep-paper.org/en/paper/2506.05035/images/cover.png)
](https://deep-paper.org/en/paper/14278_policy_labeled_preferenc-1750/images/cover.png)
](https://deep-paper.org/en/paper/2407.04516/images/cover.png)
](https://deep-paper.org/en/paper/2505.03393/images/cover.png)
](https://deep-paper.org/en/paper/5715_bridging_layout_and_rtl_k-1747/images/cover.png)
](https://deep-paper.org/en/paper/2502.14770/images/cover.png)