Multimodal large language models (MLLMs) face an inherent trade-off between faithfulness and creativity, as different tasks require varying degrees of associative reasoning. However, existing methods lack the flexibility to modulate this reasoning strength, limiting MLLMs' adaptability across factual and creative scenarios. To bridge this gap, we propose equipping MLLMs with mechanisms that enable flexible control over associative reasoning. We begin by investigating the internal mechanisms underlying associative behavior in MLLMs and find that: (1) middle layers play a pivotal role in shaping model's associative tendencies, (2) modifying representations in these layers effectively regulates associative reasoning strength, and (3) hallucinations can be exploited to derive steering vectors that guide this modulation. Building on these findings, we introduce Flexible Association Control (FlexAC), a lightweight and training-free framework for modulating associative behavior in MLLMs. FlexAC first induces hallucination-guided intermediate representations to encode associative directions. Then, it selects high-association instances to construct effective associative steering vectors, whose strengths are adaptively calibrated to balance creative guidance with output stability. Finally, recognizing the multi-dimensional nature of associative reasoning, FlexAC incorporates task-specific associative vectors derived from a forward pass on a few target-domain samples, enabling models to follow diverse associative directions and better adapt to creative tasks. Notably, our method achieves up to a 5.8x improvement in creativity on Creation-MMBench and a 29% reduction in hallucination rate on CHAIR, surpassing existing baselines and demonstrating its effectiveness in enabling flexible control over associative reasoning in MLLMs. Our code is available at https://github.com/ylhz/FlexAC.
翻译:多模态大语言模型(MLLMs)面临着忠实性与创造性之间的固有权衡,因为不同任务需要不同程度的关联推理。然而,现有方法缺乏调节这种推理强度的灵活性,限制了MLLMs在事实性与创造性场景间的适应能力。为弥补这一差距,我们提出为MLLMs配备能够灵活控制关联推理的机制。我们首先研究了MLLMs中关联行为的内在机制,发现:(1)中间层在塑造模型的关联倾向上起着关键作用;(2)修改这些层的表示可有效调节关联推理强度;(3)幻觉可被用于推导引导这种调节的导向向量。基于这些发现,我们提出了灵活关联控制(FlexAC),一个轻量级且无需训练的框架,用于调节MLLMs中的关联行为。FlexAC首先诱导幻觉引导的中间表示以编码关联方向。接着,它选择高关联实例以构建有效的关联导向向量,其强度被自适应校准以平衡创造性引导与输出稳定性。最后,认识到关联推理的多维特性,FlexAC整合了通过少量目标域样本前向传播得到的任务特定关联向量,使模型能够遵循多样化的关联方向并更好地适应创造性任务。值得注意的是,我们的方法在Creation-MMBench上实现了高达5.8倍的创造力提升,并在CHAIR上实现了29%的幻觉率降低,超越了现有基线,证明了其在实现MLLMs中关联推理灵活控制方面的有效性。我们的代码发布于https://github.com/ylhz/FlexAC。