Publications
Kimi K2.6: Advancing Open-Source Coding
We are open sourcing our latest model, Kimi K2.6, featuring state-of-the-art coding, long-horizon execution, and agent swarm …
Kimi Team, Yuyao Ge 葛钰峣
Blog · Hugging Face
SkillForge: Co-Evolving Skills and Agents via Dynamic Skill Lifecycles
An agentic RL method that evolves the skill library through a fitness-driven lifecycle, enabling skills and the model to co-evolve throughout training.
Yuyao Ge 葛钰峣, Shenghua Liu, Yiwei Wang, Tianyu Liu, Yuchen He, Baolong Bi, Lingrui Mei, Jiayu Yao, Lizhe Chen, Jiafeng Guo, Xueqi Cheng
PDF
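As a rough illustration of the fitness-driven lifecycle idea above (the class names, EMA update, and retirement threshold are hypothetical, not taken from the paper), a skill library might track per-skill fitness from rollout outcomes and retire skills that decay below a threshold:

```python
# Hypothetical sketch of a fitness-driven skill lifecycle; the actual
# SkillForge algorithm may differ substantially.
from dataclasses import dataclass

@dataclass
class Skill:
    name: str
    fitness: float = 0.5  # assumed neutral prior, updated from rollouts

class SkillLibrary:
    def __init__(self, decay: float = 0.9, retire_below: float = 0.4):
        self.skills: dict[str, Skill] = {}
        self.decay = decay              # EMA weight on past fitness (assumption)
        self.retire_below = retire_below

    def add(self, name: str) -> None:
        self.skills.setdefault(name, Skill(name))

    def record_outcome(self, name: str, success: bool) -> None:
        # Exponential moving average of rollout success keeps fitness current.
        s = self.skills[name]
        s.fitness = self.decay * s.fitness + (1 - self.decay) * float(success)

    def lifecycle_step(self) -> list[str]:
        # Retire low-fitness skills, freeing room for newly synthesized ones,
        # so the library co-evolves with the policy during training.
        retired = [n for n, s in self.skills.items() if s.fitness < self.retire_below]
        for n in retired:
            del self.skills[n]
        return retired

lib = SkillLibrary()
lib.add("parse_stack_trace")
for ok in [True, False, False, False, False]:
    lib.record_outcome("parse_stack_trace", ok)
print(lib.lifecycle_step())  # ['parse_stack_trace']: fitness fell below 0.4
```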
Prism-Δ: Differential Subspace Steering for Prompt Highlighting in Large Language Models
We propose PRISM-Δ, a differential subspace steering method for prompt highlighting that matches or exceeds the best existing method on 19 of 20 configurations with relative gains up to +10.6%, while halving the fluency cost.
Yuyao Ge 葛钰峣, Shenghua Liu, Yiwei Wang, Tianyu Liu, Baolong Bi, Lingrui Mei, Jiayu Yao, Jiafeng Guo, Xueqi Cheng
PDF · arXiv · Hugging Face · GitHub · Project
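The paper's exact construction is not reproduced here, but the general shape of differential subspace steering can be sketched as follows (the SVD-based subspace and the additive steering rule are illustrative assumptions): take hidden-state differences between highlighted and plain prompts, extract a low-rank subspace, and add a scaled projection of the differential direction at inference time.

```python
# Minimal numpy sketch of differential subspace steering; PRISM-Δ's
# actual method may differ. Random vectors stand in for LLM hidden states.
import numpy as np

rng = np.random.default_rng(0)
d, n_pairs, rank = 64, 32, 4

h_plain = rng.normal(size=(n_pairs, d))                    # no highlighting
h_highlight = h_plain + rng.normal(scale=0.1, size=(n_pairs, d))

# Low-rank "highlighting" subspace from the difference vectors.
diffs = h_highlight - h_plain
_, _, vt = np.linalg.svd(diffs, full_matrices=False)
U = vt[:rank].T                      # (d, rank) basis of the subspace
P = U @ U.T                          # projector onto the subspace

def steer(h: np.ndarray, delta: np.ndarray, alpha: float = 1.0) -> np.ndarray:
    """Add only the subspace component of a differential direction."""
    return h + alpha * (P @ delta)

h = rng.normal(size=d)
delta = diffs.mean(axis=0)           # average highlighting direction
print(np.linalg.norm(steer(h, delta) - h))  # nonzero: steering applied
```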
Do Large Language Models Already Know the Answer Before They Finish Thinking?
Probing hidden states during reasoning reveals that LLMs already know the answer before finishing thinking. We detect overthinking via ‘jumps’ and intervene during inference to improve reasoning.
Yuyao Ge 葛钰峣, Shenghua Liu, Yiwei Wang, Tianyu Liu, Lingrui Mei, Baolong Bi, Jiayuan Guo, Jiayu Yao, Jiafeng Guo, Xueqi Cheng
Code · PDF
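A toy version of the probing setup (the probe form and the jump criterion below are assumptions for illustration; the paper probes real LLM hidden states): apply a linear probe to the hidden state at each reasoning step and flag the first sharp confidence "jump", after which further reasoning is candidate overthinking.

```python
# Toy sketch of answer probing during reasoning. Random features stand
# in for actual per-step LLM hidden states.
import numpy as np

rng = np.random.default_rng(0)
steps, d = 20, 32
w_true = rng.normal(size=d)

# Fake per-step hidden states that drift toward an "answer" direction.
hidden = np.array([rng.normal(scale=0.5, size=d) + (t / steps) * w_true
                   for t in range(steps)])

def probe_confidence(h: np.ndarray, w: np.ndarray) -> float:
    # Linear probe + sigmoid, standing in for a trained probe.
    return 1.0 / (1.0 + np.exp(-h @ w))

conf = np.array([probe_confidence(h, w_true) for h in hidden])

# Detect a confidence "jump": the first step with a sharp rise; steps
# after it are where one could intervene and stop generating.
jumps = np.flatnonzero(np.diff(conf) > 0.15)
answer_ready = int(jumps[0]) + 1 if len(jumps) else steps
print(f"answer stable from step {answer_ready} of {steps}")
```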
PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding
We present PromptCD, a test-time method for controlling LLM behavior without additional training. The approach creates paired positive and negative guiding prompts for a target behavior and contrasts model responses at the token-probability level for LLMs and through visual attention patterns for VLMs.
Baolong Bi, Yuyao Ge 葛钰峣, Shenghua Liu, Yuchen He, Siqian Tong, Lizhe Chen, Lingrui Mei, Zehao Li, Yiwei Wang, Yujun Cai, Ming-Hsuan Yang, Xueqi Cheng
PDF · arXiv
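For the text side, the token-probability contrast can be sketched as follows; the (1 + α)·pos − α·neg combination is one common contrastive-decoding form, assumed here rather than taken from the paper. The model runs twice, once under each polarity prompt, and the next-token logits are contrasted:

```python
# Sketch of polarity-prompt contrastive decoding on next-token logits.
# `logits_pos` / `logits_neg` stand in for a model run under positive
# and negative guiding prompts.
import numpy as np

def contrastive_logits(logits_pos: np.ndarray,
                       logits_neg: np.ndarray,
                       alpha: float = 0.5) -> np.ndarray:
    # Amplify what the positive prompt favors, subtract what the
    # negative prompt favors.
    return (1 + alpha) * logits_pos - alpha * logits_neg

rng = np.random.default_rng(0)
vocab = 10
logits_pos = rng.normal(size=vocab)
logits_neg = logits_pos.copy()
logits_neg[3] += 2.0          # the negative prompt boosts an unwanted token

combined = contrastive_logits(logits_pos, logits_neg)
print(int(np.argmax(logits_pos)), int(np.argmax(combined)))
# Tokens favored only under the negative prompt are suppressed.
```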
Kimi K2.5: Visual Agentic Intelligence
We introduce Kimi K2.5, an open-source multimodal agentic model designed to advance general agentic intelligence. The model focuses on …
Kimi Team, Yuyao Ge 葛钰峣
PDF · arXiv · Blog · Hugging Face · GitHub
Modality-Grounded Contrastive Decoding for Cross-Modal Hallucination Mitigation
A training-free framework that softly anchors fused predictions toward the target modality’s own judgment when overshooting, mitigating cross-modal hallucination in multimodal LLMs.
Yuyao Ge 葛钰峣, Shenghua Liu, Yiwei Wang, Tianyu Liu, Baolong Bi, Lingrui Mei, Jiayu Yao, Jiafeng Guo, Xueqi Cheng
PDF
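One way to picture the soft anchoring (the KL-based overshoot test and interpolation weight below are guesses at the general shape, not the paper's formula): when the fused multimodal prediction drifts too far from the target modality's own distribution, interpolate it back toward that distribution.

```python
# Illustrative soft anchoring of fused logits toward the target modality.
import numpy as np

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def anchor(fused_logits, target_logits, tau=0.5, lam=0.5):
    p_f, p_t = softmax(fused_logits), softmax(target_logits)
    kl = float(np.sum(p_t * np.log(p_t / p_f)))  # divergence from target view
    if kl > tau:  # "overshooting": fusion left the target modality's judgment
        return (1 - lam) * fused_logits + lam * target_logits
    return fused_logits

rng = np.random.default_rng(0)
t = rng.normal(size=8)
f = t + np.array([0, 0, 4.0, 0, 0, 0, 0, 0])  # fused overshoots on token 2
print(float(np.abs(anchor(f, t) - f).max()))  # 2.0: anchoring kicked in
```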
Gated Differentiable Working Memory for Long-Context Language Modeling
Long contexts challenge transformers as attention scores dilute across thousands of tokens and critical information is often lost in the middle. We reframe test-time adaptation as a budget-constrained memory consolidation problem and propose Gdwm (Gated Differentiable Working Memory), which introduces a write controller that estimates Contextual Utility, an information-theoretic measure of long-range contextual dependence, to allocate gradient steps efficiently.
Lingrui Mei, Shenghua Liu, Yiwei Wang, Yuyao Ge 葛钰峣, Baolong Bi, Jiayu Yao, Jun Wan, Ziling Yin, Jiafeng Guo, Xueqi Cheng
PDF · arXiv
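A cartoon of the budget-constrained write controller (the random utility scores and proportional allocation below are stand-ins; the paper's Contextual Utility estimator is more principled): score each context segment by utility and spend the fixed budget of gradient steps where utility is highest.

```python
# Toy budget-constrained allocation of gradient steps across context
# segments, in the spirit of a write controller.
import numpy as np

rng = np.random.default_rng(0)
n_segments, budget = 6, 12          # total gradient steps available

utility = rng.random(n_segments)    # stand-in for contextual-utility scores

# Allocate steps proportionally to utility, flooring to integers and
# giving the leftover steps to the segments with the largest remainders.
raw = budget * utility / utility.sum()
steps = np.floor(raw).astype(int)
for i in np.argsort(raw - steps)[::-1][: budget - steps.sum()]:
    steps[i] += 1

assert steps.sum() == budget
print(dict(enumerate(steps.tolist())))  # steps per segment
```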
Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Reinforcement learning (RL) has shown great promise in enhancing LLM reasoning, but current approaches mainly focus on single domains with verifiable rewards. We propose RGR-GRPO, a rubric-driven RL framework for multi-domain reasoning that uses rubrics to provide fine-grained reward signals and offline guidance.
Baolong Bi, Shenghua Liu, Yiwei Wang, Siqian Tong, Lingrui Mei, Yuyao Ge 葛钰峣, Yilong Xu, Jiafeng Guo, Xueqi Cheng
PDF · arXiv
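The rubric-as-reward idea can be pictured as a weighted checklist that turns a free-form response into a dense scalar reward; the rubric items, weights, and substring "judge" below are invented placeholders (a real system would use an LLM judge):

```python
# Placeholder rubric-based reward: each rubric item is a checkable
# criterion with a weight.
def rubric_reward(response: str, rubric: list[tuple[str, float]]) -> float:
    total = sum(w for _, w in rubric)
    earned = sum(w for check, w in rubric if check in response)
    return earned / total  # dense reward in [0, 1], not just pass/fail

rubric = [
    ("states the governing equation", 0.3),
    ("checks units", 0.2),
    ("final answer: 42", 0.5),
]
# With fine-grained credit, a partially correct trace still gets signal,
# which is what lets RL explore beyond binary verifiable rewards.
print(rubric_reward("…states the governing equation… final answer: 42", rubric))
```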
A Survey of Vibe Coding with Large Language Models
The advancement of large language models (LLMs) has catalyzed a paradigm shift from code generation assistance to autonomous coding …
Yuyao Ge 葛钰峣, Lingrui Mei, Zenghao Duan, Tianhao Li, Yujia Zheng, Yiwei Wang, Lexin Wang, Jiayu Yao, Tianyu Liu, Yujun Cai, Baolong Bi, Fangda Guo, Jiafeng Guo, Shenghua Liu, Xueqi Cheng
PDF · arXiv · Hugging Face
Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning
Vision-Language Models (VLMs) have demonstrated remarkable success across diverse visual tasks, yet their performance degrades in …
Yuyao Ge 葛钰峣, Shenghua Liu, Yiwei Wang, Lingrui Mei, Baolong Bi, Xuanshan Zhou, Jiayu Yao, Jiafeng Guo, Xueqi Cheng
PDF · arXiv · PaperWeekly
Are All Prompt Components Value-Neutral? Understanding the Heterogeneous Adversarial Robustness of Dissected Prompt in Large Language Models
Prompt-based adversarial attacks have become an effective means to assess the robustness of large language models (LLMs). However, existing approaches often treat prompts as monolithic text, overlooking their structural heterogeneity: different prompt components contribute unequally to adversarial robustness.
Yujia Zheng, Tianhao Li, Haotian Huang, Tianyu Zeng, Jingyu Lu, Chuangxin Chu, Yuekai Huang, Ziyou Jiang, Qian Xiong, Yuyao Ge 葛钰峣, Mingyang Li
PDF · arXiv · X
Can Graph Descriptive Order Affect Solving Graph Problems with LLMs?
We present the first comprehensive analysis of how the order of graph descriptions impacts LLM performance, evaluating four graph description orders across six graph problems using six mainstream LLMs.
Yuyao Ge 葛钰峣, Shenghua Liu, Baolong Bi, Yiwei Wang, Lingrui Mei, Wenjie Feng, Lizhe Chen, Xueqi Cheng
Slides · Video · PDF · ACL Anthology · Poster · GitHub
Kimi K2: Open Agentic Intelligence
We introduce Kimi K2, a Mixture-of-Experts large language model with 32 billion activated parameters and 1 trillion total parameters. …
Kimi Team, Yuyao Ge 葛钰峣
PDF · arXiv · Blog · Hugging Face · GitHub
A Survey of Context Engineering for Large Language Models
The performance of Large Language Models (LLMs) is fundamentally determined by the contextual information provided during inference. …
Lingrui Mei, Jiayu Yao, Yuyao Ge 葛钰峣, Yiwei Wang, Baolong Bi, Yujun Cai, Jiazhi Liu, Mingyu Li, Zhong-Zhi Li, Duzhen Zhang, Chenlin Zhou, Jiayi Mao, Tianze Xia, Jiafeng Guo, Shenghua Liu
PDF · arXiv · Hugging Face · GitHub
Who is in the Spotlight: The Hidden Bias Undermining Multimodal Retrieval-Augmented Generation
Multimodal Retrieval-Augmented Generation (RAG) systems have become essential in knowledge-intensive and open-domain tasks. As retrieval complexity increases, ensuring the robustness of these systems is critical. However, current RAG models are highly sensitive to the order in which evidence is presented, often resulting in unstable performance and biased reasoning, particularly as the number of retrieved items or modality diversity grows.
Jiayu Yao, Shenghua Liu, Yiwei Wang, Lingrui Mei, Baolong Bi, Yuyao Ge 葛钰峣, Zhecheng Li, Xueqi Cheng
PDF · arXiv
PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression
Large language models (LLMs) have achieved remarkable progress, demonstrating unprecedented capabilities across various natural language processing tasks. However, the high costs associated with such exceptional performance limit the widespread adoption of LLMs, highlighting the need for prompt compression.
Lizhe Chen, Binjia Zhou, Yuyao Ge 葛钰峣, Jiayi Chen, Shiguang Ni
PDF · arXiv
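The link between attention and importance sampling might look like the following sketch (the scoring rule and sampling scheme are illustrative assumptions, not the paper's exact procedure): score each prompt token by the attention mass it receives, then sample a compressed subset with probability proportional to that score.

```python
# Toy attention-weighted importance sampling of prompt tokens. A random
# matrix stands in for the model's attention weights.
import numpy as np

rng = np.random.default_rng(0)
tokens = "The quick brown fox jumps over the lazy dog".split()
n, keep = len(tokens), 5

attn = rng.random((n, n))
attn /= attn.sum(axis=1, keepdims=True)   # row-stochastic attention stand-in

importance = attn.sum(axis=0)             # attention mass each token receives
p = importance / importance.sum()

# Sample without replacement, then restore original order so the
# compressed prompt stays readable.
idx = np.sort(rng.choice(n, size=keep, replace=False, p=p))
print(" ".join(tokens[i] for i in idx))
```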
a1: Steep Test-time Scaling Law via Environment Augmented Generation
Large Language Models (LLMs) have made remarkable breakthroughs in reasoning, yet continue to struggle with hallucinations, logical …
Lingrui Mei, Shenghua Liu, Yiwei Wang, Baolong Bi, Yuyao Ge 葛钰峣, Jun Wan, Yurong Wu, Xueqi Cheng
PDF
Innate Reasoning is Not Enough: In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking
We present the first comprehensive analysis of the impacts of CoT prompting on Reasoning LLMs, finding that one-shot CoT consistently enhances performance and reduces excessive reflections by approximately 90%.
Yuyao Ge 葛钰峣, Shenghua Liu, Yiwei Wang, Lingrui Mei, Lizhe Chen, Baolong Bi, Xueqi Cheng
PDF · arXiv · Hugging Face
Translating Words to Worlds: Zero-Shot Synthesis of 3D Terrain from Textual Descriptions Using Large Language Models
The current research on text-guided 3D synthesis predominantly utilizes complex diffusion models, posing significant challenges in tasks like terrain generation. This study ventures into the direct synthesis of text-to-3D terrain in a zero-shot fashion, circumventing the need for diffusion models.
Guangzi Zhang, Lizhe Chen, Yu Zhang, Yan Liu, Yuyao Ge 葛钰峣, Xingquan Cai
PDF
Frequency-Importance Gaussian Splatting for Real-Time Lightweight Radiance Field Rendering
In this work, we propose a dynamically adaptive density control strategy based on the degree of reconstruction of the scene background, which adapts the spatial sample-point generation strategy according to training results and prevents the generation of redundant data in the model.
Lizhe Chen, Yan Hu, Yu Zhang, Yuyao Ge 葛钰峣, Haoyu Zhang, Xingquan Cai
PDF
Attack based on data: A novel perspective to attack sensitive points directly
Adversarial attacks on time-series classification models are widely explored, and many attack methods have been proposed. But there is not a …
Yuyao Ge 葛钰峣, Zhongguo Yang, Lizhe Chen, Yiming Wang, Chengyang Li
Dataset · PDF · Page
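One plausible reading of "attacking sensitive points directly" (the linear model, saliency score, and epsilon step below are illustrative stand-ins, not the paper's method): rank time steps by gradient saliency and perturb only the top-ranked ones rather than the whole series.

```python
# Toy sensitive-point attack on a linear time-series classifier.
import numpy as np

rng = np.random.default_rng(0)
T, k, eps = 50, 5, 0.5
w = rng.normal(size=T)          # linear classifier: score = w @ x
x = rng.normal(size=T)

grad = w                        # d(score)/dx is exactly w for a linear model
sensitive = np.argsort(np.abs(grad))[-k:]   # most sensitive time steps

x_adv = x.copy()
x_adv[sensitive] -= eps * np.sign(grad[sensitive])  # push the score down

print(float(w @ x), float(w @ x_adv))  # score drops after perturbing k points
```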