Publications

* Equal contribution, ✉ Corresponding author

2026

  1. milr_m.png
    MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning
    Yapeng MiHengli Li , Yanpeng Zhao , Chenxi Li , Huimin Wu , Xiaojian MaSong-Chun ZhuYing Nian Wu , and Qing Li
    International Conference on Learning Representations (ICLR), 2026

2025

  1. sport.png
    Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning
    Pengxiang Li* , Zhi Gao* , Bofei ZhangYapeng MiXiaojian Ma , Chenrui Shi , Tao Yuan , Yuwei WuYunde JiaSong-Chun Zhu , and Qing Li
    Advances in Neural Information Processing Systems (NeurIPS), 2025
  2. building.png
    Building LLM Agents by Incorporating Insights from Computer Systems
    Yapeng MiZhi GaoXiaojian Ma , and Qing Li
    arXiv preprint arXiv:2504.04485, 2025