Jiwen Jiang

AI Researcher

Institute of Automation, Chinese Academy of Sciences

About Me

I am a third-year master’s student at the Institute of Automation, Chinese Academy of Sciences, under the supervision of Prof. Haifeng Zhang , collaborated with Prof. Jun Wang . My research focuses on Hardware Co-design for LLM, Agentic RL with tool use, and efficient LLM inference.

Latest News

2026-04-15

I have joined the Ernie team and research on the next generation of architecture design for LLM.

2026-02-10

In collaboration with Li Auto, we explored a hardware co-design law that jointly characterizes model accuracy and inference performance ( arXiv:2602.10377 ).

2024-03-30

I have been curating resources for system areas such as Dissys and MLsys .

2023-10-19

I have been curating resources for CUDA programming: CUDA .

2023-10-15

I have been curating a list of models for code generation: LLMs for Code .

Selected Publications [Full List]

* Equal contribution; ^✉ Corresponding author

Feasible Constraint Policy Optimization for Safe Reinforcement Learning

Luoyang Sun*, Jiwen Jiang*, Ning Yang^✉, Rasul Tutunov, Haifeng Zhang^✉, Jun Wang

AAMAS 2026 The 25th International Conference on Autonomous Agents and Multiagent Systems

We introduce Feasible Constraint Policy Optimization (FCPO), which seamlessly combines penalty and trust region methods to address policy feasibility while ensuring stability and performance.

📄 Paper 💻 Code

PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing

Cheng Deng*^✉, Luoyang Sun*, Jiwen Jiang*, Yongcheng Zeng, Xinjian Wu, Wenxin Zhao, Qingfa Xiao, Jiachuan Wang, Haoyang Li, Lei Chen^✉, Lionel M. Ni, Haifeng Zhang, Jun Wang^✉

arXiv 2025 arXiv preprint

We introduce the PLM, a Peripheral Language Model, developed through a co-design process that jointly optimizes model architecture and edge system constraints.

📄 Paper 💻 Code

Education

Master, Institute of Automation, Chinese Academy of Sciences, Beijing.

2023 -2026

Bachelor, Nanjing University, Nanjing.

2019 - 2023

Experience

Li Auto, MindGPT & MindVLA

2025-2026

Meituan, RL Infra & Algorithms

2025-2026

AMD, LLM & MLLM Inference Acceleration

2025-2026