Ernie

I have joined the Ernie team, where I research next-generation architecture design for LLMs.

April 15, 2026 · 1 min · Theme PaperMod

Scaling

In collaboration with Li Auto, we explored a hardware co-design law that jointly characterizes model accuracy and inference performance (arXiv:2602.10377).

February 10, 2026 · 1 min · Theme PaperMod

Curated resources: distributed systems & ML systems

I have been curating resources for systems areas such as distributed systems (Dissys) and ML systems (MLsys).

March 30, 2024 · 1 min · Theme PaperMod

Curated resources: CUDA programming

I have been curating resources for CUDA programming: CUDA.

October 19, 2023 · 1 min · Theme PaperMod

Curated list: LLMs for code generation

I have been curating a list of models for code generation: LLMs for Code.

October 15, 2023 · 1 min · Theme PaperMod