Model performance engineer

Yineng Zhang

About

Yineng Zhang is a Software Engineer at Baseten Model Performance team. He is also a core developer of the SGLang project.

Model performance

Day zero benchmarks for Qwen 3 with SGLang on Baseten

Qwen 3 235B: open-source MoE LLM brings frontier reasoning to 4 H100 GPUs. See benchmarks, SGLang setup, and FP8 tips for cost-efficient inferencing.

2 others

Machine learning infrastructure that just works

Baseten provides all the infrastructure you need to deploy and serve ML models performantly, scalable, and cost-efficiently.