UC Berkeley
Aug 2021 — Dec 2025
I am the co-founder and CTO of Inferact, a startup advancing the frontier of AI inference. I co-created and now co-lead the vLLM project, a widely-adopted open-source inference engine for LLMs.