I'm a graduating PhD student at UChicago, working on large language model inference. Check out my home page for more about me!
- 🚀 Working on the vLLM project as a vLLM team member. My contributions:
  - Performance dashboard: perf.vllm.ai.
  - Performance comparison with other LLM inference engines: the end of the blog.
  - Features: disaggregated prefilling and CPU offloading.
- 💾 Contributing to the LMCache project, exploring fun ideas in KV caches.
- 🎮 Gaming: League of Legends, Stardew Valley, Go
- 💃 Street Dance: Locking main, but I also dance waacking.
- 🎤 Singing: Loch Lomond and 传奇 Legend
- Email: [email protected]
- LinkedIn: Kuntai Du