👋 Hi, I’m Xiao, and I recently graduated from SJTU. I was rejected by all US CS PhD programs in the Fall 2025 application season.
- Cloud Computing
- Machine Learning Systems
- Currently working on LLM Serving Systems.
- ICSE-SEIP'23
- EuroSys'24
- ASPLOS'24
- RagInfer (OSDI'25 submission)
- AgentServing (OSDI'25 submission, co-first author)
- Aceso: Auto Parallel DNN Training
- RagInfer: low-latency RAG inference system
- Autellix: high-throughput LLM agent serving system
- DeepScaler: RL LLM training
📫 Feel free to email me at [email protected] if you are interested in my work.