Skip to content
View Xunzhuo's full-sized avatar
🎲
Exploring AI Networks
🎲
Exploring AI Networks

Block or report Xunzhuo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Xunzhuo/README.md

Profile View Counter Linkedln Zhihu Badge Gmail Badge Wechat Badge

Bit is exploring the frontier tech in combination of networking and LLM at Tencent. He is currently working around AI Infrastructure, serving as the Chair of K8S AI Gateway WG. Previously, he has been involved in research at UESTC NLP Lab.

Bit is leading the development of vLLM Semantic Router, an intelligent auto reasoning router for Efficient LLM Inference on Mixture-of-Models, saving tons of cost by advanced routing algorithm.

As a CNCF Ambassador and Linux Foundation LFAPAC, Bit serves on the Envoy Gateway Steering Committee. He also maintains multiple projects including Envoy AI Gateway, vLLM AIBrix, Istio, Kiali, Aeraki-Mesh, and Merbridge, as well as the approver of Higress and MOSN. Additionally, Bit contributes as a Kubernetes Gateway API and Kubernetes Ingress2Gateway reviewer and member of Kubernetes.

Pinned Loading

  1. vllm-project/semantic-router vllm-project/semantic-router Public

    Intelligent Mixture-of-Models Router for Efficient LLM Inference

    Go 1.5k 130

  2. envoyproxy/gateway envoyproxy/gateway Public

    Manages Envoy Proxy as a Standalone or Kubernetes-based Application Gateway

    Go 2.1k 549

  3. envoyproxy/ai-gateway envoyproxy/ai-gateway Public

    Manages Unified Access to Generative AI Services built on Envoy Gateway

    Go 1.1k 102

  4. vllm-project/aibrix vllm-project/aibrix Public

    Cost-efficient and pluggable Infrastructure components for GenAI inference

    Go 4.3k 463