LLM Infrastructure
Moonshot AI's PrfaaS Rethinks LLM Serving at Scale
Researchers from Moonshot AI and Tsinghua unveil PrfaaS, a cross-datacenter KVCache architecture that decouples prefill from decode to improve LLM serving efficiency at scale.