gpu
Browse all articles, tutorials, and guides about gpu
2 posts
Posts
Kubernetes | 7 min read
Kubernetes 1.37 Just Locked Its Feature Set: What Made the Cut
The enhancements freeze for Kubernetes 1.37 landed on June 17, so the shape of the August release is now decided. GPU partitioning keeps maturing for AI workloads, and a cgroup v1 change will stop some kubelets from starting. Here is what is locked in and what to check before you upgrade.
Kubernetes | 11 min read
How NetEase Games Cut LLM Cold Starts From 42 Minutes to 30 Seconds Using Fluid
NetEase Games published a Kubernetes case study walking through how they cut their serverless GPU inference cold-start time from 42 minutes to under 30 seconds. The bottleneck was not the GPU; it was the 60 GB of model weights crossing a region boundary. Here is what they did with the CNCF Fluid project and how to apply the same pattern even if you are not on Kubernetes.