Workflow
GPU
icon
Search documents
X @Ansem
Ansem 🧸💸· 2026-05-09 18:11
RT Tulip King 🌷 (@tulipking)@blknoiz06 Intel: Agents use more CPU than GPU, the spend needs to rerate - therefore the comp is unironically an Nvidia that fabs its own chips and packages the chips for other companies. American silicon supercycle ...
Google Cloud Managed Lustre for LLM Inference: Cut GPU Waste by 50%
DDN· 2026-05-08 19:49AI Processing
When you feed an LLM a massive multimodal file like a 100page legal contract, you're asking for a massive heavy lift. It takes about 20 seconds of intense computation just to generate the initial analysis. The model is storing its mathematical work as large tensors in what we call the KV cache.But GPU memory is premium real estate. If the user steps away for lunch or the memory reaches capacity, that context is evicted to make room for other tasks. With managed luster, rather than evicting the context, we c ...