Dwarkesh Patel, April 30, 2026
How GPT-5, Claude, and Gemini are actually trained and served – Reiner Pope
2:13:40

Rematerialization vs. Storage Costs — Dwarkesh Patel

From How GPT-5, Claude, and Gemini are actually trained and served – Reiner Pope. Category: Tech. Format: Commentary. This is a single keypoint from the analysis.

Re-creating (rematerializing) the KV cache from scratch costs mainly compute, whereas storing it in a memory tier such as HBM costs capacity and bandwidth. Which option is cheaper depends on how long the cache must be held: the shorter the hold, the more a faster, more expensive tier is favored.
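
The trade-off can be made concrete with a toy cost model. The following is a minimal sketch, not a method taken from the video: every price, bandwidth, and size is a made-up illustrative number, the DRAM and SSD tiers are assumptions added for contrast (the keypoint names only HBM), and the names `Tier`, `storage_cost`, `rematerialize_cost`, and `cheapest_option` are hypothetical. Storage is modeled as a capacity charge that grows with hold time plus a one-time reload charge limited by the tier's bandwidth. Rematerialization is modeled as a fixed compute charge.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    name: str
    dollars_per_gb_second: float  # capacity cost (hypothetical number)
    read_gb_per_second: float     # bandwidth back to the accelerator (hypothetical)


# Illustrative tiers only; none of these figures come from the source.
TIERS = [
    Tier("HBM", 1e-5, 2000.0),
    Tier("DRAM", 1e-7, 50.0),
    Tier("SSD", 1e-9, 5.0),
]


def storage_cost(tier, cache_gb, hold_seconds, gpu_dollars_per_second):
    """Capacity charge for the hold plus accelerator time spent reloading."""
    capacity = cache_gb * tier.dollars_per_gb_second * hold_seconds
    reload = (cache_gb / tier.read_gb_per_second) * gpu_dollars_per_second
    return capacity + reload


def rematerialize_cost(prefill_seconds, gpu_dollars_per_second):
    """Compute charge for rebuilding the cache from the prompt."""
    return prefill_seconds * gpu_dollars_per_second


def cheapest_option(tiers, cache_gb, hold_seconds, prefill_seconds,
                    gpu_dollars_per_second):
    """Return the name of the cheapest choice for a given hold time."""
    options = {
        "rematerialize": rematerialize_cost(prefill_seconds,
                                            gpu_dollars_per_second)
    }
    for tier in tiers:
        options[tier.name] = storage_cost(tier, cache_gb, hold_seconds,
                                          gpu_dollars_per_second)
    return min(options, key=options.get)


if __name__ == "__main__":
    # Assumed workload: 2 GB cache, 1 s of prefill, $0.001 per GPU-second.
    for hold in (1.0, 60.0, 3600.0, 1_000_000.0):
        print(hold, cheapest_option(TIERS, 2.0, hold, 1.0, 0.001))
```

With these assumed numbers, a 1-second hold picks HBM, a 60-second hold picks DRAM, a 1-hour hold picks SSD, and a hold of a million seconds makes rematerialization cheapest. The crossover points depend entirely on the inputs, so only the shape of the result is meaningful: the capacity charge dominates as hold time grows, which pushes longer holds toward slower, cheaper tiers.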

Impact: Medium. Understanding how rematerialization and storage costs compare matters for efficient cache management and for the overall cost of serving a model.

In the source video, this keypoint occurs from 01:51:56 to 01:55:05.

Sources in support: Dwarkesh Patel (Host)

This keypoint analysis was generated by skim (skim.plus), an AI-powered content analysis platform by Credible AI.