Summary of "Highload 2. Кэширование" (Highload 2. Caching)

Core concepts

Cache pyramid: multiple layers trade response speed against cost and capacity. From smallest and fastest to largest and slowest: CPU registers → in-memory cache → local disk → network storage or database. Faster layers are smaller and cost more per byte.
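The layered lookup implied by the pyramid can be sketched roughly as follows. This is a minimal illustration, not something taken from the talk: the class name TieredCache, the load_from_source callback, and the use of plain dicts as stand-ins for the memory and disk layers are all assumptions made for the example.

```python
from typing import Callable, List, MutableMapping


class TieredCache:
    """Hypothetical sketch: check layers from fastest to slowest, promote hits upward."""

    def __init__(
        self,
        layers: List[MutableMapping[str, str]],
        load_from_source: Callable[[str], str],
    ) -> None:
        self.layers = layers  # ordered fastest first, e.g. [memory, disk]
        self.load_from_source = load_from_source  # slowest path, e.g. a database query

    def get(self, key: str) -> str:
        for depth, layer in enumerate(self.layers):
            if key in layer:
                value = layer[key]
                break
        else:
            # Missed every layer: fall through to the source of truth.
            depth = len(self.layers)
            value = self.load_from_source(key)
        # Copy the value into every layer faster than the one that answered.
        for faster in self.layers[:depth]:
            faster[key] = value
        return value
```

A hit in a slow layer is copied into the faster ones, so repeated reads of the same key drift toward the top of the pyramid. Capacity limits and eviction are deliberately left out of this sketch.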

What to cache (types and tradeoffs)

Performance model and metrics

Cache placement and client-side caching

Key-value cache model and TTL

Eviction and failure modes

Cold-start / warm-up

Stampede / thundering herd / race conditions

Invalidation strategies (coherence)

Three basic approaches when the underlying data changes:

  1. Synchronous update on write (write-through or invalidate-on-write)
    • Update or invalidate the cache entry at the moment the data changes.
    • Reads stay fast, but the write path can become expensive and complex when many dependent keys exist.
  2. Immediate eviction + background recompute
    • Invalidate quickly on write and trigger asynchronous tasks that rebuild the cache entries.
  3. Serve stale and recompute in the background
    • Soft-expire entries and let a single worker recompute the value while all other requests keep serving the stale one.
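The three approaches above can be sketched in one small in-process cache. This is an illustrative sketch under stated assumptions, not the implementation from the talk: the class SoftTTLCache, its method names, the loader callback, and the soft_ttl parameter are hypothetical, and a real deployment would more likely sit in front of a shared cache server than a Python dict.

```python
import threading
import time
from typing import Any, Callable, Dict, Optional, Set, Tuple


class SoftTTLCache:
    """Hypothetical sketch of the three invalidation approaches."""

    def __init__(self, loader: Callable[[str], Any], soft_ttl: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._loader = loader      # recomputes a value from the source of truth
        self._soft_ttl = soft_ttl  # age after which an entry counts as stale
        self._clock = clock
        self._data: Dict[str, Tuple[Any, float]] = {}  # key -> (value, stored_at)
        self._refreshing: Set[str] = set()             # keys with a rebuild in flight
        self._lock = threading.Lock()

    def write_through(self, key: str, value: Any) -> None:
        """Approach 1: overwrite the cached value as part of the write."""
        with self._lock:
            self._data[key] = (value, self._clock())

    def invalidate(self, key: str, rebuild: bool = False) -> Optional[threading.Thread]:
        """Approach 1 (plain invalidate) or approach 2 (evict, then rebuild in background)."""
        with self._lock:
            self._data.pop(key, None)
        return self._refresh_async(key) if rebuild else None

    def get(self, key: str) -> Any:
        """Approach 3: serve a stale value while at most one worker recomputes it."""
        with self._lock:
            entry = self._data.get(key)
        if entry is None:
            # Cold miss: computed inline. This path is NOT stampede-protected.
            return self._refresh(key)
        value, stored_at = entry
        if self._clock() - stored_at >= self._soft_ttl:
            self._refresh_async(key)  # no-op if a rebuild is already running
        return value

    def _refresh(self, key: str) -> Any:
        value = self._loader(key)
        with self._lock:
            self._data[key] = (value, self._clock())
        return value

    def _refresh_async(self, key: str) -> Optional[threading.Thread]:
        with self._lock:
            if key in self._refreshing:
                return None  # another worker already owns this rebuild
            self._refreshing.add(key)

        def work() -> None:
            try:
                self._refresh(key)
            finally:
                with self._lock:
                    self._refreshing.discard(key)

        worker = threading.Thread(target=work, daemon=True)
        worker.start()
        return worker
```

The set of in-flight keys is what limits each key to a single recomputing worker; every other caller returns the stale value immediately. A hard expiry on top of the soft TTL would bound how stale a served value can get, which this sketch leaves out.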

Notes:

Practical engineering patterns and recommendations

Examples from the talk

Tutorial / guide takeaways

Main speakers and referenced systems
