Files
system-design-101/data/guides/how-can-cache-systems-go-wrong.md
Kamran Ahmed ee4b7305a2 Adds ByteByteGo guides and links (#106)
This PR adds all the guides from [Visual
Guides](https://bytebytego.com/guides/) section on bytebytego to the
repository with proper links.

- [x] Markdown files for guides and categories are placed inside
`data/guides` and `data/categories`
- [x] Guide links in readme are auto-generated using
`scripts/readme.ts`. Everytime you run the script `npm run
update-readme`, it reads the categories and guides from the above
mentioned folders, generate production links for guides and categories
and populate the table of content in the readme. This ensures that any
future guides and categories will automatically get added to the readme.
- [x] Sorting inside the readme matches the actual category and guides
sorting on production
2025-03-31 22:16:44 -07:00

2.0 KiB
Raw Permalink Blame History

title, description, image, createdAt, draft, categories, tags
title description image createdAt draft categories tags
How Can Cache Systems Go Wrong? Explore common cache pitfalls and effective mitigation strategies. https://assets.bytebytego.com/diagrams/0038-how-caches-can-go-wrong.png 2024-03-01 false
caching-performance
Caching
Performance

The diagram above shows 4 typical cases where caches can go wrong and their solutions.

1. Thunder Herd Problem

This happens when a large number of keys in the cache expire at the same time. Then the query requests directly hit the database, which overloads the database.

There are two ways to mitigate this issue: one is to avoid setting the same expiry time for the keys, adding a random number in the configuration; the other is to allow only the core business data to hit the database and prevent non-core data to access the database until the cache is back up.

2. Cache Penetration

This happens when the key doesnt exist in the cache or the database. The application cannot retrieve relevant data from the database to update the cache. This problem creates a lot of pressure on both the cache and the database.

To solve this, there are two suggestions.

  • Cache a null value for non-existent keys, avoiding hitting the database.
  • Use a bloom filter to check the key existence first, and if the key doesnt exist, we can avoid hitting the database.

3. Cache Breakdown

This is similar to the thunder herd problem. It happens when a hot key expires. A large number of requests hit the database.

Since the hot keys take up 80% of the queries, we do not set an expiration time for them.

4. Cache Crash

This happens when the cache is down and all the requests go to the database.

There are two ways to solve this problem.

  • Set up a circuit breaker, and when the cache is down, the application services cannot visit the cache or the database.
  • Set up a cluster for the cache to improve cache availability.