system-design-101/data/guides/why-is-kafka-fast.md
Kamran Ahmed ee4b7305a2 Adds ByteByteGo guides and links (#106)
2025-03-31 22:16:44 -07:00

---
title: "Why is Kafka Fast?"
description: "Explore the key design choices behind Kafka's high performance."
image: "https://assets.bytebytego.com/diagrams/0424-why-is-kafka-fast.jpg"
createdAt: "2024-02-05"
draft: false
categories:
- database-and-storage
tags:
- "Kafka"
- "Performance"
---
![Diagram: why is Kafka fast?](https://assets.bytebytego.com/diagrams/0424-why-is-kafka-fast.jpg)
Many design decisions contribute to Kafka's performance. In this post, we'll focus on the two that we think carry the most weight.
## Sequential I/O
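The first one is Kafka's reliance on sequential I/O. Kafka appends messages to the end of a log file rather than updating data in place, and sequential disk access is typically much faster than random access.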
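To make the idea concrete, here is a minimal sketch of an append-only log in Python. It is not Kafka's storage format: the `AppendOnlyLog` class, the 4-byte length prefix, and the file name are all assumptions made up for illustration. The point is only that writes always land at the end of the file and reads scan forward from an offset.

```python
import os
import struct
import tempfile

# Hypothetical record framing for this sketch: 4-byte big-endian length, then payload.
HEADER = struct.Struct(">I")


class AppendOnlyLog:
    """Toy log file: records are appended at the end and never updated in place."""

    def __init__(self, path):
        self.path = path
        # "ab" mode means every write goes to the current end of the file.
        self._writer = open(path, "ab")
        self._end = os.path.getsize(path)  # byte offset of the next record

    def append(self, payload: bytes) -> int:
        """Append one record and return the byte offset where it starts."""
        offset = self._end
        self._writer.write(HEADER.pack(len(payload)) + payload)
        self._writer.flush()
        self._end += HEADER.size + len(payload)
        return offset

    def read_from(self, offset: int = 0):
        """Yield payloads in order, scanning forward from `offset`."""
        with open(self.path, "rb") as reader:
            reader.seek(offset)
            while True:
                header = reader.read(HEADER.size)
                if len(header) < HEADER.size:
                    return  # reached the end of the log
                (length,) = HEADER.unpack(header)
                yield reader.read(length)

    def close(self):
        self._writer.close()


def demo():
    """Append three records, then read back starting at the second one."""
    with tempfile.TemporaryDirectory() as tmp:
        log = AppendOnlyLog(os.path.join(tmp, "segment.log"))
        offsets = [log.append(m) for m in (b"a", b"bb", b"ccc")]
        records = list(log.read_from(offsets[1]))
        log.close()
        return offsets, records
```

A consumer in this sketch only needs to remember a byte offset: reading resumes with a single seek followed by a forward scan, which is the access pattern disks handle best.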
## Zero Copy
The second design choice that gives Kafka its performance advantage is its focus on efficiency, specifically the zero-copy principle.
The diagram above illustrates how data is transmitted between producer and consumer, and what zero-copy means.
* Step 1.1 - 1.3: The producer writes data to disk
* Step 2: The consumer reads data without zero-copy
  * 2.1: The data is loaded from disk into the OS cache
  * 2.2: The data is copied from the OS cache to the Kafka application
  * 2.3: The Kafka application copies the data into the socket buffer
  * 2.4: The data is copied from the socket buffer to the network card
  * 2.5: The network card sends the data out to the consumer
* Step 3: The consumer reads data with zero-copy
  * 3.1: The data is loaded from disk into the OS cache
  * 3.2: The OS cache copies the data directly to the network card via the sendfile() system call
  * 3.3: The network card sends the data out to the consumer
Zero copy is a shortcut that avoids multiple data copies between the application context and the kernel context.
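The two read paths in steps 2 and 3 can be sketched in Python as follows. This is an illustration under stated assumptions, not Kafka's code: Kafka runs on the JVM, and the function names `send_with_copies`, `send_zero_copy`, and `transfer` are hypothetical helpers invented for this example. The sketch relies on the standard-library `socket.sendfile()`, which uses the `sendfile` system call where the platform supports it and otherwise falls back to plain `send()`.

```python
import os
import socket
import tempfile
import threading

CHUNK = 64 * 1024  # arbitrary buffer size chosen for this sketch


def send_with_copies(sock, path):
    """Step 2 style: data passes through an application buffer on its way out."""
    sent = 0
    with open(path, "rb") as f:
        while chunk := f.read(CHUNK):  # OS cache -> application buffer
            sock.sendall(chunk)        # application buffer -> socket buffer
            sent += len(chunk)
    return sent


def send_zero_copy(sock, path):
    """Step 3 style: ask the kernel to move file bytes to the socket itself."""
    with open(path, "rb") as f:
        # Uses os.sendfile() where available; otherwise falls back to send().
        return sock.sendfile(f)


def transfer(sender, path):
    """Send a file over a local socket pair; return (bytes_sent, bytes_received)."""
    left, right = socket.socketpair()
    received = bytearray()

    def drain():
        # Read until the sending side closes the connection.
        while data := right.recv(CHUNK):
            received.extend(data)

    reader = threading.Thread(target=drain)
    reader.start()
    try:
        sent = sender(left, path)
    finally:
        left.close()  # signals end-of-stream to the reader thread
    reader.join()
    right.close()
    return sent, bytes(received)


def demo():
    """Run both paths on the same file and return the payload plus both results."""
    payload = os.urandom(256 * 1024)
    with tempfile.NamedTemporaryFile(delete=False) as f:
        f.write(payload)
        path = f.name
    try:
        results = [transfer(fn, path) for fn in (send_with_copies, send_zero_copy)]
    finally:
        os.remove(path)
    return payload, results
```

Both functions deliver identical bytes to the receiver. The difference is that `send_with_copies` pulls every chunk into the Python process before handing it back to the kernel, while `send_zero_copy` never touches the file contents in application memory when the system call is available.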