system-design-101/data/guides/vertical-partitioning-vs-horizontal-partitioning.md at main

mirror of https://github.com/ByteByteGoHq/system-design-101.git synced 2026-04-08 03:07:24 -04:00

Files

Kamran Ahmed ee4b7305a2 Adds ByteByteGo guides and links (#106 )

This PR adds all the guides from [Visual
Guides](https://bytebytego.com/guides/) section on bytebytego to the
repository with proper links.

- [x] Markdown files for guides and categories are placed inside
`data/guides` and `data/categories`
- [x] Guide links in readme are auto-generated using
`scripts/readme.ts`. Everytime you run the script `npm run
update-readme`, it reads the categories and guides from the above
mentioned folders, generate production links for guides and categories
and populate the table of content in the readme. This ensures that any
future guides and categories will automatically get added to the readme.
- [x] Sorting inside the readme matches the actual category and guides
sorting on production

2025-03-31 22:16:44 -07:00

2.2 KiB

Raw Permalink Blame History

title, description, image, createdAt, draft, categories, tags

title

description

image

createdAt

draft

Routing algorithm

The routing algorithm decides which partition (shard) stores the data.

Range-based sharding. This algorithm uses ordered columns, such as integers, longs, timestamps, to separate the rows. For example, the diagram below uses the User ID column for range partition: User IDs 1 and 2 are in shard 1, User IDs 3 and 4 are in shard 2.
Hash-based sharding. This algorithm applies a hash function to one column or several columns to decide which row goes to which table. For example, the diagram below uses User ID mod 2 as a hash function. User IDs 1 and 3 are in shard 1, User IDs 2 and 4 are in shard 2.

Benefits

Facilitate horizontal scaling. Sharding facilitates the possibility of adding more machines to spread out the load.
Shorten response time. By sharding one table into multiple tables, queries go over fewer rows, and results are returned much more quickly.

Drawbacks

The order by operation is more complicated. Usually, we need to fetch data from different shards and sort the data in the application's code.
Uneven distribution. Some shards may contain more data than others (this is also called the hotspot).

2.2 KiB Raw Permalink Blame History Unescape Escape

Routing algorithm

Benefits

Drawbacks

2.2 KiB

Raw Permalink Blame History