Commit 38c9025

resharding

1 parent fbe10e9

7 files changed: +145 -20 lines changed

Lines changed: 6 additions & 0 deletions

@@ -0,0 +1,6 @@
nav:
  - 'index.md'
  - 'databases.md'
  - 'schema.md'
  - 'hash.md'
  - 'cutover.md'

docs/features/sharding/resharding/cutover.md

Lines changed: 7 additions & 0 deletions

@@ -0,0 +1,7 @@
---
icon: material/set-right
---
# Traffic cutover

!!! note "Work in progress"
    This section is a work in progress. Check back soon!

docs/features/sharding/resharding/databases.md

Lines changed: 39 additions & 0 deletions

@@ -0,0 +1,39 @@
---
icon: material/database-plus-outline
---

# New databases

PgDog's strategy for resharding Postgres databases is to create a new, independent cluster of machines and move data over to it in real time. Creating new databases is environment-specific, and PgDog doesn't currently automate this step.

## Requirements

New databases should be **empty**: don't migrate your [table definitions](schema.md) or [data](hash.md); these are taken care of automatically by PgDog. The following items do need to be created manually, however:

1. Database users
2. Database schemas

### Database users

Since PgDog was built to work in cloud-managed environments like AWS RDS, we don't usually have access to the `pg_shadow` view, which contains password hashes. Tools like [`pg_dumpall`](https://www.postgresql.org/docs/current/app-pg-dumpall.html) therefore can't export roles, and PgDog can't automatically migrate users to the new database.

For this reason, migrating users to the new database cluster is currently **not supported** and is the responsibility of the operator.

Make sure to create all the necessary Postgres users and roles before proceeding to the [next step](schema.md).
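
For example, a hypothetical application role could be re-created on the new cluster like this; the role name, password, and grant are illustrative, not something PgDog requires:

```postgresql
-- Illustrative sketch: re-create application roles on the destination cluster.
-- Role name, password, and privileges are hypothetical.
CREATE ROLE app_user LOGIN PASSWORD 'change-me';
GRANT CONNECT ON DATABASE prod TO app_user;
```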

### Database schemas

!!! note ":material-account-hard-hat: Work in progress"
    This step will be automated by a future version of PgDog.

Before running the [schema sync](schema.md), make sure to re-create all of your existing schemas on the new databases. You can take advantage of [cross-shard DDL](../cross-shard.md#create-alter-drop) queries to make this easier.

The `public` schema is created by default in all databases, so if you aren't using any additional schemas, you can skip this step.
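
For example, assuming the source database uses a hypothetical `analytics` schema, running the statement through PgDog applies it to every shard via cross-shard DDL:

```postgresql
-- Run through PgDog: cross-shard DDL executes this on every shard.
-- The schema name is hypothetical.
CREATE SCHEMA IF NOT EXISTS analytics;
```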

## Multiple Postgres databases

If you're operating multiple Postgres databases on the same database server, they will need to be resharded separately. Logical replication, which PgDog uses to move data, operates at the level of a single Postgres database.

## Next steps

- [Schema sync](schema.md)

docs/features/sharding/resharding/hash.md

Lines changed: 9 additions & 5 deletions

@@ -1,4 +1,8 @@
-# Hash resharding
+---
+icon: material/database-export-outline
+---
+
+# Move data

If you're using the `HASH` sharding function, adding a new node to the cluster will change the modulo by 1. The number returned by the hash function is uniformly distributed across the entire integer range, which makes it considerably larger than the modulo. Therefore, changing the modulo will more often than not result in most rows being remapped to different shard numbers.
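
To see the effect, here's a minimal sketch using one of Postgres's built-in hash functions as a stand-in (PgDog's actual sharding hash may differ): going from 3 to 4 shards changes the shard number for most of the rows.

```postgresql
-- Illustration only: hashint8 stands in for the sharding hash.
SELECT id,
       abs(hashint8(id)) % 3 AS shard_with_3_nodes,
       abs(hashint8(id)) % 4 AS shard_with_4_nodes
FROM generate_series(1, 10) AS id;
```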

@@ -23,16 +27,16 @@ PgDog's strategy for resharding is to **move data** from an existing cluster to

## Data sync

-Moving data online is a 2 step process:
+Moving data online is a 2-step process:

1. Copy data from tables using Postgres `COPY`
-2. Stream real time changes using logical replication
+2. Stream real-time changes using logical replication

To make sure no rows are lost in the process, PgDog follows a strategy similar to the one used by Postgres in logical replication subscriptions, with some improvements.

### Copying tables

-Copying table data from source database cluster is done using Postgres `COPY` and logical replication slots. This is implemented in the `data-sync` command:
+Copying table data from the source database cluster is done using Postgres `COPY` and logical replication slots. This is implemented in the `data-sync` command:

```bash
pgdog data-sync --help
```

@@ -50,5 +54,5 @@ All databases and users must be configured in `pgdog.toml` and `users.toml`.

### Real time changes

-After data sync is complete, changes for all tables in the publication will be streamed in real time. Keep this connection
+After data sync is complete, changes for all tables in the publication will be streamed in real-time. Keep this connection
open until you are ready to cut traffic over to the new database cluster.
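
While that connection is open, replication progress can be observed on the source database with standard Postgres catalog views; this query is a sketch, not part of PgDog's tooling, and slot names depend on what the data sync created:

```postgresql
-- Run on the source database: shows how far behind each logical slot is.
SELECT slot_name,
       confirmed_flush_lsn,
       pg_wal_lsn_diff(pg_current_wal_lsn(), confirmed_flush_lsn) AS bytes_behind
FROM pg_replication_slots
WHERE slot_type = 'logical';
```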

docs/features/sharding/resharding/index.md

Lines changed: 28 additions & 5 deletions

@@ -1,12 +1,35 @@
-# Resharding overview
+---
+icon: material/set-split
+---
+
+# Resharding Postgres

!!! note
    This feature is a work in progress. Support for resharding with logical replication was started in [#279](https://github.com/pgdogdev/pgdog/pull/279).

-Resharding adds more nodes to an existing database cluster, spreading the data evenly between all machines. Depending on which [sharding function](../sharding-functions.md) is used, this may require recomputing shard numbers for all rows and moving them between databases.
+Resharding changes the number of shards in an existing database cluster to add or remove capacity. To make this less disruptive to production operations, PgDog's strategy is to create a new database cluster and reshard data in-flight, while moving it to the new databases.
+
+To make this an online process, with zero downtime or data loss, PgDog hooks into the logical replication protocol used by PostgreSQL and reroutes messages between nodes to create and update rows in real time.
+
+<center>
+    <img src="/images/resharding-arch-1.png" width="90%" height="auto" alt="Resharding architecture">
+</center>
+
+## Step by step
+
+The resharding process is composed of four independent operations:
+
+1. #### [Create new databases](databases.md)
+2. #### [Synchronize schema](schema.md)
+3. #### [Move data](hash.md)
+4. #### [Cutover traffic](cutover.md)
+
+Each individual step is automated by PgDog; orchestrating them is currently the responsibility of the user.

-## Hash-based resharding
-
-PgDog's strategy for resharding hash-sharded clusters is to create a new cluster with `N x 2` nodes (`N` is the number of nodes in the existing cluster) and move all data to the new cluster without downtime using logical replication.
-
-[**→ Hash resharding**](hash.md)
+## Terminology
+
+| Term | Description |
+|-|-|
+| Source database | The database cluster that's being resharded. It contains all the data and table definitions. |
+| Destination database | The database cluster with the new sharding configuration, to which data is copied from the source database. |
+| Logical replication | Replication protocol available in PostgreSQL since version 10. |

docs/features/sharding/resharding/schema.md

Lines changed: 56 additions & 10 deletions

@@ -1,13 +1,16 @@
+---
+icon: material/database-edit-outline
+---
# Schema sync

-PgDog can copy tables, indexes and other entities from your production database to the new sharded database automatically. To make [data sync](hash.md) as efficient as possible, it splits the schema sync into two parts:
+PgDog can copy tables, indexes and other entities from your production database to the new, sharded database automatically. This is faster than using `pg_dump`, because we separate the process into two parts:

-- Pre-data tables and primary keys
-- Post-data secondary indices
+1. [Create tables](#tables-and-primary-keys), primary key indices, and sequences
+2. Create [secondary indices](#secondary-indices)

-Before syncing data, run the first part to create the necessary tables and primary key constraints. Once data sync is caught up, run the second step to create secondary indexes, sequences and other entities.
+The first step must be performed before [copying data](hash.md); the second step runs once the data sync is almost complete.

-## How it works
+## CLI

PgDog has a command-line interface you can call by running the binary directly. Schema sync is controlled by a CLI command:

@@ -18,13 +21,56 @@ pgdog schema-sync \
    --publication <publication>
```

-Expected and optional parameters for this command are as follows:
+Required (*) and optional parameters for this command are as follows:

| Parameter | Description |
|-|-|
-| `--from-database` | The name of the source database in `pgdog.toml`. |
-| `--to-database` | The name of the destination database in `pgdog.toml`. |
-| `--publication` | The name of the Postgres table publication with the tables you want to sync. |
+| `--from-database`* | The name of the source database in `pgdog.toml`. |
+| `--to-database`* | The name of the destination database in `pgdog.toml`. |
+| `--publication`* | The name of the Postgres table [publication](#publication) with the tables you want to sync. |
| `--dry-run` | Print the SQL statements that would be executed on the destination database and exit. |
| `--ignore-errors` | Execute SQL statements and ignore any errors. |
-| `--data-sync-complete` | Run the post-data step to create secondary indices and sequences. |
+| `--data-sync-complete` | Run the second step to create secondary indices and sequences. |
## Tables and primary keys
36+
37+
The first step in the schema sync copies over tables and their primary key indexes from the source database to the new, resharded cluster. This has to be done separately, because Postgres's logical replication only copies data and doesn't manage table schemas.
38+
39+
### Primary keys
40+
41+
A primary key constraint is **required** on all tables for logical replication to work correctly. Without a unique index identifying each row in a table, logical replication is not able to perform `UPDATE` and `DELETE` commands.
42+
43+
Before starting the resharding process for your database, double-check that you have primary keys on all your tables.
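
If a table is missing one, a surrogate key can be added before resharding. This sketch uses a hypothetical `events` table:

```postgresql
-- Hypothetical table without a primary key: add a surrogate identity column
-- so logical replication can identify rows for UPDATE and DELETE.
ALTER TABLE events ADD COLUMN id BIGINT GENERATED ALWAYS AS IDENTITY;
ALTER TABLE events ADD PRIMARY KEY (id);
```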

## Publication

Since PgDog uses logical replication to move and reshard data, a [publication](https://www.postgresql.org/docs/current/sql-createpublication.html) for the relevant tables needs to be created on the source database.

The simplest way to do this is to run the following command on the **source database**:

```postgresql
CREATE PUBLICATION pgdog FOR ALL TABLES;
```

This makes sure _all_ tables in your database are resharded into the destination database cluster.
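
To double-check which tables the publication covers, you can query the standard Postgres catalog on the source database (this is plain Postgres, not a PgDog feature):

```postgresql
-- List every table included in the publication.
SELECT schemaname, tablename
FROM pg_publication_tables
WHERE pubname = 'pgdog';
```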

!!! note "Multiple schemas"
    If you're using schemas other than `public`, create them on the destination database before running the schema sync.

## Schema admin

Schema sync creates tables, indices, and other entities on the destination database. To make sure that's done by a user with sufficient privileges (e.g., the `CREATE` permission on the database), you need to add one to [`users.toml`](../../../configuration/users.toml/users.md) and mark it as the schema administrator:

```toml
[[users]]
name = "migrator"
database = "prod"
password = "hunter2"
schema_admin = true
```

PgDog will use that user to connect to the source and destination databases, so make sure to specify one for both of them.

## Secondary indices

This step is performed after the [data sync](hash.md) is complete. Running it creates secondary indexes on all your tables, which can take some time.
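
Index builds on large tables can be monitored from another connection using a standard Postgres progress view (available since Postgres 12); this is an observability sketch, not part of PgDog:

```postgresql
-- Watch index creation progress while the second step runs.
SELECT relid::regclass AS table_name,
       index_relid::regclass AS index_name,
       phase,
       round(100.0 * blocks_done / nullif(blocks_total, 0), 1) AS percent_done
FROM pg_stat_progress_create_index;
```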

docs/images/resharding-arch-1.png

75.8 KB