Conversation

taroface (Contributor) commented Oct 1, 2025

DOC-13338
DOC-14748

This PR is still WIP.

Notes for reviewers:

| Page | Please review | Notes |
| --- | --- | --- |
| Load and Replicate | Entire flow, but focus on Replicator setup, usage, troubleshooting | Fetch content is pre-existing |
| Migration Failback | Entire flow | This was completely rewritten |
| Resume Replication | Replicator usage, any missing context/caveats about resuming | Structure is still rough |
| MOLT Replicator | Whole page | Structure is WIP. Usage section is still barebones. I need to think about a good way to present the flags per dialect. |
| MOLT Fetch | Check for content that should be removed/moved to Replicator | I think I caught everything, but may not understand something |



netlify bot commented Oct 1, 2025

Netlify Preview

🔨 Latest commit: c84c05c
🔍 Latest deploy log: https://app.netlify.com/projects/cockroachdb-docs/deploys/68dd38525814220008185119
😎 Deploy Preview: https://deploy-preview-20465--cockroachdb-docs.netlify.app

@taroface taroface changed the title [wip] MOLT Replicator draft docs [WIP] MOLT Replicator draft docs Oct 1, 2025

<section class="filter-content" markdown="1" data-scope="mysql">
For MySQL **8.0 and later** sources, enable [global transaction identifiers (GTID)](https://dev.mysql.com/doc/refman/8.0/en/replication-options-gtids.html) consistency and configure binary logging. Set `binlog-row-metadata` or `binlog-row-image` to `full` to provide complete metadata for replication. Set the following values in `mysql.cnf`, in the SQL shell, or as flags in the `mysql` start command:
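A minimal sketch of those settings (assuming standard MySQL 8.0 variable names; the doc's final list may differ):

~~~
# mysql.cnf — sketch of the relevant settings
[mysqld]
gtid-mode=ON
enforce-gtid-consistency=ON
binlog-row-metadata=FULL
~~~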


I think it may be worth calling out that it's also important to tune binlog retention: https://dba.stackexchange.com/a/206602

This affects whether the data from the GTID you specify is still available or has already been purged/rotated. It's important to note that if you're using something like AWS RDS or GCP Cloud SQL, there are provider-specific ways to handle this:
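For example (hedged; the exact retention settings depend on the MySQL version and provider):

~~~ sql
-- Self-managed MySQL 8.0: keep binary logs for 24 hours
SET GLOBAL binlog_expire_logs_seconds = 86400;

-- AWS RDS / Aurora MySQL: retention is set through a stored procedure
CALL mysql.rds_set_configuration('binlog retention hours', 24);
~~~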

ryanluu12345 left a comment:

Excellent work @taroface. Not an easy doc to write, but you made it understandable and clean! Let's bottom out on some of these discussions and ensure the deprecation effort from @tuansydau reflects the reality of what we are documenting.

Enable [global transaction identifiers (GTID)](https://dev.mysql.com/doc/refman/8.0/en/replication-options-gtids.html) and configure binary logging. Set `binlog-row-metadata` or `binlog-row-image` to `full` to provide complete metadata for replication.

{{site.data.alerts.callout_info}}
GTID replication sends all database changes to Replicator. To limit replication to specific tables or schemas, use the `--table-filter` and `--schema-filter` flags in the `replicator` command.


Just a note that `--schema-filter` and `--table-filter` are not supported for replicator. This use case will actually require a userscript. Given we don't have userscripts documented right now, wondering how you want to proceed here? CC @Jeremyyang920 @rohan-joshi


Use the `Executed_Gtid_Set` value for the `--defaultGTIDSet` flag in MOLT Replicator.

To verify that a GTID set is valid and not purged, use the following queries:
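A hedged sketch of such checks (standard MySQL GTID functions; the actual queries in the doc may differ):

~~~ sql
-- 1 means every purged transaction is already contained in your GTID set,
-- so the binary logs still cover everything after that point
SELECT GTID_SUBSET(@@GLOBAL.gtid_purged, '<gtid_set>') AS purged_is_covered;

-- The GTID set should also be a subset of what the server has executed
SELECT GTID_SUBSET('<gtid_set>', @@GLOBAL.gtid_executed) AS is_executed;
~~~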


Great, this section will be really helpful and would have helped some folks sanity check before raising an issue.


Use the `Executed_Gtid_Set` value for the `--defaultGTIDSet` flag in MOLT Replicator.


Just a note that this value will only be used if there is no GTID in the memo table, which is in the staging database (i.e., `_replicator`). Otherwise, it will use the one in the memo table and keep track of advancing GTID checkpoints in memo. Is this called out elsewhere, or can we add a line about this here?

To force the system to respect the `defaultGTIDSet` you pass in, you can just clear the memo table and it will be as if it's a fresh run.
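A hedged sketch of that reset (assuming, per the note above, that the staging schema is `_replicator` and the checkpoint table is `memo`):

~~~ sql
-- Clears saved checkpoints so the next run falls back to --defaultGTIDSet
DELETE FROM _replicator.memo;
~~~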

</section>

<section class="filter-content" markdown="1" data-scope="oracle">
##### Enable ARCHIVELOG and FORCE LOGGING
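A minimal sketch of what this section covers (standard Oracle commands; the doc's exact steps are under review below):

~~~ sql
-- Check the current settings
SELECT log_mode, force_logging FROM v$database;

-- Enable ARCHIVELOG mode (requires restarting to MOUNT state)
SHUTDOWN IMMEDIATE;
STARTUP MOUNT;
ALTER DATABASE ARCHIVELOG;
ALTER DATABASE OPEN;

-- Log all changes, regardless of NOLOGGING settings
ALTER DATABASE FORCE LOGGING;
~~~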


Deferring to @noelcrl to review the correctness here.


Looks correct overall, will clean up these commands a bit and clarify what is happening.

~~~ shell
--source 'postgres://migration_user:password@localhost:5432/molt?sslmode=verify-full'
~~~

The source connection must point to the PostgreSQL primary instance, not a read replica.


Well, we do have a flag that can still skip replication setup for cases where folks just want a data load and don't need any replication setup or information. Should we clarify this? CC @Jeremyyang920

### Replicator metrics

By default, MOLT Replicator exports [Prometheus](https://prometheus.io/) metrics at the address specified by `--metricsAddr` (default `:30005`) at the path `/_/varz`. For example: `http://localhost:30005/_/varz`.


Did we decide on referring to it as MOLT Replicator generally from now on? CC @rohan-joshi @Jeremyyang920

### Replicator metrics

By default, MOLT Replicator exports [Prometheus](https://prometheus.io/) metrics at the address specified by `--metricsAddr` (default `:30005`) at the path `/_/varz`. For example: `http://localhost:30005/_/varz`.


Looking at the code, I see that Replicator doesn't actually set a default for `metricsAddr`, which means that metrics are not enabled by default. This is stale information, since the MOLT wrapper used to set `metricsAddr` to 30005. I think we should call out that the default behavior is to not spin up metrics, but that you can set it to a port (`:30005` recommended).

Here is the code snippet that made me realize this:

~~~ go
cmd.Flags().StringVar(&metricsAddr, "metricsAddr", "", "start a metrics server")
~~~
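For example, a hedged sketch of opting in to metrics (the flag is confirmed by the snippet above; the endpoint path comes from the doc text, and the rest of the invocation is elided):

~~~ shell
# Opt in to metrics by binding the metrics server to a port
replicator mylogical ... --metricsAddr :30005

# Scrape the Prometheus endpoint
curl http://localhost:30005/_/varz
~~~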


{% include molt/molt-setup.md %}

## Start Fetch


So an important note here is that, as part of the deprecation of the wrapper, we're mainly removing the invocations of Replicator from MOLT. However, there is some source-database replication setup that we'll still need to perform for PostgreSQL specifically. The reason we have to do this is that we need to create the slot at the time we actually do the snapshot export, so we don't have gaps in data.

That means we still need to document the behavior when we set certain `pg-*` flags for setting publications and slots, and the relevant drop/recreate behavior. I think we'll need to discuss this a bit more in the next team meeting to clearly lay out what the behavior still is. CC @tuansydau @Jeremyyang920
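For context, a hedged sketch of the PostgreSQL objects involved (object names are illustrative, not the doc's):

~~~ sql
-- Publication covering the tables to replicate
CREATE PUBLICATION molt_pub FOR ALL TABLES;

-- Create the logical replication slot at snapshot-export time,
-- so there is no gap between the data load and replication
SELECT pg_create_logical_replication_slot('molt_slot', 'pgoutput');
~~~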

</section>

<section class="filter-content" markdown="1" data-scope="mysql">
Use the `replicator mylogical` command. Replicator will automatically use the saved GTID from the staging schema, or fall back to the specified `--defaultGTIDSet` if no saved state exists.
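For instance, a hedged sketch of such an invocation (only `--defaultGTIDSet` and `--targetConn` appear in the doc text; the other flag names and all values are assumptions):

~~~ shell
replicator mylogical \
  --sourceConn 'mysql://migration_user:password@localhost:3306/molt' \
  --targetConn 'postgresql://root@localhost:26257/defaultdb?sslmode=verify-full' \
  --defaultGTIDSet 'source_uuid:1-100'   # used only when no saved GTID exists
~~~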


Super nit: say "the saved GTID from the staging schema's memo table," if they want to know where to look.


MOLT Replicator continuously replicates changes from source databases to CockroachDB as part of a [database migration]({% link molt/migration-overview.md %}). It supports live ongoing migrations to CockroachDB from a source database, and enables backfill from CockroachDB to your source database for failback scenarios to preserve a rollback option during a migration window.

MOLT Replicator consumes change data from CockroachDB changefeeds, PostgreSQL logical replication streams, MySQL GTID-based replication, and Oracle LogMiner. It applies changes to target databases while maintaining configurable consistency {% comment %}and transaction boundaries{% endcomment %}, and features an embedded TypeScript/JavaScript environment for configuration and live data transforms.


Super nit: MOLT Replicator also consumes

## Prepare the CockroachDB cluster

{{site.data.alerts.callout_success}}
For details on enabling CockroachDB changefeeds, refer to [Create and Configure Changefeeds]({% link {{ site.current_cloud_version }}/create-and-configure-changefeeds.md %}).


We need to also ensure that the license and organization are set:

~~~ sql
SET CLUSTER SETTING cluster.organization = 'organization';
SET CLUSTER SETTING enterprise.license = '$LICENSE';
~~~

~~~
--source 'postgres://crdb_user@localhost:26257/defaultdb?sslmode=verify-full'
~~~
For failback, MOLT Replicator uses `--targetConn` to specify the original source database and `--stagingConn` for the CockroachDB staging database.


Hmm, this might be confusing now, since we don't have to explain it in terms of source or target for the data load portion. I think it may be clearer here to describe the target connection as the destination you want the data to go to from CockroachDB sources.

MOLT Fetch replication modes will be deprecated in favor of a separate replication workflow in an upcoming release. This includes the `data-load-and-replication`, `replication-only`, and `failback` modes.
{{site.data.alerts.end}}

Use `data-load-and-replication` mode to perform a one-time bulk load of source data and start continuous replication in a single command.


@taroface just want to note that, given we're removing this specific mode in MOLT, I want to make sure we still document what exactly these modes look like if people want to run them manually. I'll go ahead and describe what replication-only and data-load-and-replication look like. Can you please take an action to move this content into the proper section for this new doc? I trust you with figuring out the appropriate location.

CRDB data load and replication:

1. Before the data load, get the latest MVCC timestamp so you have the consistent point:

   ~~~ sql
   SELECT cluster_logical_timestamp();
   ~~~

   ~~~
     cluster_logical_timestamp
   ----------------------------------
     1759848027465101000.0000000000
   ~~~

2. Create your changefeed with `cursor` set to the value from above. The changefeed will then send data starting from that MVCC timestamp (see the sketch after this list).

For replication-only, you can just create the changefeed and it will start sending data from "now". However, if you want to send data from a previous time, you can pass in the proper MVCC timestamp, which has the format shown above.

Important note: make sure that the GC TTL is set appropriately, so the data from the cursor you're using is still valid: https://www.cockroachlabs.com/docs/stable/protect-changefeed-data
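A hedged sketch of the changefeed creation in step 2 (the table names, sink URL, path, and options are illustrative assumptions; Replicator acts as the changefeed's webhook endpoint in failback setups):

~~~ sql
CREATE CHANGEFEED FOR TABLE molt.tbl1, molt.tbl2
  INTO 'webhook-https://localhost:30004/molt/public?insecure_tls_skip_verify=true'
  WITH updated, resolved = '1s',
       cursor = '1759848027465101000.0000000000';
~~~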

noelcrl (Contributor) commented Oct 7, 2025:

To add to the GC detail: this is important to ensure that the changes from back in time, where the cursor is, are still valid and can be consumed by a changefeed.

Configure GC TTL for a data export or migration

Before starting a data export or migration with MOLT, make sure the GC TTL for the source database is long enough to cover the full duration of the process (for example, the total time it takes for the initial data load). This ensures that historical data remains available from the changefeed when replication begins.

~~~ sql
-- Increase GC TTL to 24 hours (example)
ALTER DATABASE <database_name> CONFIGURE ZONE USING gc.ttlseconds = 86400;
~~~

Once the changefeed or replication has started successfully (which automatically protects its own data range), you can safely lower the TTL again, if necessary, to resume normal garbage collection:

~~~ sql
-- Restore GC TTL to 5 minutes
ALTER DATABASE <database_name> CONFIGURE ZONE USING gc.ttlseconds = 300;
~~~

Note: the time in seconds will depend on the user's expected time for the initial data load, and it must be higher than that number.

---
title: Load and Replicate


A more general note than the one below is that the customer should ensure that they get the proper replication consistent point BEFORE they do a data load, so that we can ensure we don't have any data gaps.

I wonder if we can make a callout that if folks want to do the full data load and replication and have consistency, they should:

1. First, gather the consistent point (which we document in the Replicator setup sections; a hedged sketch of these queries follows below):
   - SCN for Oracle
   - LSN for PostgreSQL
   - GTID for MySQL
   - Cursor for CockroachDB
2. Second, run the data load until completion.
3. Third, run replication from the consistent points obtained in the steps above.

CC @tuansydau I think it's fairly crucial we log these out for users so they can at least have information on where they should start from.
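A sketch of those consistent-point queries (standard commands for each source; the Replicator setup sections are the authoritative reference):

~~~ sql
-- Oracle: current SCN
SELECT CURRENT_SCN FROM V$DATABASE;

-- PostgreSQL: current WAL LSN
SELECT pg_current_wal_lsn();

-- MySQL: executed GTID set
SELECT @@GLOBAL.gtid_executed;

-- CockroachDB: cluster logical timestamp (changefeed cursor)
SELECT cluster_logical_timestamp();
~~~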

Comment on lines +270 to +274:

~~~ sql
ALTER DATABASE ADD SUPPLEMENTAL LOG DATA (PRIMARY KEY) COLUMNS;

-- Verify supplemental logging
SELECT supplemental_log_data_min, supplemental_log_data_pk FROM v$database;
-- Expected: SUPPLEMENTAL_LOG_DATA_MIN: IMPLICIT (or YES), SUPPLEMENTAL_LOG_DATA_PK: YES
~~~

Suggested change:

~~~ sql
-- Enable minimal supplemental logging for primary keys
ALTER DATABASE ADD SUPPLEMENTAL LOG DATA (PRIMARY KEY) COLUMNS;

-- Verify supplemental logging status
SELECT supplemental_log_data_min, supplemental_log_data_pk FROM v$database;
-- Expected:
-- SUPPLEMENTAL_LOG_DATA_MIN: IMPLICIT (or YES)
-- SUPPLEMENTAL_LOG_DATA_PK: YES
~~~

~~~ sql
SELECT MIN(t.START_SCNB) FROM V$TRANSACTION t;
~~~

Use the results as follows:

noelcrl (Contributor) commented Oct 7, 2025:

Suggested change: replace "Use the results as follows:" with "Use the query results by providing the following flag values to `replicator`:"

Comment on lines +386 to +391:

~~~ sql
-- Query the current SCN from Oracle
SELECT CURRENT_SCN FROM V$DATABASE;

-- Query the starting SCN of the earliest active transaction
SELECT MIN(t.START_SCNB) FROM V$TRANSACTION t;
~~~

noelcrl (Contributor) commented Oct 7, 2025:

There could be a correctness issue here; the following should work instead:

~~~ sql
-- 1) Capture an SCN before inspecting active transactions
SELECT CURRENT_SCN AS before_active_scn FROM V$DATABASE;

-- 2) Find the earliest active transaction start SCN
SELECT MIN(t.START_SCNB) AS earliest_active_scn FROM V$TRANSACTION t;

-- 3) Capture the snapshot SCN after the checks
SELECT CURRENT_SCN AS snapshot_scn FROM V$DATABASE;
~~~

Comment on lines +395 to +396:

- `--scn`: Use the result from the first query (current SCN)
- `--backfillFromSCN`: Use the result from the second query (earliest active transaction SCN). If the second query returns no results, use the result from the first query instead.

noelcrl (Contributor) commented Oct 7, 2025:

Suggested change:

Compute the flags for `replicator` as follows:

- `--backfillFromSCN`: use the smaller value between `before_active_scn` and `earliest_active_scn`. If `earliest_active_scn` has no value, use `before_active_scn`.
- `--scn`: use `snapshot_scn`.

Make sure `--scn` is greater than or equal to `--backfillFromSCN`.
