Skip to main content

Data Replication Overview

Data Replication in Data Hub copies data from your source system into PostgreSQL tables in the target, keeping both in sync. It extracts data from the source, loads it into the target as-is, and lets you transform it there without affecting the source system.

You set up replication after establishing a connection in Data Hub. Once a connection is in place, predefined tasks are created automatically based on the configured connection, with the relevant tables pulled in. You can also create additional tasks to define which data to synchronize between sources and targets, and how often.

To create, schedule, edit, or delete replication tasks, you must have an Admin, Designer, or Power User role.

How Replication Works

Data Hub replicates data in two stages:

Full Replication Incremental Replication

When it runs

The first time a task executes

All subsequent runs

What it copies

The complete data set - all historical records for the selected tables

Only records that are new or have changed since the last run

Requirements

None

Primary keys must be configured on the source tables. If not, the task falls back to full replication each time.

Performance

Can take time depending on data volume. Plan the first run during off-peak hours.

Fast and lightweight, reduces load on both source and target.

Delete handling

Not applicable

By default, Data Hub removes records from the target that no longer exist in the source. For large tables where this affects performance, use the Disable Advance Delete option in the table settings when creating the task. Best suited for tables where deletions are rare or handled by the source system.

Multi-environment Support

Your organization is provisioned with a default environment, which has no ERP data until you assign a data source to it. Each environment maps to one connection. If your subscription includes multiple connections, you can provision a new environment for each one. Each environment maintains its own replication tasks independently, so you can build and test in a non-production environment before replicating the same configuration to your primary one. Switch between environments directly from the Replication Tasks page, no separate login required.

Monitoring and Notifications

Data Hub tracks the progress and history of every replication task, so you can quickly identify issues and review past runs. You can also configure email notifications to stay informed when a task completes, fails, or is skipped.

Was this article helpful?

We're sorry to hear that.