Data Replication



Definition

Data replication is a design refactoring technique in which persistent data is copied locally (i.e., replicated) in order to increase performance.

Discussion

Sometimes, persistent data is used significantly more often than it is updated under conditions, whereby the time required to “unnecessarily” reobtaining the data across a network is prohibitive and may result in the failure of performance (e.g., response time and throughput requirements). However, the locally replicated data may easily become inconsistent with the original data stored in a database when the original data is updated or deleted. Also, if the same data is replicated in multiple locations, it must be updated in all of these locations in a timely manner, and this can cause performance problems with other tasks. If updates to persistent data occur only infrequently, then their timing may possibly be scheduled for periods of low transaction volumes. Data replication can reduce network communication volume and even costs when users are charged for network transactions. Data replication is much easier to implement when a database management system (DBMS) directly supports data replication and ensures the consistency of replicated data.

Objectives

The typical objectives of data replication are to:

Preconditions

Data replication can typically begin when the following preconditions hold:

Completion Criteria

Data replication is typically complete if the following postconditions hold:

Steps

When using the data replication technique, members of the database team typically perform the following steps:

Work Products

Data replication typically results in the following work products:

Limitations

Data replication is typically subject to the following limitations:

Guidelines