Saturday, February 28, 2015

Replication - How it is working and Architecture

Replication Architecture

REPLICATION ENTITIES

SQL Server replication is based on the “Publish and Subscribe” metaphor. Let us look at each of the individual components in detail.

1.     Publisher

o    It is a source database where replication starts. It makes data available for replication.

o    Publishers define what they publish through a publication.

2.     Article

o    Articles are the actual database objects included in replication like tables, views, indexes, etc.

o    An article can be filtered when sent to the subscriber.

3.     Publication

o    A group of articles is called publication.

o    An article can’t be distributed individually. Hence publication is required.

4.     Distributor

o    It is intermediary between publisher and subscriber.

o    It receives published transactions or snapshots and then stores and forwards these publications to the subscriber.

o    It has 6 system databases including distribution.

5.     Subscriber

o    It is the destination database where replication ends.

o    It can subscribe to multiple publications from multiple publishers.

o    It can send data back to publisher or publish data to other subscribers.

6.     Subscription

o    It is a request by a subscriber to receive a publication.

o    We have two types of subscriptions - push and pull.

7.     Push Subscriptions

o    With this subscription, the publisher is responsible for updating all the changes to the subscriber without the subscriber asking those changes.

o    Push subscriptions are created at the Publisher server

8.     Pull Subscriptions -

o    With this subscription the subscriber initiates the replication instead of the publisher.

o    The subscriptions are created at the Subscriber server.

REPLICATION AGENTS

·         We have discussed that replication process works in the background with the help of jobs.

·         These jobs are also called as agents. These jobs internally uses respective .exe files present in …………….. \110\COM folder.

·         All the agents information is present in Distribution db in the following tables.

o    dbo.MSxxx_agents

o    dbo.MSxxx_history

Snapshot Agent

·         It is an executable file that prepares snapshot files containing schema and data of published tables and db objects.

·         It stores the files in the snapshot folder, and records synchronization jobs in the distribution database.

Distribution Agent

·         It is used with snapshot and transactional replication.

·         It applies the initial snapshot to the Subscriber and moves transactions held in the Distribution db to Subscribers.

·         It runs at either the Distributor for push subscriptions or at the Subscriber for pull subscriptions.

Log Reader Agent

·         It is used with transactional replication, which moves transactions marked for replication from the transaction log on the publisher to the distribution db.

·         Each db has its own Log Reader Agent that runs on the Distributor and connects to the Publisher.

Merge Agent

·         It is used with merge replication.

·         It applies the initial snapshot to the Subscriber and moves incremental data changes that occur.

·         Each merge subscription has its own Merge Agent that connects to both the Publisher and the Subscriber and updates both.

·         It captures changes using triggers.

Queue Reader Agent

·         It is used with transactional replication with the queued updating option.

·         It runs at the Distributor and moves changes made at the Subscriber back to the Publisher.

·         Unlike Distribution Agent and Merge Agent, only one instance of the Queue Reader Agent exists to service all Publishers and publications for a given distribution db.

REPLICATION TYPES

·         Snapshot Replication

·         Transactional Replication

·         Merge Replication

1. Snapshot Replication

·         The snapshot process is commonly used to provide the initial set of data and database objects for transactional and merge publications.

·         It copies and distributes data and database objects exactly as they appear at the current moment of time.

·         Snapshot replication is used to provide the initial data set for transactional and merge replication.

·         It can also be used when complete refreshes of data are appropriate (BOL).

·         Scenarios

·         When the data is not changing frequently.

·         If we want to replicate small amount of data.

·         To replicate Look-up tables which are not changing frequently.

·         It is acceptable to have copies of data that are out of date with respect to the publisher for a period of time

For example, if a sales organization maintains a product price list and the prices are all updated at the same time once or twice each year, replicating the entire snapshot of data after it has changed is recommended.

Snapshot Replication Architecture

Source: BOL

How it Works?

·         Snapshot Agent establish a connection from distributor to publisher and generates fresh snapshot into snapshot folder by placing locks.

·         Snapshot agent writes copy of the table schema for each article to .sch file.

·         Copies data from published table at the Publisher and writes data to the snapshot folder in the form of.bcp file.

·         Appends rows to the Msrepl_commands and Msrepl_transactions.

·         Releases any locks on published tables.

Configuring Replication

1.     Configuring distributor

2.     Configuring publisher

3.     Creating publication of required type

4.     Creating subscription(s)

Step 1: Configuring distributor and publisher

1.     Take three instances

2.     Go to second instance -> Right click on Replication -> Configure Distribution…

3.     Next -> Select ‘SERVER2’ will act as its own distributor;

4.     Next

5.     Next

6.     Next

7.     Uncheck the check box present at Server2 -> Add

8.     Select instance Server1

9.     Next

10.  Enter strong password. (Automatically one login is created in distributor with the name Distributor_Admin)

11.  Next

12.  Next

13.  Finish

Observations

1.     Go to distributor -> Databases -> Find the new database “Distribution

2.     Go to Security -> Logins -> Find a new login “Distributor_admin

3.     Go to Server Objects -> Linked servers -> Find new linked server “repl_distributor

4.     Right Click on Replication -> Select distributor Properties

o    Transactions stored in distribution database are removed after 72 hrs and agents history is removed after 48 hrs.

o    To view snapshot folder path -> Click on publishers -> click on browse button (…) present to right side of publisher name.

o    Go to SQL Server Agent -> Jobs -> Find 6 new jobs are created automatically.

Step 2: Creating Snapshot Publication

1.     Go to publisher (Server1) -> Replication -> Right Click on Local Publications -> New publication.

2.     Next

3.     Select second option -> Click on Add -> Select Distributor instance (Server2)

4.     Connect ? Next

5.     Enter password of Distributor_admin login which we have mentioned while configuring publisher.

6.     Next

7.     Select required database. For example SSISDb

8.     Next

9.     Select “Snapshot Publication” -> Next

10.  Select required tables -> Next

11.  Next -> Next

12.  Select the check box to create snapshot as follows

13.  Next

14.  Click on security settings

15.  Select as follows

16.  OK

17.  Next

18.  Next -> Next

19.  Enter publication name as follows

20.  Finish

Observations

1.     Go to publisher -> Replication -> Local publications -> Find new publication is created

2.     To check snapshot was created or not -> Right click on the publication (SSISDBSP) -> View Snapshot Agent Status

3.     Go to repldata folder as follows:

4.     Go to sub folders find the snapshot files (.bcp, .sch, idx, .trg)

5.     Go to distributor -> SQL Server Agent -> Jobs -> Find snapshot agent job was created

 

FAQ: How to display database names which consists of publications?

Ans: Go to publisher -> take new query ->

Copy Code

select name from sys.databases where is_published=1 or is_subscribed=1

 

Creating Subscription

1.     Go to publisher -> Replication -> Local Publications -> Right Click on SSISDBSP -> New Subscription

2.     Next

3.     Select the publication name: SSISDBSP

4.     Next

5.     Select Push subscriptions

6.     Next

7.     Add Subscriber -> Select third instance (Server1\test) -> Connect

8.     Next

9.     Under Subscription Database if there is no database exists with same name -> Select New database -> Enter Database Name -> OK -> Next

10.  Click on browse button (…) under distribution agent security page.

11.  Select “Run under Agent Service Account” and “By impersonating the process account” options as both distributor and subscriber’s service accounts are same. If the service account of subscriber is different, then create a login in subscriber with sysadmin privileges then mention that login details.

12.  Next

13.  Under Agent Schedule -> Select “Run Continuously”

14.  Under Initialize when select -> Immediately

15.  Next -> Next -> Finish

Observations

1.     Go to subscriber -> SSISDB -> Tables -> Find two tables are created

2.     Go to distributor -> SQL Server Agent -> Find new job is created, related to Distribution Agent

Verifying Replication

1.     Go to publisher perform some changes in any table present in publication

2.     Go to distributor run Snapshot Agent Job

3.     Go to subscriber observe the changes in the respective table

FAQ: How many articles may be there in a snapshot publication? 
32767

FAQ: Max columns in a table? 
1000

 

 

This e-mail and any files transmitted with it are for the sole use of the intended recipient(s) and may contain confidential and privileged information. If you are not the intended recipient(s), please reply to the sender and destroy all copies of the original message. Any unauthorized review, use, disclosure, dissemination, forwarding, printing or copying of this email, and/or any action taken in reliance on the contents of this e-mail is strictly prohibited and may be unlawful. Where permitted by applicable law, this e-mail and other e-mail communications sent to and from Cognizant e-mail addresses may be monitored.

1 comment:

  1. Remarkable article, it is particularly useful! I quietly began in this, and I'm becoming more acquainted with it better! Delights, keep doing more and extra impressive! https://dbol-cycle.com/when-should-you-take-dianabol/

    ReplyDelete