DRP Run Book (AGS)
The Aguascalientes project requires disaster recovery operation; therefore, this document intended to enable and guide the operation teams to running technical operations in case of a disaster or an emergency event in the main site (C5). The document will describe step by step the process of moving the CityShob products from the main site (C5) to the Disaster Recovery (DR) site.
The document includes the following:
• DR Architecture from the aspects of components (infrastructure, Application, Hardware and Software).
• System Functionality available in DR site.
• DR scenarios – step by step operation guide.
There are 4 Time frames that are handled by the Run Book:
• Normal operation – C5 site is active and data is synced to DR
• Disaster Scenario – Disaster in C5 that requires Activating the DR Site, C5 is down
• Normal Operation at DR Site – DR site is active and data is synced to C5
• Fail Back – failing back to C5 Site. After this step we are back to “normal operation”
The Run Book is containing all the needed information, procedure (work order including prerequisite operation) to make operation teams be able to handle disaster event and to meet the customer requirements in terms of Move-to-DR and Move-Back-to-C5 for operational continuity.
For each component in the system, there is a set of activities that need to be taken in order to perform the failover/failback between the C5 and DR sites.
Most of the activities are automatic while some of them will be manual, each of the scenarios will have the information of the steps that need to be done, and how to activate/execute each of the steps.
The following table describes the list of functionalities exist in the Main site and in the DR site.
Remark: Not all the functionalities in the C5 site are available in the DR site. This is part of the project definition.
Functionality |
Main (C5) |
DRP (C4) |
C-Insight |
V |
V |
C-SHARE |
V |
V |
C-REACT |
V |
V |
Video System |
V |
V |
Video Analytic |
V |
V |
CAD (SafetyNET) |
V |
X |
Panic Button (AVAYA) |
V |
X |
Face Recognition |
V |
X |
LPR |
V |
X |
Public Transporting |
V |
V |
OSINT (Tangles) |
V |
X |
During normal operation, the system C5 is fully operative and the entire CityShob products data is replicated to the DR site constantly to the big-data servers and storage in the DRP site.
During this time, the DR site is offline and not providing service.
C-Insight clients in C5 and C2 are connected to C5 DC and all functionalities are available.
This scenario to be activated in case of a disaster or an emergency event in the main site (C5) and activate this scenario should be decided and communicated by the C5 manager or C5 management.
Activating this scenario will require the operation teams to execute the run book scenario step-by-step in order to make the DR site active.
The “Move-to-DR” process may take up to 8 hours and during this time the service will be down.
Upon completing the process, the DR site becomes the active site and all C-Insight clients in C5, C2 and DR are connected to DR DC and only DR functionalities will be available (see the table in Section 1.3).
The C5 site is offline and cannot be activated as long the DRP site is running as the Active Site.
Once C5 is restored from the disaster or an emergency event and the C5 manager decides it time to move the service back to the main site, the operation teams require to execute the run book scenario step-by-step in order to make the C5 site active again.
The “Move-Back-To-Main” process may take up to 8 hours and during this time the service will be down.
The process will include synchronizing all new data updated in DR while it was active back to C5 so no data will be lost.
Once the C5 site back to be the active site, all C-Insight clients in C5, C2 and DR are connected to C5 DC with full system functionalities (see the table in Section 1.3).
The DRP site is offline and data is synchronized from the C5 site.
During normal operation the data is constantly replicated from main to DR site.
Following steps show how to verify data replication is working properly.
- Login to SQL Server in DRP using the SQL Management studio
- Verify Vulcan and VulcanAuditDB are in Synchronizing status
- Login to server “DRP-INS-SRV1” and open CIT application
- Go to “Administrator” Tab 🡪 “Open Main To DR”
- Enter user, password and SQL Server instance of the DRP site
User name: drprunner
Password: DRP!ex1234
- Click “Sync Status” and verify C5 is active.
The Elastic Search data replication service is located in C5 SQL server (C5-SQL-SRV01)
- Login to SQL Server in C5
- Open windows services and verify “ElasticsearchMaintenanceService” is running
- Service logs is in 🡪 C:\C-INSIGHT\ElasticsearchMaintenance\logs
- Open last log and verify there are no errors.
Following describe the technical step by step to move the system to the DRP site.
Stage: |
Name |
Description |
Stage 1 |
IT Preparation |
IT Team configure and verify DR site readiness. |
Stage 2 |
Stop C5 |
Stop all C-Insight Application servers in C5 site. Full system down time! |
Stage 3 |
Move To DR |
This step will close the SQL Replication and start SQL Server in DR site with the correct configuration. |
Stage 4 |
Start DR Site |
Use PDQ to start all the C-Insight application servers in DR and validate server side is running correctly using CIT |
Stage 5 |
Setting C-Insight Clients to DR |
Use PDQ to reconfigure all clients in C2, C5 and C4 to connect to Application servers in DR Site |
Stage 6 |
Testing |
Run sanity test to validate all DR functionality |
Make sure IT will perform the following:
- VMS – the Video Management system should be fully operating from DR site and all the live stream from the outdoor is redirected by the network to the DRP site.
- Verify DDN is available in DRP
- DNS Routing for C-REACT is redirected by Firewall to DR site
- DRP DMZ VLAN is open to internet for C-SHARE, C-REACT and Public Transportation services.
Remark: if the phase is not completed and verify, CityShob products will not works.
- Login to server “DRP-Maint-SRV01” and open PDQ application.
- Execute Job 🡪 DRP RUN BOOK 🡪 MOVE TO DRP 🡪 STOP C5 APP SRV
on all C-Insight application server in C5 site.
** If C5 site is not available / down then skip this step.
- Login to C5-SQL-SRV01 Server in C5
- Open windows services and stop “ElasticsearchMaintenanceService” service
Following steps should be done from the DR site:
- Login to server “DRP-INS-SRV1” and open CIT application
- Go to “Administrator” Tab 🡪 “Open Main To DR”
- Enter user, password and SQL Server instance of the DRP site
User name: drprunner
Password: DRP!ex1234
- Click “Move To DR” and wait until all steps completed with “Pass” result.
- Login to server “DRP-Maint-SRV01” and open PDQ application.
- Execute Job 🡪 DRP RUN BOOK 🡪 MOVE TO DRP 🡪 START DRP APP SRV
** Executed this job on all DRP-INS-SRV* servers
- Login to server “DRP-INS-SRV1” and open CIT application
- In CIT go to “Project” Tab 🡪 Open from Server
Enter user, password and SQL Server details in DRP
- Go to Management 🡪 Select the “Fit” button to display all BU’s
- Verify all BU’s are green:
- Login to server “DRP-Maint-SRV01” and open PDQ application.
- Execute Job 🡪 DRP RUN BOOK 🡪 MOVE TO DRP 🡪 SET CLIENT TO DRP DC
on all C-Insight clients (C5, C2, C4).
- Run system Sanity test
Following describe the technical step by step moving system back to main site.
Stage: |
Name |
Description |
Stage 1 |
IT Preparation |
IT Team configure and verify C5 site readiness. |
Stage 2 |
Stop DR |
Stop all C-Insight Application servers in DR site. Full system down time! |
Stage 3 |
Data Replication |
Complete Data Replication back to C5 |
Stage 4 |
Move to C5 |
This step will perform full SQL Data restore from DR back to SQL Server in C5 and reconfigure the SQL Replication for normal operation. |
Stage 5 |
Start C5 Site |
Use PDQ to start all the C-Insight application servers in C5 and validate server side is running correctly using CIT |
Stage 6 |
Setting C-Insight Clients to C5 |
Use PDQ to reconfigure all clients in C2, C5 and C4 to connect to Application servers in C5 Site |
Stage 7 |
Testing |
Run sanity test to validate all DR functionality |
Make sure IT will perform the following:
- VMS – the Video Management system should be fully operating and all the live stream from the outdoor is redirected by the network to the main site.
- Verify DDN is available in C5
- DNS Routing for C-REACT is redirected by Firewall to C5 site
- C5 DMZ VLAN is open to internet for C-SHARE, C-REACT and Public Transportation services.
- Login to server “DRP-Maint-SRV01” and open PDQ application.
- Execute Job 🡪 DRP RUN BOOK 🡪 MOVE BACK TO C5 🡪 STOP DRP APP SRV
on all C-Insight application server in DRP site.
** Executed this job on all DRP-INS-SRV* servers
- Login to server DRP-SQL-SRV01
- Open windows services – and start service: “ElasticsearchMaintenanceService”
- Wait until all data replicate back from DR site to main.
- Check the logs for replication completed 🡪 C:\C-INSIGHT\ElasticsearchMaintenance\logs
- Stop service: “ElasticsearchMaintenanceService”.
- Login to server “DRP-INS-SRV1” and open CIT application
- Go to “Administrator” Tab 🡪 “Open DR To C5”
- Enter user, password and SQL Server instance of the DRP site
User name: drprunner
Password: DRP!ex1234
- Click “Move To C5 - Sync” and wait until all steps completed with “Pass” result.
- Login to server “DRP-INS-SRV1” and open CIT application
- Go to “Administrator” Tab 🡪 “Open DR To C5”
- Enter user, password and SQL Server instance of the DRP site
User name: drprunner
Password: DRP!ex1234
- Click “Move To C5” and wait until all steps completed with “Pass” result.
- Login to server “C5-Maint-SRV01” and open PDQ application.
- Execute Job 🡪 DRP RUN BOOK 🡪 MOVE BACK To C5 🡪 START C5 APP SRV
** Executed this job on all C5-INS-SRV* servers
- Login to server “C5-INS-SRV1” and open CIT application
- In CIT go to “Project” Tab 🡪 Open from Server
Enter user, password and SQL Server details in C5:
- Go to Management 🡪 Select the “Fit” button to display all BU’s
- Verify all BU’s are up (green):
- Login to server C5-SQL-SRV01
- Open windows services – and start service: “ElasticsearchMaintenanceService”
- Wait until all data replicate back from DR site to main.
- Check the logs for replication completed 🡪 C:\C-INSIGHT\ElasticsearchMaintenance\logs
- Login to server “C5-Maint-SRV01” and open PDQ application.
- Execute Job 🡪 DRP RUN BOOK 🡪 MOVE BACK TO C5 🡪 SET CLIENT TO C5 DC
on all C-Insight clients (C5, C2, C4).
- Run system Sanity test
Servers |
Server Name |
Phy/VM |
VLAN 61 |
DMZ VLAN 63 |
Infra |
C5-DC-SRV01 |
VM |
10.13.61.18 |
|
C5-RootCA-SRV01 |
VM |
10.13.61.19 |
|
|
C5-SubCA-SRV01 |
VM |
10.13.61.20 |
|
|
C5-Maint-SRV01 |
VM |
10.13.61.21 |
|
|
C-INSIGHT APPLICATION |
C5-INS-SRV1 |
VM |
10.13.61.23 |
|
C5-INS-SRV2 |
VM |
10.13.61.24 |
|
|
C5-INS-SRV3 |
VM |
10.13.61.25 |
|
|
C5-INS-SRV4 |
VM |
10.13.61.26 |
|
|
C5-INS-SRV5 |
VM |
10.13.61.27 |
|
|
C5-INS-SRV6 |
VM |
10.13.61.28 |
|
|
C5-INS-SRV7 |
VM |
10.13.61.29 |
|
|
C5-INS-SRV8 |
VM |
10.13.61.30 |
|
|
C5-INS-SRV9 |
VM |
10.13.61.31 |
|
|
C5-INS-SRV10 |
VM |
10.13.61.32 |
|
|
C5-INS-SRV11 |
VM |
10.13.61.33 |
|
|
C5-INS-SRV12 |
VM |
10.13.61.34 |
|
|
C5-INS-SRV13 |
VM |
10.13.61.35 |
|
|
C5-INS-SRV14 |
VM |
10.13.61.36 |
|
|
C5-INS-SRV15 |
VM |
10.13.61.37 |
|
|
C5-INS-SRV16 |
VM |
10.13.61.38 |
|
|
C5-MDM-SRV01 |
VM |
10.13.61.39 |
|
|
C5-RSNAP-SRV01 |
VM |
10.13.61.40 |
|
|
C5-RADFS-SRV01 |
VM |
10.13.61.80 |
|
|
C5-RID-SRV01 |
VM |
10.13.61.41 |
|
|
DMZ -Servers |
C5-RAPOLLO-SRV01 |
VM |
|
10.13.63.1 |
C5-RWEB-SRV01 |
VM |
|
10.13.63.3 |
|
C-INSIGHT BIG DATA |
C5-SQL-SRV01 |
VM |
10.13.61.42 |
|
C5-ESMASTER-UX01 |
VM |
10.13.61.72 |
|
|
C5-ESMASTER-UX02 |
VM |
10.13.61.73 |
|
|
C5-ESMASTER-UX03 |
VM |
10.13.61.74 |
|
|
C5-ESDATA-UX01 |
VM |
10.13.61.70 |
|
|
C5-ESDATA-UX02 |
VM |
10.13.61.71 |
|
|
C5-ESCLIENT-UX01 |
VM |
10.13.61.75 |
|
|
C5-ESCLIENT-UX01 |
VM |
10.13.61.76 |
|
|
Face Recognition |
C5-ESCLIENT-UX02 |
Physical |
10.13.61.48 |
|
Storage Servers (DDN) |
C5-DDN-SRV01/C5-DDN-SRV11 |
Physical |
10.13.61.49 |
|
C5-DDN-SRV02/C5-DDN-SRV12 |
Physical |
10.13.61.50 |
|
|
LPR Servers |
C5-LPR-SRV01 |
VM |
10.13.61.51 |
|
C5-LPR-SRV02 |
VM |
10.13.61.52 |
|
|
C5-LPR-SRV03 |
VM |
10.13.61.53 |
|
|
C5-LPR-SRV04 |
VM |
10.13.61.54 |
|
|
C5-LPR-SRV05 |
VM |
10.13.61.55 |
|
|
C5-LPR-SRV06 |
VM |
10.13.61.56 |
|
|
C5-LPR-SRV07 |
VM |
10.13.61.57 |
|
|
C5-LPR-SRV08 |
VM |
10.13.61.58 |
|
|
C5-LPR-SRV09 |
VM |
10.13.61.59 |
|
|
C5-LPR-SRV10 |
VM |
10.13.61.60 |
|
Servers |
Server Name |
Phy/VM |
VLAN 71 |
DMZ Vlan 73 |
|
DRP-DC-SRV01 |
VM |
10.13.71.18 |
|
DRP-RootCA-SRV01 |
VM |
10.13.71.19 |
|
|
DRP-RootCA-SRV01 |
VM |
10.13.71.20 |
|
|
DRP-SubCA-SRV01 |
VM |
10.13.71.21 |
|
|
C-INSIGHT APPLICATION |
DRP-INS-SRV1 |
VM |
10.13.71.23 |
|
DRP-INS-SRV2 |
VM |
10.13.71.24 |
|
|
DRP-INS-SRV3 |
VM |
10.13.71.25 |
|
|
DRP-INS-SRV4 |
VM |
10.13.71.26 |
|
|
DRP-INS-SRV5 |
VM |
10.13.71.27 |
|
|
DRP-INS-SRV6 |
VM |
10.13.71.28 |
|
|
DRP-INS-SRV7 |
VM |
10.13.71.29 |
|
|
DRP-INS-SRV8 |
VM |
10.13.71.30 |
|
|
DRP-INS-SRV9 |
VM |
10.13.71.31 |
|
|
DRP-INS-SRV10 |
VM |
10.13.71.32 |
|
|
DRP-INS-SRV11 |
VM |
10.13.71.33 |
|
|
DRP-INS-SRV12 |
VM |
10.13.71.34 |
|
|
DRP-INS-SRV13 |
VM |
10.13.71.35 |
|
|
DRP-INS-SRV14 |
VM |
10.13.71.36 |
|
|
DRP-INS-SRV15 |
VM |
10.13.71.37 |
|
|
DRP-INS-SRV16 |
VM |
10.13.71.38 |
|
|
DRP-React ADFS |
VM |
|
|
|
DRP-React SNAPI |
VM |
10.13.71.40 |
|
|
DRP-React IDSRV |
VM |
10.13.71.41 |
|
|
DMZ -Servers |
DRP-React APOLLO |
VM |
|
10.13.73.1 |
DRP-React WEBFARM |
VM |
|
10.13.73.3 |
|
C-INSIGHT BIG DATA |
DRP-SQL-SRV01 |
VM |
10.13.71.42 |
|
DRP-ESMASTER-UX01 |
VM |
10.13.71.72 |
|
|
DRP-ESMASTER-UX02 |
VM |
10.13.71.73 |
|
|
DRP-ESMASTER-UX03 |
VM |
10.13.71.74 |
|
|
DRP-ESDATA-UX01 |
VM |
10.13.71.70 |
|
|
DRP-ESDATA-UX02 |
VM |
10.13.71.71 |
|
|
DRP-ESCLIENT-UX01 |
VM |
10.13.71.75 |
|
|
DRP-ESCLIENT-UX01 |
VM |
10.13.71.76 |
|
|
Storage Servers (DDN) |
DRP-DDN-SRV01/DRP-DDN-SRV11 |
Physical |
10.13.71.49 |
|
DRP-DDN-SRV02/DRP-DDN-SRV12 |
Physical |
10.13.71.50 |
|
Created with the Personal Edition of HelpNDoc: Easily create CHM Help documents