
FileGPS Disaster Recovery Recommendation Document
This document outlines the disaster recovery process for FileGPS, covering the five main components of the system:
The purpose of this document is to ensure that all essential components required for disaster recovery are available on the DR server. By maintaining these components, administrators can quickly restore services in case of system failure. This document provides a checklist of required items for each component to be replicated on the DR server.
The five essential components – FileGPS UI, Backend, Rest Consumer, FileGPS Client and Superset must be available in both the Primary region and the Disaster Recovery (DR) region. Each component relies on certain dependencies, such as the Database, SMTP, and SAML configurations.
If the Primary region goes down for any reason, the data should be replicated in the DR region to ensure continuity. To maintain functionality, the database, SMTP, and SAML configurations must be correctly set up in the DR region. Before switching operations to the DR region, verify that these configurations are valid. Once the Primary region is restored, traffic should automatically revert to it.
If any of the SMTP configuration details differ between the Primary and DR regions, update the respective details in the DR region and verify them thoroughly.
Additionally, validate the SMTP credentials in the Backend and UI configurations, including the SMTP host and port, before finalizing the setup to ensure proper delivery of FileGPS notifications.
If any of the SAML configuration details differ between the Primary and DR regions, update the respective details in the DR region and verify them thoroughly. Additionally, ensure that the SAML configuration details in application.yml, including SSO URL, SLO URL, app-SLO, and IDP configurations such as metadata, IDP groups, and registration ID, are correctly validated.
If a load balancer is in place, ensure the following checks in the DR region:
Starting the UI
Checking UI Status
Use the following command to verify if the UI is running:
Checking UI Logs
Stopping the UI
(or) kill $(ps –ef | grep filegps-api-6.3.3.1 | awk ‘{print $2}’)
Starting the Backend
Checking Backend Status
Example: ps -ef | grep fnr_alert_job
Checking Backend Logs
Example: tail -f <Installed_Directory>/Backend/logs/fnr_alert.log
Stopping the Backend
(or) kill $(ps -ef | grep filegps_backend_job_name | awk ‘{print $2}’)
Example: pkill –f fnr_alert_job
Starting the Rest Consumer
Checking Rest Consumer Status
Checking Rest Consumer Logs
Stopping the Rest Consumer
If MQ is in the Primary region and only the application in the Primary region is down, the application in the DR region should be able to start and communicate with the MQ in the Primary region. This requires ensuring that the necessary firewall rules are configured to allow communication from the DR to the Primary region.
Starting the FileGPS Client
Checking FileGPS Client Status
Checking FileGPS Client Logs
Stopping the FileGPS Client