Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Name | Type | Version |
---|---|---|
Contains all changes from HF1 + HF2.
This Hotfix extends the Support Matrix of T4O 4.1 as follows:
Canonical Openstack Victoria based on Focal (20.04) Support
Kolla Ansible Openstack Victoria on Ubuntu 20.04 and CentOS8
Openstack Ansible Victoria on Ubuntu 20.04 and CentOS8
TripleO train on CentOS7 and CentOS8
The installation into these environments requires upgrading the Trilio Appliance from 4.1 GA to 4.1 HF3 or higher
Trilio release 4.1 introduces new features and capabilities including:
Openstack Ussuri support
New File Recovery Process
Increased Openstack independency
Installation Optimization for Kolla Ansible Openstack, Ansible Openstack, and Red Hat Openstack Platform
Support for External MySQL/MariaDB databases
Incremental Backup for nova booted instances
Support of Openstack User Groups
New Quota: Snapshots
S3 Support for Kolla Ansible Openstack
UI enhancement for selective Restore
Trilio 4.1.94 is the GA release of Trilio 4.1
Name | Type | Version |
---|
Trilio 4.1 continues to enable Openstack versions and Distributions, allowing Trilio customers to stay up to date with Openstack releases.
Trilio 4.1 introduces full support for Openstack Ussuri for Kolla Ansible Openstack, Ansible Openstack, and Canonical Openstack. In addition, Trilio 4.1 does of course support the active long-term releases from Red Hat and Canonical. Openstack users of those releases can continue to use the latest Trilio functionalities.
Since Trilio was released for the first time was the File Recovery Manager instance in tandem with Trilio. The File Recovery Manager instance helped customers to easily and quickly fetch and restore files and folders directly from the backups.
Over the years did more and more customers request to have the File Recovery Manager on their own images or on a specific Linux distribution. We looked into the possibility to create multiple versions of the File Recovery Manager which was identified as not getting to the point of flexibility our customers need.
The result was a revamp of the File Recovery Manager which allows the installation on any CentOS-based or Ubuntu-based instance.
Trilio has the goal to become an integrated part of Openstack since its first draft on a whiteboard. Back then were the Trilio services defined as sub-services of Nova and Trilio tied deeply into the already existing nova services and became an integral part of it.
Over the years did the nova service change and possibilities that were once available have been taken out from nova. This plus the required lengthy qualification of all Openstack Versions and Distributions made it clear that a change is required.
Trilio 4.1 is now a complete stand-alone service, which is communicating with nova and other Openstack services through APIs only. The goal of this independence is to speed up future qualification and requalification cycles to provide a broader support matrix again.
Trilio's integration into Openstack already starts with the installation process. This process required manual installation steps in the past, which were hard to automize and scale on bigger environments.
Trilio 4.1 is therefore introducing Ansible Playbooks for Kolla-Ansible and Ansible Openstack, while the integration with Red Hat Director has been deepened. Further, the Trilio configurator does now allow to set the backup target on the Trilio Appliance directly based on Distribution.
The Trilio Appliance provides its own database. A database that suits the needs of 99% of Trilio's customers. Some customers do have higher requirements. Be it performance, security, or just the general design of the Openstack itself.
For these customers and everyone who wants to use it is Trilio 4.1 introducing the possibility to configure Trilio with an external database.
Openstack provides the possibility to start an instance from a Cinder Volume or directly using the Glance Image and a nova volume.
Trilio always provided incremental forever backups for all cinder volumes. Root-Volumes from nova-booted instances were always taken as a full backup of the Glance image and the actual VM root volume.
Trilio 4.1 does introduce incremental backups for nova booted instance root volumes. This allows Trilio 4.1 to provide incremental backups to any type of root volume.
Trilio is using the Openstack Keystone service to authenticate any user and to verify that the right permissions are set. Openstack Keystone allows to group users and set the permissions to the group.
These Openstack User Groups are now fully supported.
Trilio 4.0 introduced the Quota functionality, which allowed to set quotas for the number of workloads, number of VMs and amount of storage used by a single tenant.
Trilio 4.1 extends this feature by the number of Trilio Snapshots that a Tenant is allowed to have.
S3 is becoming the standard to transfer data to and from storage solutions. Trilio introduced S3 already in Version 3.0 but had to take it out for Kolla Ansible since Kolla Ansible has been added to the Support Matrix.
Trilio 4.1 is now closing that gap to other Openstack Distributions and provides full S3 support for Kolla Ansible Openstack Ussuri.
The Selective Restore is the most powerful and complex restore Trilio has to offer. The UI needs to be easy to understand and help the user to fulfill its task.
Several points have been identified to improve this requirement of usability. The selective restore now allows to select or deselect all VMs at once. Further are the VMs now provided in an easy to overview list and the sub-controls can be expanded and collapsed as necessary.
This release contains the following known issues which are tracked for a future update.
Observation:
Login into the iscsi device is getting rejected for the Trilio service
Result is the Snapshot not moving forward until timing out
The workloadmgr Quota feature is still fully supported through CLI.
Observation:
It has been observed that on non-kolla, non-rhosp setups, such as openstack ansible, nova user id is not same as we consider as default(162).
Another observation is that, the id which is assigned to nova, was conflicting with id of system user in TVM, this created a situation where we had to redeploy Openstack.
Workaround:
Update permissions of /var/triliovault-mounts to 755
Observation:
VM Volumes stored on Ceph are successfully excluded from backup if desired
Restore does create empty Ceph Volume
created empty Ceph Volume is not attachable or formattable
Observation:
For every restore will the metadata config_drive be set as blank value
No impact on restored VMs known
Workaround
delete metadata config_drive
or set desired value
Observation:
TVault re-configuration while adding nodes to existing TVM cluster fails at "Configuring Trilio Cluster"
Reason is that the prev mysql password was not working and mysql root access has be reset.
Workaround:
remove /root/.my.cnf file on already configured TVM and reconfigure it
Observation:
After TVault re-configuration post addition of 2 more nodes to existing TVM cluster ("import workloads" was not seleted), the databases do not sync against already existing TVM.
It is expected that while adding the 2 new nodes, the db on node1 should get synced up with 2 new nodes and the existing workloads should be available post the reconfig on the new 3 node TVM cluster.
Workaround:
Run workload import from CLI
Observation:
VM was set with metadata exclude_boot_disk_from_backup set to true
Restored instance showed, that data was backed up and restored
Observation:
Reinitialize does not keep the already uploaded Openstack Certificates used to communicate with Openstack.
Workaround:
Upload Certificates again
Observation:
Workaround:
Configure/re-configure T4O with import option from UI after reinitialize.
Observation:
After reinitializing was neither the UI nor CLI showing Node information
Workaround:
Restart wlm-workloads and wlm-cron services on Trilio nodes
systemctl restart wlm-workloads
systemctl restart wlm-cron
Oberservation:
Running more than 25 workloads at the exact same time leads to error
dmapi service is not responding
Snapshots fail with "object is not subscriptable"
Workaround:
Contact Trilio Support to implement a known workaround.
Observation:
Just changing the backup target in the Kolla Ansible configuration files and redeploying leads to dmapi container stuck in restart
Workaround:
Observation:
SSL enabled Openstack
Backup and Restore jobs fail with with missing TLS CA certificate bundle error
Workaround:
Configure the Trilio appliance with Openstack CA provided
OR Provide Openstack CA to /etc/workloadmgr/ca-chain.pem
Name | Type | Version |
---|
Trilio for Openstack supports sending notification emails upon succeeded or failed backup/restore jobs. The required SMTP server configuration enforced the usage of a password for the SMTP user.
A password is no longer necessary when the SMTP server doesn’t need it.
Cinder supports a Volume Type, which allows attaching the same Volume to multiple instances simultaneously. Backups and Restore for this Volume Type failed. Trilio for Openstack is now providing base support for this Volume Type.
Cinder Boot Volumes with Multi-Attach activated are not yet supported.
This only allows the backup and restoration of Multi-Attach Volumes. Trilio will handle the Volume like any single attached Volume for now. For example, a multi-attach volume connected to 2 VMs will get backed up and restored twice.
Trilio for Openstack is tracking the progress of backups and restore using a tracking file. If this file is not getting updated within a defined timeframe, the Trilio data transfer fails. This timeframe got extended from 10 minutes to 20 minutes.
This value will become configurable in T4O 4.1 SP1
Trilio for Openstack was logging many system error messages, which were actually expected and handled internally without impacting the actual functions of the solution.
These error messages were misleading in the normal troubleshooting process. These error messages aren’t logged anymore by default. They can be reactivated using the debug mode for logging.
Image upload timeout window has been increased and made configurable
The upload of Trilio backups is limited in time to prevent stalling workloads with a stuck upload process.
This timeout window has been increased from 10 hours to 48h by default and is configurable in the workloadmgr config file.
Restart wlm-workloads after setting this value.
When the Global Job Scheduler is deactivated, no backups are triggered. The Global Job Scheduler contains a grace time for missed Snapshots. All Snapshots that were supposed to be triggered within this grace time before activation of the Global Job Scheduler are retriggered.
This grace period is now configurable in the workloadmgr config file.
After setting this value, restart the wlm-cron service.
In highly used environments, the dmapi worker got identified as a potential bottleneck. The amount of default workers used by a dmapi service has been increased to 16 and is configurable in the dmapi config file.
Restart the dmapi service after setting the configuration manually.
The upgrade process of RHOSP and Kolla Ansible will automatically set this value.
Response times in highly used environments might be slow, leading to the dmapi service timing out in the haproxy connection. The default values of haproxy are not always suitable in that case.
The haproxy configuration for the dmapi service has been extended to the following values.
Restart the haproxy service after setting the configuration manually.
The upgrade process of RHOSP and Kolla Ansible will automatically set these values.
The Openstack Neutron service is highly used up to the point that API calls are timing out. Trilio backups and restores failed when any Neutron API call timed out.
Trilio will now retry Neutron API calls three times before failing a backup or restore.
It was observed in multipathing environments that sometimes backups failed due to errors with the temporary Cinder Volumes during the following actions:
Create Cinder Volume out of Cinder Snapshot
Mount Cinder Volume to Compute Node
Unmount Cinder Volume from Compute Node
Delete Cinder Volume
During these operations in multipath environments, errors are now handled by rescanning the connected devices and retrying the internal commands.
The amount of retries is configurable in the tvault-contego config file.
Restart the tvault-contego service after manually setting the value.
The upgrade process of RHOSP and Kolla Ansible will automatically set these values.
The Trilio for Openstack GUI uses the admin account to secure access to the features and functionalities located on the Trilio appliance. The following events are now getting logged by the Trilio Appliance:
Login attempts
Logout events
Password changes for the admin user
The Trilio for Openstack login page can now is extendable to contain a text banner. This text banner is configurable on the Trilio appliance by editing the banner yaml file located under:
/etc/tvault-config/banner.yaml
The content of the file looks as follows:
Restart tvault-config after changing the banner to activate it.
An issue got fixed for rare occasions in which the status of the local job scheduler of a single workload was disabled, despite the workload created with an enabled job scheduler.
An issue got fixed for the Global Job Scheduler returning enabled or disabled even when the wlm- cron service is deactivated. The status returned in this scenario is now an error message showing the wlm-cron service status.
The documentation link available inside the Trilio Appliance was still pointing to the old outdated documentation webpage. The link has been updated to point towards the correct documentation.
An issue got fixed, which prevented the restoration of VMs into different Availability Zones in the case of the original Availability Zone no longer being available.
An issue got fixed, which lead to stall service jobs being left behind upon restart of workloadmgr services.
An issue got fixed, which prevented the correct mounting and access of Volumes partitioned and configured by LVM.
An issue got fixed, which lead to a race condition between upload threads in the S3 fuse plugin, which led to backups failing during the upload phase.
An issue got fixed, which lead to multipathing not being enabled in the Data-Mover container used by RHOSP and Kolla Ansible.
Upgrading to 4.1 HF1 will automatically activate multipathing where feasible.
An issue got fixed, which lead to the creation of a Trilio mount point, even when the provided S3 backup target is not reachable during deployment or configuration.
The deployment will still succeed, but the tvault-object-store service will be in a failed state.
An issue got fixed, which prevented the detection of the Trilio trustee role for a user, who had this role inherited from a user group.
An issue got fixed, during which the chosen endpoint type did not get honored for Keystone and the configurator always reached out to the Keystone internal endpoint.
The following value can be set in the api-paste ini file located under:
/etc/workloadmgr/api-paste.ini
Afterward the wlm-workloads service needs to be restarted.
It is recommended to reconfigure the appliance to activate the fix
An issue got identified, which leads to the disk integrity check failing, although there is no data loss.
Snapshots with a failed disk integrity check are currently no longer failing and instead show a warning in the log files about the failed disk integrity check.
A complete fix of the root cause is planned for 4.1 SP1.
An issue got identified in which Latin characters like á did lead to a Workload not being created or a backup not succeeding.
This hotfix implements support of Latin characters for the following:
Calendar shown and used during workload creation for the job scheduler
Name and description of security groups
Full support for Latin characters comes in 4.1 SP1
An issue got fixed, which prevented successful backups in environments using multipathing with the FC storage protocol.
Name | Type | Version |
---|
Contains all changes from HF1, HF2 and HF3.
A bug has been fixed which prevented the correct presentation of files on an ext2 filesystem when running a filesearch.
An issue has been fixed which prevented a correct filesearch on CentOS8 and RHEL8 backups with the xfs filesystem.
An issue has been fixed which prevented a successful selective restore in case of the original Glance image not being available anymore.
An issue has been fixed which prevented the successful creation of a Glance image during a restore.
An issue has been fixed which set the wrong ownership to the Trilio mountpoint.
A new configuration parametervault_s3_max_pool_connections
has been added to adjust the number of pool connections.
The default value is 500.
This parameter can be set in the workloadmgr.conf on the Trilio appliance.
In the case of a Canonical installation this parameter is to be set in the tvault-object-store.conf in the workloadmgr container.
The default value for the configuration parameter vault_s3_max_pool_connections
has been reduced from 120 to 30.
This parameter can be set in the workloadmgr.conf on the Trilio appliance.
In the case of a Canonical installation this parameter is to be set in the tvault-object-store.conf in the workloadmgr container.
A timeout with automatic process kill has been added to prevent high CPU usage from stale NFS mount operations in case of mounting errors.
The documentation has been extended to support the usage of Ubuntu 20.04 cloud images for the Snapshot mount functionality.
Name | Tag |
---|---|
Name | Tag |
---|
When the is used via CLI is the scheduler trust changed from enabled to disabled.
Follow guide to
Name | Tag |
---|
Name | Tag |
---|
s3fuse
python package
4.1.94.3
tvault-configurator
python package
4.1.94.6
workloadmgr
python package
4.1.94.5
workloadmgrclient
python package
4.1.94
dmapi
deb package
4.1.94.3
python3-dmapi
deb package
4.1.94.3
tvault-contego
deb package
4.1.94.6
python3-tvault-contego
deb package
4.1.94.6
tvault-horizon-plugin
deb package
4.1.94.3
python3-tvault-horizon-plugin
deb package
4.1.94.3
s3-fuse-plugin
deb package
4.1.94.3
python3-s3-fuse-plugin
deb package
4.1.94.3
workloadmgr
deb package
4.1.94.5
workloadmgrclient
deb package
4.1.94
dmapi
rpm package
4.1.94.3-4.1
python3-dmapi
rpm package
4.1.94.3-4.1
tvault-contego
rpm package
4.1.94.6-4.1
python3-tvault-contego
rpm package
4.1.94.6-4.1
tvault-horizon-plugin
rpm package
4.1.94.3-4.1
python3-tvault-horizon plugin-el8
rpm package
4.1.94.3-4.1
python-s3fuse-plugin-cent7
rpm package
4.1.94.3-4.1
python3-s3fuse-plugin
rpm package
4.1.94.3-4.1
workloadmgrclient
rpm package
4.1.94
Gitbranch
hotfix-3-TVO/4.1
RHOSP13 containers
4.1.94-hotfix-4-rhosp13
RHOSP16.0 containers
4.1.94-hotfix-4-rhosp16
RHOSP16.1 containers
4.1.94-hotfix-4-rhosp16.1
Kolla Ansible Ussuri containers
4.1.94-hotfix-4-ussuri
Kolla Ansible Victoria containers
4.1.94-hotfix2-victoria
TripleO Train container
4.1.94-hotfix-2-tripleo
s3fuse | python package | 4.1.94 |
tvault-configurator | python package | 4.1.94 |
workloadmgr | python package | 4.1.94 |
workloadmgrclient | python package | 4.1.94 |
contegoclient | python package | 4.1.94 |
dmapi | deb package | 4.1.94 |
python3-dmapi | deb package | 4.1.94 |
tvault-contego | deb package | 4.1.94 |
python3-tvault-contego | deb package | 4.1.94 |
tvault-horizon-plugin | deb package | 4.1.94 |
python3-tvault-horizon-plugin | deb package | 4.1.94 |
s3-fuse-plugin | deb package | 4.1.94 |
python3-s3-fuse-plugin | deb package | 4.1.94 |
workloadmgr | deb package | 4.1.94 |
dmapi | rpm package | 4.1.94 |
python3-dmapi | rpm package | 4.1.94 |
tvault-contego | rpm package | 4.1.94 |
python3-tvault-contego | rpm package | 4.1.94 |
tvault-horizon-plugin | rpm package | 4.1.94 |
python3-tvault-horizon plugin-el8 | rpm package | 4.1.94 |
python-s3fuse-plugin-cent7 | rpm package | 4.1.94 |
python3-s3fuse-plugin | rpm package | 4.1.94 |
Gitbranch | stable/4.2 |
RHOSP13 containers | 4.1.94-rhosp13 |
RHOSP16.0 containers | 4.1.94-rhosp16 |
RHOSP16.1 containers | 4.1.94-rhosp16.1 |
Kolla Ansible Ussuri containers | 4.1.94-ussuri |
s3fuse | python package | 4.1.94.3 |
tvault-configurator | python package | 4.1.94.3 |
workloadmgr | python package | 4.1.94.3 |
workloadmgrclient | python package | 4.1.94 |
dmapi | deb package | 4.1.94.3 |
python3-dmapi | deb package | 4.1.94.3 |
tvault-contego | deb package | 4.1.94.3 |
python3-tvault-contego | deb package | 4.1.94.3 |
tvault-horizon-plugin | deb package | 4.1.94.3 |
python3-tvault-horizon-plugin | deb package | 4.1.94.3 |
s3-fuse-plugin | deb package | 4.1.94.3 |
python3-s3-fuse-plugin | deb package | 4.1.94.3 |
workloadmgr | deb package | 4.1.94.3 |
workloadmgrclient | deb package | 4.1.94 |
dmapi | rpm package | 4.1.94.3-4.1 |
python3-dmapi | rpm package | 4.1.94.3-4.1 |
tvault-contego | rpm package | 4.1.94.3-4.1 |
python3-tvault-contego | rpm package | 4.1.94.3-4.1 |
tvault-horizon-plugin | rpm package | 4.1.94.3-4.1 |
python3-tvault-horizon plugin-el8 | rpm package | 4.1.94.3-4.1 |
python-s3fuse-plugin-cent7 | rpm package | 4.1.94.3-4.1 |
python3-s3fuse-plugin | rpm package | 4.1.94.3-4.1 |
workloadmgrclient | rpm package | 4.1.94 |
Gitbranch | hotfix-1-TVO/4.1 |
RHOSP13 containers | 4.1.94-hotfix2-rhosp13 |
RHOSP16.0 containers | 4.1.94-hotfix-2-rhosp16 |
RHOSP16.1 containers | 4.1.94-hotfix-2-rhosp16.1 |
Kolla Ansible Ussuri containers | 4.1.94-hotfix-2-ussuri |
s3fuse | python package | 4.1.94.4 |
tvault-configurator | python package | 4.1.94.6 |
workloadmgr | python package | 4.1.94.8 |
workloadmgrclient | python package | 4.1.94 |
dmapi | deb package | 4.1.94.3 |
python3-dmapi | deb package | 4.1.94.3 |
tvault-contego | deb package | 4.1.94.7 |
python3-tvault-contego | deb package | 4.1.94.7 |
tvault-horizon-plugin | deb package | 4.1.94.3 |
python3-tvault-horizon-plugin | deb package | 4.1.94.3 |
s3-fuse-plugin | deb package | 4.1.94.4 |
python3-s3-fuse-plugin | deb package | 4.1.94.4 |
workloadmgr | deb package | 4.1.94.8 |
workloadmgrclient | deb package | 4.1.94 |
dmapi | rpm package | 4.1.94.3-4.1 |
python3-dmapi | rpm package | 4.1.94.3-4.1 |
tvault-contego | rpm package | 4.1.94.7-4.1 |
python3-tvault-contego | rpm package | 4.1.94.7-4.1 |
tvault-horizon-plugin | rpm package | 4.1.94.3-4.1 |
python3-tvault-horizon plugin-el8 | rpm package | 4.1.94.3-4.1 |
python-s3fuse-plugin-cent7 | rpm package | 4.1.94.4-4.1 |
python3-s3fuse-plugin | rpm package | 4.1.94.4-4.1 |
workloadmgrclient | rpm package | 4.1.94 |
Gitbranch | hotfix-4-TVO/4.1 |
RHOSP13 containers | 4.1.94-hotfix-5-rhosp13 |
RHOSP16.0 containers | 4.1.94-hotfix-5-rhosp16 |
RHOSP16.1 containers | 4.1.94-hotfix-5-rhosp16.1 |
Kolla Ansible Ussuri containers | 4.1.94-hotfix-5-ussuri |
Kolla Ansible Victoria containers | 4.1.94-hotfix-3-victoria |
TripleO Train container | 4.1.94-hotfix-3-tripleo |
Name | Type | Version |
---|---|---|
Contains all changes from HF1.
An issue has been fixed that prevented a successful restore in the case of restoring a Cinder boot volume with a volume type that has the multi-attach functionality activated.
An issue has been fixed which prevented the successful finish of the backup process for workloads with many protected instances or instances with long names.
An issue has been fixed which led to no visible Ansible logs upon reconfiguring the Trilio appliance.
An issue has been fixed which led to the SMTP configuration always throwing the misleading error smtp_timeout cannot be greater than 10
upon sending a test email.
An issue has been fixed which led to Security Groups not being restored when a remote Security Group was having the exact same Security Group Rule as another Security Group in the chain.
An issue has been fixed which led to a restore apparently completing successfully despite an error during the restore of the Security Groups.
Trilio, by TrilioData, is a native OpenStack service that provides policy-based comprehensive backup and recovery for OpenStack workloads. The solution captures point-in-time workloads (Application, OS, Compute, Network, Configurations, Data and Metadata of an environment) as full or incremental snapshots. These snapshots can be held in a variety of storage environments including NFS AWS S3 compatible storage. With Trilio and its single click recovery, organizations can improve Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO). With Trilio, IT departments are enabled to fully deploy OpenStack solutions and provide business assurance through enhanced data retention, protection and integrity.
With the use of Trilio’s VAST (Virtual Snapshot Technology), Enterprise IT and Cloud Service Providers can now deploy backup and disaster recovery as a service to prevent data loss or data corruption through point-in-time snapshots and seamless one-click recovery. Trilio takes point-in-time backup of the entire workload consisting of compute resources, network configurations and storage data as one unit. It also takes incremental backups that only captures the changes that were made since last backup. Incremental snapshots save time and storage space as the backup only includes changes since the last backup. The benefits of using VAST for backup and restore could be summarized as below:
Efficient capture and storage of snapshots. Since our full backups only include data that is committed to storage volume and the incremental backups only include changed blocks of data since last backup, our backup processes are efficient and storages backup images efficiently on the backup media
Faster and reliable recovery. When your applications become complex that snap multiple VMs and storage volumes, our efficient recovery process will bring your application from zero to operational with just click of button
Easy migration of workloads between clouds. Trilio captures all the details of your application and hence our migration includes your entire application stack without leaving any thing for guess work.
Through policy and automation lower the Total Cost of Ownership. Our tenant driven backup process and automation eliminates the need for dedicated backup administrators, there by improves your total cost of ownership.
Trilio is an add on service to OpenStack cloud infrastructure and provides backup and disaster recovery functions for tenant workloads. Trilio is very similar to other OpenStack services including nova, cinder, glance, etc and adheres to all tenets of OpenStack. It is a stateless service that scales with your cloud.
Trilio has four main software components:
Trilio ships as a QCOW2 image. User can instantiate one or more VMs from the QCOW2 image on a standalone KVM boxes.
Trilio API is a python module that is installed on all OpenStack controller nodes where the nova-api service is running.
Trilio Datamover is a python module that is installed on every OpenStack compute nodes
Trilio horizon plugin is installed as an add on to horizon servers. This module is installed on every server that runs horizon service.
Trilio is both a provider and consumer into OpenStack ecosystem. It uses other OpenStack services such as nova, cinder, glance, neutron, and keystone and provides its own service to OpenStack tenants. To accomodate all possible OpenStack deployments, Trilio can be configured to use either public or internal URLs of services. Likewise Trilio provides its own public, internal and admin URLs.
This figure represents a typical network topology. Trilio exposes its public URL endpoint on public network and Trilio virtual appliances and data movers typically use either internal network or dedicated backup network for storing and retrieving backup images from backup store.
Name | Type | Version |
---|---|---|
Contains all changes from previous hotfixes.
Temporary Volumes left behind under certain circumstances
An issue has been fixed which left the Trilio temporary Cinder Volumes behind when the upload was timing out or when the Trilio cluster got restarted.
Setting invalid date through workload edit
An issue has been fixed which allowed setting invalid dates through the workload edit command.
Parallel workload performing significantly worse
A race condition has been fixed which led to an exponential growth of required time with an increased amount of protected VMs.
Restore of ports with port-security disabled
An issue has been fixed which prevented a successful restore of Neutron ports that have the port-security functionality disabled.
Name | Type | Version |
---|---|---|
The following OpenStack distributions and versions have been added to the Trilio support matrix.
Red Hat OpenStack 16.2
Support for Ceph NFS-Ganesha
It is now possible to utilize the NFS-Ganesha gateway as backup target.
Restore continues even when Security Group restore fails
Failures upon restoring Security Groups are no longer leading to a complete fail of the restore. Failed Security Groups are logged to provide the required information for next steps.
Future versions and hotfixes will continue to improve the Security Group restore process and reduce the amount of reasons why Security Groups can't be restored.
AttributeError: 'unicode' object has no attribute 'get'
An issue with multipathing environments has been fixed which led to failed backups with the AttributeError: 'unicode' object has no attribute 'get'
error message.
Restricting the IP address of the Trilio GUI lead to logs not being downloadable
The documentation about restricting the IP access to the Trilio GUI has been updated to include the required port 3001 to enable the download of logs through the dashboard_ip
.
Multipath.conf file not present in Datamover container
An issue has been fixed which prevented the correct placement of the multipath.conf file inside the Trilio Datamover container.
Name | Type | Version |
---|---|---|
Contains all changes from HF1, HF2, HF3, and HF4.
Post restore Windows VM boots into recovery console
An issue has been fixed which prevented VMs running Windows to boot properly after a restore in the case of Nova boot volumes being used.
Workload list can not be retrieved after manual changes in the Trilio database
An issue has been fixed which prevented the successful pull of data from the Trilio database in case of incorrectly applied manual changes.
admin endpoint type not honored by the configurator
An issue has been fixed which prevented the successful usage of the Openstack admin endpoint network as the standard communication network for T4O
Workloads and Snapshots stuck in delete status
An issue has been fixed which prevented resetting the status of Workloads and Snapshots in case they are stuck in the deletion state.
Workload scheduler stability
The workload scheduler stability has been enhanced to prevent the start of multiple scheduled jobs at the same time or delayed from its expected time.
Name | Type | Version |
---|
Contains all changes from previous hotfixes.
Restore of tenant-shared security groups failed
An issue has been fixed which prevented a successful restore when a security group was referring to a shared security group from a different tenant.
Email alerts not working as intended
Fixed an issue where email alerts were not sent in the case of a passwordless SMTP server.
Name | Type | Version |
---|
Datamover container restarting
/var/trilio/triliovault-mounts
was having incorrect ownership. We have fixed it through DevOps code
Once the original project is deleted, the workload cannot be reassigned to a different UserID/ProjectID.
We removed the condition where we were checking if older tenant_ids if present in the newer tenant list
Trilio core functionality operations do not perform as expected when the master T4O node is powered off
Caching bug in the code where the in-memory dictionary was not in sync with the service table in MySQL.
tvault-config service is in crashloop on 2 out of 3 nodes in T4O cluster
workload policy shows an incorrect start time
default_tvault_dashboard_tvo-tvm not available after yum update
Reassign of the workload from deleted project fails
This document provides information on TVO-4.1.HF13 Release.
Important Info:
To use this hotfix (4.1.HF13)
Customers (except Canonical Openstack) and having Openstack Ussuri need to have an already deployed and working TVO-4.1 GA OR TVO-4.1.HF1 OR TVO-4.1.HF2 OR TVO-4.1.HF3 OR TVO-4.1.HF4 OR TVO-4.1.HF5 OR TVO-4.1.HF6 OR TVO-4.1.HF7 OR TVO-4.1.HF8 OR TVO-4.1.HF9 OR TVO-4.1.HF10 OR TVO-4.1.HF11 OR HF12
Customers (except Canonical Openstack) and having Openstack Victoria OR TripleO Train need to follow the TVO-4.1 GA deployment process and directly upgrade to 4.1.HF13 containers/packages. The high-level flow below:
Deplo T4O-4.1 GA appliance.
Upgrade to 4.1.HF13 packages on the appliance.
Kolla & TripleO
Deploy Trilio components via 4.1.HF13 containers/packages on Openstack Victoria/TripleO Train.
Openstack Ansible
Deploy Trilio components Openstack Victoria [This will deploy 4.1 GA packages]
Upgrade TrilioVault packages to 4.1.HF13 on Openstack Victoria.
Configure the Trilio appliance.
Canonical users having Openstack Ussuri can either upgrade (on top of 4.1 GA) using Trilio upgrade documents OR do a fresh deployment using 4.1 Deployment documents.
Canonical users having Openstack Victoria can either upgrade (on top of 4.1.HF4) using Trilio upgrade documents OR do a fresh deployment using 4.1 Deployment documents.
The deploy/upgrade documentations provide the detailed steps to deploy/upgrade to the hotfix.
Current Hotfix release targets the following:
Verification of Jira issues targeted for 4.1.HF13 release.
As part of the new process, the delivery will be via packages; end users would need to do the rolling upgrade on top of 4.1 GA OR 4.1.HF1 OR 4.1.HF2 OR TVO-4.1.HF3 OR TVO-4.1.HF4 OR TVO-4.1.HF5 OR TVO-4.1.HF6 OR TVO-4.1.HF7 OR TVO-4.1.HF8 OR TVO-4.1.HF9 OR TVO-4.1.HF10 OR TVO-4.1.HF11 OR TVO-4.1.HF12
Issues logged by Customers to be documented in this section
The following table gives the overview of coverage against Trilio Deployment with Openstack:
The following table gives the overview of coverage against backup stores covered as part of the development and testing of 4.1.HF9 release.
This hotfix contains only a package update for the Trilio appliance. There are no new containers available compared to earlier releases.
Name | Type | Version |
---|
200% cpu usage or spike in CLOSE_WAIT connections with S3 backup target
An issue has been fixed which lead to a high CPU resource usage in case of fluctuating connection to the S3 backup target.
Learn about Trilio Support for OpenStack Distributions
The CentOS community has moved over to CentOS stream.
The support for CentOS8 has ended on December 31st 2021. The official announcement can be found .
CentOS7 is still supported and maintained till June 30th 2024
Kolla Ansible environments running on CentOS8 are receiving continuoues limited support. This means that future updates from Trilio for Kolla Ansible environments on CentOS8 will use the latest available CentOS8 base containers and only the Trilio for OpenStack code gets updated. When the Kolla Ansible community provides CentOS Stream based containers, Trilio will provide CentOS Stream based containers as well.
Trilio Release | RHOSP | Canonical | Ansible | Kolla | TripleO |
---|
NFS & S3 Support:
All versions of Trilio for OpenStack support NFSv3 and S3 as backup targets.
Supported OS:
RHEL7 and RHEL8 for RHOSP; Ubuntu 18.04 and 20.04 for Canonical distributions; Ubuntu 18.04, 20.04, and CentOS Stream for Ansible OpenStack; Ubuntu 18.04, 20.04 and CentOS Stream for Kolla OpenStack; CentOS7 for TripleO Train.
Deployment:
RHOSP distributions are deployed using Red Hat Director, Canonical distributions are deployed using JuJu Charms, Ansible OpenStack and Kolla OpenStack distributions are deployed using Ansible, and TripleO Train is deployed using Heat.
Name | Type | Version |
---|
If Trilio created Cinder snapshot is not in available state, all subsequent backups fail.
An issue has been fixed which led to backups ending unsuccessfully when Trilio wasn't able to delete older no longer required Cinder Snapshots.
Multipath rescan commands not executing in a timely manner
An issue has been fixed which led to failed detach and deletion of temporary volumes due to the rescan command not executing fast enough.
Improved stability of the S3fuse plugin
Several issues have been fixed which led to an unstable connection of the S3fuse plugin, leading to backups and restore failing during the data transfer phase.
Enhanced support for Latin characters
An issue has been fixed where the usage of Latin characters in the restore name or description did lead to the restore being unsuccessful.
Name | Tag |
---|---|
Name | Tag |
---|---|
Name | Tag |
---|---|
Name | Tag |
---|---|
Name | Tag |
---|
Name | Tag |
---|
TVault Deployment Tool | Covered ? | Comments |
---|
Backup Storage | Covered? |
---|
Summary | Workaround/Comments (if any) |
---|
Name | Tag |
---|
Name | Tag |
---|
Distribution/Version | Trilio 4.1 HF10+ | Trilio 4.1 HF8+ | Trilio 4.1 HF3+ | Trilio 4.1 GA | OS | NFS Support | S3 Support | Deployment |
---|
Name | Tag |
---|
s3fuse
python package
4.1.94.3
tvault-configurator
python package
4.1.94.5
workloadmgr
python package
4.1.94.5
workloadmgrclient
python package
4.1.94
dmapi
deb package
4.1.94.3
python3-dmapi
deb package
4.1.94.3
tvault-contego
deb package
4.1.94.4
python3-tvault-contego
deb package
4.1.94.4
tvault-horizon-plugin
deb package
4.1.94.3
python3-tvault-horizon-plugin
deb package
4.1.94.3
s3-fuse-plugin
deb package
4.1.94.3
python3-s3-fuse-plugin
deb package
4.1.94.3
workloadmgr
deb package
4.1.94.5
workloadmgrclient
deb package
4.1.94
dmapi
rpm package
4.1.94.3-4.1
python3-dmapi
rpm package
4.1.94.3-4.1
tvault-contego
rpm package
4.1.94.4-4.1
python3-tvault-contego
rpm package
4.1.94.4-4.1
tvault-horizon-plugin
rpm package
4.1.94.3-4.1
python3-tvault-horizon plugin-el8
rpm package
4.1.94.3-4.1
python-s3fuse-plugin-cent7
rpm package
4.1.94.3-4.1
python3-s3fuse-plugin
rpm package
4.1.94.3-4.1
workloadmgrclient
rpm package
4.1.94
Gitbranch
hotfix-2-TVO/4.1
RHOSP13 containers
4.1.94-hotfix-2-rhosp13
RHOSP16.0 containers
4.1.94-hotfix-2-rhosp16
RHOSP16.1 containers
4.1.94-hotfix-2-rhosp16.1
Kolla Ansible Ussuri containers
4.1.94-hotfix-2-ussuri
s3fuse
python package
4.1.94.4
tvault-configurator
python package
4.1.94.7
workloadmgr
python package
4.1.94.14
dmapi
deb package
4.1.94.3
python3-dmapi
deb package
4.1.94.3
tvault-contego
deb package
4.1.94.8
python3-tvault-contego
deb package
4.1.94.8
tvault-horizon-plugin
deb package
4.1.94.4
python3-tvault-horizon-plugin
deb package
4.1.94.4
s3-fuse-plugin
deb package
4.1.94.4
python3-s3-fuse-plugin
deb package
4.1.94.4
workloadmgr
deb package
4.1.94.14
workloadmgrclient
deb package
4.1.94
dmapi
rpm package
4.1.94.3-4.1
python3-dmapi
rpm package
4.1.94.3-4.1
tvault-contego
rpm package
4.1.94.8-4.1
python3-tvault-contego
rpm package
4.1.94.8-4.1
tvault-horizon-plugin
rpm package
4.1.94.4-4.1
python3-tvault-horizon plugin-el8
rpm package
4.1.94.4-4.1
python-s3fuse-plugin-cent7
rpm package
4.1.94.4-4.1
python3-s3fuse-plugin
rpm package
4.1.94.4-4.1
workloadmgrclient
rpm pacakage
4.1.94
Gitbranch
hotfix-7-TVO/4.1
RHOSP13 containers
4.1.94-hotfix-8-rhosp13
RHOSP16.0 containers
4.1.94-hotfix-8-rhosp16
RHOSP16.1 containers
4.1.94-hotfix-8-rhosp16.1
Kolla Ansible Ussuri containers
4.1.94-hotfix-7-ussuri
Kolla Ansible Victoria containers
4.1.94-hotfix-5-victoria
TripleO Train container
4.1.94-hotfix-5-tripleo
s3fuse
python package
4.1.94.4
tvault-configurator
python package
4.1.94.11
workloadmgr
python package
4.1.94.17
dmapi
deb package
4.1.94.3
python3-dmapi
deb package
4.1.94.3
tvault-contego
deb package
4.1.94.9
python3-tvault-contego
deb package
4.1.94.9
tvault-horizon-plugin
deb package
4.1.94.4
python3-tvault-horizon-plugin
deb package
4.1.94.4
s3-fuse-plugin
deb package
4.1.94.4
python3-s3-fuse-plugin
deb package
4.1.94.4
workloadmgr
deb package
4.1.94.17
workloadmgrclient
deb package
4.1.94
dmapi
rpm package
4.1.94.3-4.1
python3-dmapi
rpm package
4.1.94.3-4.1
tvault-contego
rpm package
4.1.94.9-4.1
python3-tvault-contego
rpm package
4.1.94.9-4.1
tvault-horizon-plugin
rpm package
4.1.94.4-4.1
python3-tvault-horizon plugin-el8
rpm package
4.1.94.4-4.1
python-s3fuse-plugin-cent7
rpm package
4.1.94.4-4.1
python3-s3fuse-plugin
rpm package
4.1.94.4-4.1
workloadmgrclient
rpm pacakage
4.1.94
Gitbranch
hotfix-8-TVO/4.1
RHOSP13 containers
4.1.94-hotfix-9-rhosp13
RHOSP16.0 containers
4.1.94-hotfix-9-rhosp16
RHOSP16.1 containers
4.1.94-hotfix-9-rhosp16.1
RHOSP16.2 containers
4.1.94-hotfix-9-rhosp16.2
Kolla Ansible Ussuri containers
4.1.94-hotfix-8-ussuri
Kolla Ansible Victoria containers
4.1.94-hotfix-6-victoria
TripleO Train container
4.1.94-hotfix-6-tripleo
s3fuse
python package
4.1.94.4
tvault-configurator
python package
4.1.94.7
workloadmgr
python package
4.1.94.9
dmapi
deb package
4.1.94.3
python3-dmapi
deb package
4.1.94.3
tvault-contego
deb package
4.1.94.8
python3-tvault-contego
deb package
4.1.94.8
tvault-horizon-plugin
deb package
4.1.94.4
python3-tvault-horizon-plugin
deb package
4.1.94.4
s3-fuse-plugin
deb package
4.1.94.4
python3-s3-fuse-plugin
deb package
4.1.94.4
workloadmgr
deb package
4.1.94.9
workloadmgrclient
deb package
4.1.94
dmapi
rpm package
4.1.94.3-4.1
python3-dmapi
rpm package
4.1.94.3-4.1
tvault-contego
rpm package
4.1.94.8-4.1
python3-tvault-contego
rpm package
4.1.94.8-4.1
tvault-horizon-plugin
rpm package
4.1.94.4-4.1
python3-tvault-horizon plugin-el8
rpm package
4.1.94.4-4.1
python-s3fuse-plugin-cent7
rpm package
4.1.94.4-4.1
python3-s3fuse-plugin
rpm package
4.1.94.4-4.1
workloadmgrclient
rpm pacakage
4.1.94
Gitbranch
hotfix-5-TVO/4.1
RHOSP13 containers
4.1.94-hotfix-8-rhosp13
RHOSP16.0 containers
4.1.94-hotfix-8-rhosp16
RHOSP16.1 containers
4.1.94-hotfix-8-rhosp16.1
Kolla Ansible Ussuri containers
4.1.94-hotfix-7-ussuri
Kolla Ansible Victoria containers
4.1.94-hotfix-5-victoria
TripleO Train container
4.1.94-hotfix-5-tripleo
s3fuse | python package | 4.1.94.4 |
tvault-configurator | python package | 4.1.94.7 |
workloadmgr | python package | 4.1.94.11 |
dmapi | deb package | 4.1.94.3 |
python3-dmapi | deb package | 4.1.94.3 |
tvault-contego | deb package | 4.1.94.8 |
python3-tvault-contego | deb package | 4.1.94.8 |
tvault-horizon-plugin | deb package | 4.1.94.4 |
python3-tvault-horizon-plugin | deb package | 4.1.94.4 |
s3-fuse-plugin | deb package | 4.1.94.4 |
python3-s3-fuse-plugin | deb package | 4.1.94.4 |
workloadmgr | deb package | 4.1.94.11 |
workloadmgrclient | deb package | 4.1.94 |
dmapi | rpm package | 4.1.94.3-4.1 |
python3-dmapi | rpm package | 4.1.94.3-4.1 |
tvault-contego | rpm package | 4.1.94.8-4.1 |
python3-tvault-contego | rpm package | 4.1.94.8-4.1 |
tvault-horizon-plugin | rpm package | 4.1.94.4-4.1 |
python3-tvault-horizon plugin-el8 | rpm package | 4.1.94.4-4.1 |
python-s3fuse-plugin-cent7 | rpm package | 4.1.94.4-4.1 |
python3-s3fuse-plugin | rpm package | 4.1.94.4-4.1 |
workloadmgrclient | rpm pacakage | 4.1.94 |
Gitbranch | hotfix-7-TVO/4.1 |
RHOSP13 containers | 4.1.94-hotfix-8-rhosp13 |
RHOSP16.0 containers | 4.1.94-hotfix-8-rhosp16 |
RHOSP16.1 containers | 4.1.94-hotfix-8-rhosp16.1 |
Kolla Ansible Ussuri containers | 4.1.94-hotfix-7-ussuri |
Kolla Ansible Victoria containers | 4.1.94-hotfix-5-victoria |
TripleO Train container | 4.1.94-hotfix-5-tripleo |
s3fuse | python package | 4.1.94.7 |
tvault-configurator | python package | 4.1.94.15 |
workloadmgr | python package | 4.1.94.22 |
dmapi | deb package | 4.1.94.3 |
python3-dmapi | deb package | 4.1.94.3 |
tvault-contego | deb package | 4.1.94.10 |
python3-tvault-contego | deb package | 4.1.94.10 |
tvault-horizon-plugin | deb package | 4.1.94.6-4.1 |
python3-tvault-horizon-plugin | deb package | 4.1.94.6 |
s3-fuse-plugin | deb package | 4.1.94.7 |
python3-s3-fuse-plugin | deb package | 4.1.94.7 |
workloadmgr | deb package | 4.1.94.21 |
workloadmgrclient | deb package | 4.1.94 |
dmapi | rpm package | 4.1.94.3-4.1 |
python3-dmapi | rpm package | 4.1.94.3-4.1 |
tvault-contego | rpm package | 4.1.94.10-4.1 |
python3-tvault-contego | rpm package | 4.1.94.10-4.1 |
tvault-horizon-plugin | rpm package | 4.1.94.6-4.1 |
python3-tvault-horizon plugin-el8 | rpm package | 4.1.94.6-4.1 |
python-s3fuse-plugin-cent7 | rpm package | 4.1.94.7-4.1 |
python3-s3fuse-plugin | rpm package | 4.1.94.7-4.1 |
workloadmgrclient | rpm pacakage | 4.1.94 |
RHOSP16.1 Containers | Containers | 4.1.94-hotfix-15-rhosp16.1 |
RHOSP16.2 Containers | Containers | 4.1.94-hotfix-15-rhosp16.2 |
RHOSP13 Containers | Containers | 4.1.94-hotfix-15-rhosp13 |
Kolla Containers | Containers | 4.1.94-hotfix-12-ussuri 4.1.94-hotfix-11-victoria |
TripleO Containers | Containers | 4.1.94-hotfix-11-tripleo |
Gitbranch | hotfix-12-TVO/4.1 |
RHOSP16.1 containers | 4.1.94-hotfix-15-rhosp16.1 |
RHOSP16.2 containers | 4.1.94-hotfix-15-rhosp16.2 |
Kolla Ansible Ussuri containers | 4.1.94-hotfix-11-ussuri |
Kolla Ansible Victoria containers | 4.1.94-hotfix-9-victoria |
TripleO Train container | 4.1.94-hotfix-9-tripleo |
4.1-RHOSP13-CONTAINER | 4.1.94-hotfix-15-rhosp13 |
4.1-KOLLA-CONTAINER | 4.1.94-hotfix-12-ussuri 4.1.94-hotfix-11-victoria |
4.1-TRIPLEO-CONTAINER | 4.1.94-hotfix-11-tripleo |
Tag Reference in Upgrade Docs | Value | Comments |
1 | 4.1-HOTFIX-LABEL | hotfix-13-TVO/4.1 | Label against the Trilio repositories from where required code to be pulled for upgrades. |
2 | 4.1-RHOSP13-CONTAINER | 4.1.94-hotfix-16-rhosp13 | RHOSP13 Container tag against 4.1.HF13 |
3 | 4.1-RHOSP16.1-CONTAINER | 4.1.94-hotfix-16-rhosp16.1 | RHOSP16.1 Container tag against 4.1.HF13 |
4 | 4.1-RHOSP16.2-CONTAINER | 4.1.94-hotfix-16-rhosp16.2 | RHOSP16.2 Container tag against 4.1.HF13 |
5 | 4.1-KOLLA-CONTAINER | 4.1.94-hotfix-13-ussuri 4.1.94-hotfix-12-victoria | Kolla Container tag against 4.1.HF13 |
6 | 4.1-TRIPLEO-CONTAINER | 4.1.94-hotfix-12-tripleo | TripleO Container tag against 4.1.HF13 |
Summary |
1 | horizon logs getting dumped with errors |
2 | T4O 4.1 vulnerability reported by Fortinet |
3 | All the network ports of a project are deleted in case Restore Network Topology fails |
Package/Container Names | Package Kind | Package/Container Version/Tags |
1 | dmapi | deb | 4.1.94.3 |
2 | dmapi | rpm | 4.1.94.3-4.1 |
3 | python3-dmapi | rpm | 4.1.94.3-4.1 |
4 | python3-dmapi | deb | 4.1.94.3 |
5 | tvault-contego | rpm | 4.1.94.10-4.1 |
6 | tvault-contego | deb | 4.1.94.10 |
7 | python3-tvault-contego | deb | 4.1.94.10 |
8 | python3-tvault-contego | rpm | 4.1.94.10-4.1 |
9 | s3fuse | python | 4.1.94.7 |
10 | s3-fuse-plugin | deb | 4.1.94.7 |
11 | python3-s3-fuse-plugin | deb | 4.1.94.7 |
12 | python3-s3fuse-plugin | rpm | 4.1.94.7-4.1 |
13 | python-s3fuse-plugin-cent7 | rpm | 4.1.94.7-4.1 |
Package/Container Names | Package Kind | Package/Container Version/Tags |
1 | workloadmgr | deb | 4.1.95.22 |
2 | workloadmgr | python | 4.1.94.23 |
3 | tvault_configurator | python | 4.1.94.17 |
4 | tvault-horizon-plugin | rpm | 4.1.94.7-4.1 |
5 | tvault-horizon-plugin | deb | 4.1.94.7 |
6 | python3-tvault-horizon-plugin | deb | 4.1.94.7 |
7 | python3-tvault-horizon-plugin-el8 | rpm | 4.1.94.7-4.1 |
8 | RHOSP16.1 Containers | Containers | 4.1.94-hotfix-16-rhosp16.1 |
9 | RHOSP16.2 Containers | Containers | 4.1.94-hotfix-16-rhosp16.2 |
10 | RHOSP13 Containers | Containers | 4.1.94-hotfix-16-rhosp13 |
11 | Kolla Containers | Containers | 4.1.94-hotfix-13-ussuri 4.1.94-hotfix-12-victoria |
12 | TripleO Containers | Containers | 4.1.94-hotfix-12-tripleo |
1 | Shell Script | NO | Scoped out since TVO-4.1 |
2 | Ansible (Openstack native) | YES | For Kolla & Openstack ansible |
3 | Debian Packages | YES | Used on Ubuntu based distro via all TVault Deployment methods |
4 | RPM Packages | YES | Used on RH based distro via all TVault Deployment methods |
5 | RH Director | YES | For RHOSP |
6 | TripleO | YES | For TripleO |
7 | Juju/Charms | YES | For Canonical Openstack |
1 | AWS S3 | NO |
2 | NFS | YES |
3 | RH Ceph S3 | YES |
4 | Wasabi S3 | NO |
1 |
2 | restore fails for SRIOV network (Fixed in 4.1.HF7; documenting single scenario) | If port_security_enabled=False on the network , restore will pass and user can attach security group to the restored vm network port later after restore is done. |
3 | [Intermittent] All API calls are getting stuck. Note: Respective steps added to common T4O upgrade document. | Set oslo.messaging package version to 12.1.6 on all T4O nodes.
|
4 | Contego package installation failing on HF5 OSA with S3. Note : Respective steps added to common T4O components upgrade document against OSA distro. | Before contego package upgrade unmount /var/triliovault-mounts path |
5 | In-place restore not working properly with multiattach volume | Select all the VM's boot disk as well as cinder multiattach disk on the in-place restore window. Restore will work fine for all the VM |
6 | Snapshot mount only shows volume group/LVM for one VM when 2 or more VMs have volume group with same name | NA |
7 | Snapshot Disk Integrity Check Disabled for 4.1.HF1 release. Impact
| None |
8 | Backup and restore should not break for instances with multi-attach volumes. | After upgrade from 4.1 GA to 4.1HF1 , snapshots which trigger just after upgrade for workloads having multi-attach volume would be of “mixed” type after that all snapshots will be of incremental types . |
9 | [FRM] Snapshot mount not working | Update permissions of NFS mount point to 755 on the NFS server and retry snapshot mount operation. {noformat}chmod 755 /mnt/tvault/tvm4{noformat} |
10 | [Intermittent] [RHOSP 16.1] [Horizon] After the overcloud deployment, openstack UI messed UP. | Login to the Horizon container and run the following commands:
|
11 | [DR] Selective restore fails, If original image is deleted in canonical focal-victoria environment | None |
s3fuse | python package | 4.1.94.6 |
tvault-configurator | python package | 4.1.94.11 |
workloadmgr | python package | 4.1.94.18 |
dmapi | deb package | 4.1.94.3 |
python3-dmapi | deb package | 4.1.94.3 |
tvault-contego | deb package | 4.1.94.9 |
python3-tvault-contego | deb package | 4.1.94.9 |
tvault-horizon-plugin | deb package | 4.1.94.4 |
python3-tvault-horizon-plugin | deb package | 4.1.94.4 |
s3-fuse-plugin | deb package | 4.1.94.6 |
python3-s3-fuse-plugin | deb package | 4.1.94.6 |
workloadmgr | deb package | 4.1.94.18 |
workloadmgrclient | deb package | 4.1.94 |
dmapi | rpm package | 4.1.94.3-4.1 |
python3-dmapi | rpm package | 4.1.94.3-4.1 |
tvault-contego | rpm package | 4.1.94.9-4.1 |
python3-tvault-contego | rpm package | 4.1.94.9-4.1 |
tvault-horizon-plugin | rpm package | 4.1.94.4-4.1 |
python3-tvault-horizon plugin-el8 | rpm package | 4.1.94.4-4.1 |
python-s3fuse-plugin-cent7 | rpm package | 4.1.94.6-4.1 |
python3-s3fuse-plugin | rpm package | 4.1.94.6-4.1 |
workloadmgrclient | rpm pacakage | 4.1.94 |
Gitbranch | hotfix-10-TVO/4.1 |
RHOSP13 containers | 4.1.94-hotfix-10-rhosp13 |
RHOSP16.0 containers | 4.1.94-hotfix-10-rhosp16 |
RHOSP16.1 containers | 4.1.94-hotfix-10-rhosp16.1 |
RHOSP16.2 containers | 4.1.94-hotfix-10-rhosp16.2 |
Kolla Ansible Ussuri containers | 4.1.94-hotfix-9-ussuri |
Kolla Ansible Victoria containers | 4.1.94-hotfix-7-victoria |
TripleO Train container | 4.1.94-hotfix-7-tripleo |
s3fuse | python package | 4.1.94.5 |
tvault-configurator | python package | 4.1.94.11 |
workloadmgr | python package | 4.1.94.17 |
dmapi | deb package | 4.1.94.3 |
python3-dmapi | deb package | 4.1.94.3 |
tvault-contego | deb package | 4.1.94.9 |
python3-tvault-contego | deb package | 4.1.94.9 |
tvault-horizon-plugin | deb package | 4.1.94.4 |
python3-tvault-horizon-plugin | deb package | 4.1.94.4 |
s3-fuse-plugin | deb package | 4.1.94.5 |
python3-s3-fuse-plugin | deb package | 4.1.94.5 |
workloadmgr | deb package | 4.1.94.17 |
workloadmgrclient | deb package | 4.1.94 |
dmapi | rpm package | 4.1.94.3-4.1 |
python3-dmapi | rpm package | 4.1.94.3-4.1 |
tvault-contego | rpm package | 4.1.94.9-4.1 |
python3-tvault-contego | rpm package | 4.1.94.9-4.1 |
tvault-horizon-plugin | rpm package | 4.1.94.4-4.1 |
python3-tvault-horizon plugin-el8 | rpm package | 4.1.94.4-4.1 |
python-s3fuse-plugin-cent7 | rpm package | 4.1.94.5-4.1 |
python3-s3fuse-plugin | rpm package | 4.1.94.5-4.1 |
workloadmgrclient | rpm pacakage | 4.1.94 |
Gitbranch | hotfix-9-TVO/4.1 |
RHOSP13 containers | 4.1.94-hotfix-9-rhosp13 |
RHOSP16.0 containers | 4.1.94-hotfix-9-rhosp16 |
RHOSP16.1 containers | 4.1.94-hotfix-9-rhosp16.1 |
RHOSP16.2 containers | 4.1.94-hotfix-9-rhosp16.2 |
Kolla Ansible Ussuri containers | 4.1.94-hotfix-8-ussuri |
Kolla Ansible Victoria containers | 4.1.94-hotfix-6-victoria |
TripleO Train container | 4.1.94-hotfix-6-tripleo |
RHOSP 16.2 | Yes | Yes | RHEL8 | NFSv3 | Supported |
RHOSP 16.1 | Yes | Yes | Yes | Yes | RHEL8 | NFSv3 | Supported | Red hat Director |
RHOSP 16.0 | Yes | Yes | Yes | RHEL8 | NFSv3 | Supported | Red Hat Director |
RHOSP 13 | Yes | Yes | Yes | Yes | RHEL7 | NFSv3 | Supported | Red Hat Director |
Canonical Victoria | Yes | Yes | Yes | Yes | Ubuntu 20.04 | NFSv3 | Supported | JuJu Charms |
Canonical Ussuri | Yes | Yes | Yes | Yes | Ubuntu 18.04/20.04 | NFSv3 | Supported | JuJu Charms |
Canonical Train | Yes | Yes | Yes | Yes | Ubuntu 18.04 | NFSv3 | Supported | JuJu Charms |
Canonical Stein | Yes | Yes | Yes | Yes | Ubuntu 18.04 | NFSv3 | Supported | JuJu Charms |
Canonical Queens | Yes | Yes | Yes | Yes | Ubuntu 18.04 | NFSv3 | Supported | JuJu Charms |
Ansible Openstack Victoria | Yes | Yes | Yes | Ubuntu 20.04 | NFSv3 | Supported | Ansible |
Ansible Openstack Ussuri | Yes | Yes | Yes | Yes | Ubuntu 18.04/20.04 | NFSv3 | Supported | Ansible |
Kolla Openstack Victoria | Yes | Yes | Yes | Ubuntu 20.04, CentOS Stream | NFSv3 | Supported | Ansible |
Kolla Openstack Ussuri | Yes | Yes | Yes | Yes | Ubuntu 18.04 | NFSv3 | Supported | Ansible |
TripleO Train | Yes | Yes | Yes | CentOS7 | NFSv3 | Supported | Heat |
s3fuse | python package | 4.1.94.7 |
tvault-configurator | python package | 4.1.94.11 |
workloadmgr | python package | 4.1.94.20 |
dmapi | deb package | 4.1.94.3 |
python3-dmapi | deb package | 4.1.94.3 |
tvault-contego | deb package | 4.1.94.10 |
python3-tvault-contego | deb package | 4.1.94.10 |
tvault-horizon-plugin | deb package | 4.1.94.4 |
python3-tvault-horizon-plugin | deb package | 4.1.94.4 |
s3-fuse-plugin | deb package | 4.1.94.7 |
python3-s3-fuse-plugin | deb package | 4.1.94.7 |
workloadmgr | deb package | 4.1.94.20 |
workloadmgrclient | deb package | 4.1.94 |
dmapi | rpm package | 4.1.94.3-4.1 |
python3-dmapi | rpm package | 4.1.94.3-4.1 |
tvault-contego | rpm package | 4.1.94.10-4.1 |
python3-tvault-contego | rpm package | 4.1.94.10-4.1 |
tvault-horizon-plugin | rpm package | 4.1.94.4-4.1 |
python3-tvault-horizon plugin-el8 | rpm package | 4.1.94.4-4.1 |
python-s3fuse-plugin-cent7 | rpm package | 4.1.94.7-4.1 |
python3-s3fuse-plugin | rpm package | 4.1.94.7-4.1 |
workloadmgrclient | rpm pacakage | 4.1.94 |
Gitbranch | hotfix-11-TVO/4.1 |
RHOSP16.0 containers | 4.1.94-hotfix-13-rhosp16 |
RHOSP16.1 containers | 4.1.94-hotfix-13-rhosp16.1 |
RHOSP16.2 containers | 4.1.94-hotfix-13-rhosp16.2 |
Kolla Ansible Ussuri containers | 4.1.94-hotfix-11-ussuri |
Kolla Ansible Victoria containers | 4.1.94-hotfix-9-victoria |
TripleO Train container | 4.1.94-hotfix-9-tripleo |
4.1 HF10+ | 16.2 16.1 13 | Victoria Ussuri Train Stein Queens | Victoria Ussuri | Victoria Ussuri | Train |
4.1 HF8+ | 16.2 16.1 16.0 13 | Victoria Ussuri Train Stein Queens | Victoria Ussuri | Victoria Ussuri | Train |
4.1 HF3+ | 16.1 16.0 13 | Victoria Ussuri Train Stein Queens | Victoria Ussuri | Victoria Ussuri | Train |
4.1 GA | 16.1 16.0 13 | Ussuri Train Stein Queens | Ussuri | Ussuri |
For Canonical Openstack it is not necessary to spin up the Trilio VM.
The Trilio Appliance is delivered as qcow2 image and runs as VM on top of a KVM Hypervisor.
This guide shows the tested way to spin up the Trilio Appliance on a RHV Cluster. Please contact a RHV Administrator and Trilio Customer Success Agent in case of incompatibility with company standards.
The Trilio appliance is utilizing cloud-init to provide the initial network and user configuration.
Cloud-init is reading it's information either from a metadata server or from a provided cd image. Trilio is utilizing the cd image.
To create the cloud-init image it is required to have genisoimage available.
Cloud-init is using two files for it's metadata.
The first file is called meta-data
and contains the information about the network configuration.
Below is an example of this file.
Keep the hostname localhost. The hostname gets changed through the configuration step. Changing the hostname will lead to the tvault-config service not properly starting, blocking further configuration.
The instance-id has to match the VM name in virsh
The second file is called user-data
and contains little scripts and information to set up for example the user passwords.
Below is an example of this file.
Both files meta-data and user-data are needed. Even when one is empty, is it needed to create a working cloud-init image.
The image is getting created using genisoimage follwing this general command:
genisoimage -output <name>.iso -volid cidata -joliet -rock </path/user-data> </path/meta-data>
An example of this command is shown below.
The Trilio Appliance qcow2 image can be downloaded from the Trilio customer portal. Please contact your Trilio sales or technical lead to get access to the portal.
After the cloud-init image has been created the TriloVault appliance can be spun up on the desired KVM server.
Extract the Trilio QCOW2 tar file using the following command :
See below an example command, how to spin up the Trilio appliance using virsh and the created iso image.
It is of course possible to spin up the Trilio appliance without a cloud-init iso-image. It will spin up with default values.
Once the Trilio appliance is up and running with it's initial configuration is it recommended to uninstall cloud-init.
If cloud-init is not installed it will rerun the network configuration upon every boot. Setting the network configuration back to DHCP, if no metadata is provided.
To uninstall cloud-init, follow the example below.
It is recommended to directly update the Trilio appliance to the latest version.
To do so follow the minor update guide provided here:
It is recommended to think about the following elements prior to the installation of Trilio for Openstack.
Trilio uses Cinder snapshots for calculating full and incremental backups. For full backups, Trilio creates Cinder snapshots for all the volumes in the backup job. It then leaves these Cinder snapshots behind for calculating the incremental backup image during next backup. During an incremental backup operation it creates new Cinder snapshots, calculates the changed blocks between the new snapshots and the old snapshots that were left behind during full/previous backups. It then deletes the old snapshots but leaves the newly created snapshots behind. So, it is important that each tenant who is availing Trilio backup functionality has sufficient Cinder snapshot quotas to accommodate these additional snapshots. The guideline is to add 2 snapshots for every volume that is added to backups to volume snapshot quotas for that tenant. You may also increase the volume quotas for the tenant by the same amount because Trilio briefly creates a volume from snapshot to read data from the snapshot for backup purposes. During a restore process, Trilio creates additional instances and Cinder volumes. To accommodate restore operations, a tenant should have sufficient quota for Nova instances and Cinder volumes. Otherwise restore operations will result in failures.
AWS S3 object consistency model includes:
Read-after-write
Read-after-update
Read-after-delete
Each of them describes how an object will reach its consistent state after an object is created/updated or deleted. None of them provides strong consistency and there is a lag time for an object to reach the consistent state. Though Trilio employed mechanisms to work around the limitations of eventual consistency of AWS S3, when an object reach its consistency state is not deterministic. There is no official statement from AWS on how long it takes for an object to reach consistent state. However read-after-write has a shorter time to reach consistency compared to other IO patterns. Our solution is designed to maximize read-after-write IO pattern. The time in which an object reaches eventual consistency also depends on the AWS region. For example, aws-standard region does not have strong consistency model compared to us-east or us-west. We suggest to use these regions when creating s3 buckets for Trilio. Though read-after-update IO pattern is hard to avoid completely, we employed ample delays in accessing objects to accommodate larger durations for objects to get into consistent state. However in rare occasions, backups may still fail and need to restarted.
Trilio can be deployed as a single node or a three node cluster. It is highly recommended that Trilio is deployed as three node cluster for fault tolerance and load balancing. Starting with 3.0 release, Trilio requires additional IP for cluster and is required for both single node and three node deployments. Cluster ip a.k.a virtual ip is used for managing cluster and is used to register Trilio service endpoint in the keystone sevice catalog.
Once the Trilio VM or the Cluster of Trilio VMs has been spun, the installation process can begin. This process contains the following steps:
Install the Trilio dm-api service on the control plane.
Install the Trilio datamover service on the compute plane.
Install the Trilio Horizon plugin into the Horizon service.
How these steps look in detail is dependent on the Openstack distribution that Trilio is installed in. Each supported Openstack distribution has its own deployment tools. Trilio integrates into these deployment tools to provide a native integration from the beginning to the end.
Artifacts | Reference |
1 | Release Date | Oct 5, 2022 |
2 | Debian URL |
3 | RPM URL |
4 | PIP URL |
Trilio integrates natively with Openstack. This includes that Trilio communicates completely through APIs using the Openstack Endpoints. Trilio is also generating its own Openstack endpoints. In addition, is the Trilio appliance and the compute nodes writing to and reading from the backup target. These points affect the network planning for the Trilio installation.
Openstack knows 3 types of endpoints:
Public Endpoints
Internal Endpoints
Admin Endpoints
Each of these endpoint types is designed for a specific purpose. Public endpoints are meant to be used by the Openstack end-users to work with Openstack. Internal endpoints are meant to be used by the Openstack services to communicate with each other. Admin endpoints are meant to be used by Openstack administrators.
Out of those 3 endpoint types does only the admin endpoint sometimes contain APIs which are not available on any other endpoint type.
To learn more about Openstack endpoints please visit the official Openstack documentation.
Trilio is communicating with all services of Openstack on a defined endpoint type. Which endpoint type Trilio is using to communicate with Openstack is decided during the configuration of the Trilio appliance.
There is one exception: The Trilio Appliance always requires access to the Keystone admin endpoint.
The following network requirement can be identified this way:
Trilio appliance needs access to the Keystone admin endpoint on the admin endpoint network
Trilio appliance needs access to all endpoints of one type
Trilio is recommending providing full access to all Openstack endpoints to the Trilio appliance to follow the Openstack standards and best practices.
Trilio is generating its own endpoints as well. These endpoints are pointing towards the Trilio Appliance directly. This means that using those endpoints will not send the API calls towards the Openstack Controller nodes first, but directly to the Trilio VM.
Following the Openstack standards and best practices, it is therefore recommended to put the Trilio endpoints on the same networks as the already existing Openstack endpoints. This allows to extend the purpose of each endpoint type to the Trilio service:
The public endpoint to be used by Openstack users when using Trilio CLI or API
The internal endpoint to communicate with the Openstack services
The admin endpoint to use the required admin only APIs of Keystone
The Trilio solution is using backup target storage to securely place the backup data. Trilio is dividing its backup data into two parts:
Metadata
Volume Disk Data
The first type of data is generated by the Trilio appliance through communicating with the Openstack Endpoints. All metadata that is stored together with a backup is written by the Trilio Appliance to the backup target in the json format.
The second type of data is generated by the Trilio Datamover service running on the compute nodes. The Datamover service is reading the Volume Data from the Cinder or Nova storage and transferring this data as qcow2 image to the backup target. Each Datamover service is hereby responsible for the VMs running on its compute node.
The network requirements are therefor:
The Trilio appliance needs access to the backup target
Every compute node needs access to the backup target
Most Trilio customers are following the Openstack standards and best practices to have the public, internal, and admin endpoints on separate networks. They also typically don't have any network yet, which can access the desired backup target.
The starting network configuration typically looks like this:
Following the Openstack standards and Trilio's recommendation will the Trilio Appliance be placed on all those 3 networks. Further is the access to the backup target required by Trilio Appliance and Compute nodes. Here done by adding a 4th network.
The resulting network configuration would look like this:
It is of course possible to combine networks as necessary. As long as the required network access is available will Trilio work.
Each Openstack installation is different and so is the network configuration. There are endless possibilities of how to configure the Openstack network and how to implement the Trilio appliance into this network. The following three examples have been seen in production:
The first example is from a manufacturing company, which wanted to split the networks by function and decided to put the Trilio backup target on the internal network as the backup and recovery function was identified as an Openstack internal solution. This example looks complex but integrates Trilio just as recommended.
The second example is from a financial institute that wanted to be sure that the Openstack Users have no direct uncontrolled network access to the Openstack infrastructure. Following this example requires additional work as the internal HA-Proxy needs to be configured to correctly translates the API calls towards the Trilio
The third example is from a service company that was forced to treat Trilio as an external 3rd party solution, as we require a virtual machine running outside of Openstack. This kind of network configuration requires good planning on the Trilio endpoints and firewall rules.
Trilio and Canonical have started a partnership to ensure a native deployment of Trilio using JuJu Charms.
Those JuJu Charms are publicly available as Open Source Charms.
Trilio is not providing the JuJu Charms to deploy Trilio 4.1 in Canonical Openstack. These are developed and maintained by Canonical.
Canonical Openstack doesn't require the Trilio Cluster. The required services are installed and managed via JuJu Charms.
The following charms exist:
trilio-wlm Installs and manages Trilio Controller services.
trilio-dm-api Installs and manages the Trilio Datamover API service.
trilio-data-mover Installs and manages the Trilio Datamover service.
trilio-horizon-plugin Installs and manages the Trilio Horizon Plugin.
The documentation of the charms can be found here:
Trilio has four main software components:
Trilio ships as a QCOW2 image. User can instantiate one or more VMs from the QCOW2 image on a standalone KVM boxes.
Trilio API is a python module that is an extension to nova api service. This module is installed on all OpenStack controller nodes
Trilio Datamover is a python module that is installed on every OpenStack compute nodes
Trilio horizon plugin is installed as an add on to horizon servers. This module is installed on every server that runs horizon service.
The Trilio Appliance is not supported as an instance inside Openstack.
The Trilio Appliance gets delivered as a qcow2 image, which gets attached to a virtual machine.
Trilio supports KVM-based hypervisors on x86 architectures, with the following properties:
Software | Supported |
---|---|
The recommended size of the VM for the Trilio Appliance is:
When running Trilio in production, a 3-node cluster of the Trilio appliance is recommended for high availability and load balancing.
Ressource | Value |
---|---|
The qcow2 image itself defines the 40GB disk size of the VM.
In the rare case of the Trilio Appliance database or log files getting larger than 40GB disk, contact or open a ticket with Trilio Customer Success to attach another drive to the Trilio Appliance.
In addition to the Trilio Appliance does Trilio contain components, which are installed directly into the Openstack itself.
Each Openstack distribution comes with a set of supported operating systems. Please check the support matrix to see, which Openstack Distribution is supported with which Operating System.
Additional it is necessary to have the nfs-common
packages installed on the compute nodes in case of using the NFS protocol for the backup target.
The installation of Trilio for Openstack on Kolla Victoria with Trilio 4.1 is following this procedure:
Deplo T4O-4.1 GA Appliance
Upgrade to 4.1 HF5 or higher on the appliance
Deploy Trilio components of 4.1 HF5 or higher on the Kolla Openstack Victoria
Configure the Trilio appliance
Please follow to spin up the base Trilio 4.1GA appliance.
Trilio supports Kolla Victoria from 4.1HF5 onwards, so it is recommended to upgrade to the latest available hotfix on 4.1 to make deployment successful. Please follow to upgrade the appliance to the latest 4.1 Hotfix.
Run the deployment of the components following using the following values:
Variable | Value |
---|
The is the supported and recommended method to deploy and maintain any RHOSP installation. Trilio integrates natively into the RHOSP Director and manual deployment methods are not supported for RHOSP.
Backup target storage is used to store backup images taken by Trilio and also associated configuration needs. The following backup target types are supported by Trilio:
Backup Target Types | Required Configuration |
---|
The overcloud-deploy command must already have been run successfully prior to this point and overcloud should be available. Perform the following steps for 'undercloud' node on an existing RHOSP environment:
All commands need to be run as user 'stack' on undercloud node
RHOSP 16.0 is not supported anymore as RedHat has officially stopped supporting it. However, Trilio maintained it for some time and stopped the support from 4.1HF11 onwards. The latest hotfix available for RHOSP16.0 is 41.HF10. Reach out to the Support team for any help.
Ensure that the Trilio appliance connected to this installation is on the latest Hotfix version. Failure to ensure this may lead to your installation not working as expected.
Refer to this doc :
Run the following command to clone the triliovault-cfg-scripts github repository:
``
If your backup target type is 'Ceph-based S3' with SSL, skip this step. Otherwise, access the Red Hat Director scripts according to the RHOSP version being used:
RHOSP 13 - cd triliovault-cfg-scripts/redhat-director-scripts/rhosp13/
RHOSP 16.1 - cd triliovault-cfg-scripts/redhat-director-scripts/rhosp16.1/
RHOSP 16.2 - cd triliovault-cfg-scripts/redhat-director-scripts/rhosp16.2/
If your backup target is Ceph S3 with SSL and your SSL certificates are self-signed or authorized by private CA, you must provide the CA chain certificate to validate the SSL requests. Otherwise, skip this step. To do this:
Rename your CA chain cert file to s3-cert.pem.
Copy the renamed file into the following directory:
triliovault-cfg-scripts/redhat-director-scripts/redhat-director-scripts/<RHOSP_RELEASE_Directory>/puppet/trilio/files
If your overcloud deploy command uses any other deploy artifact through an environment file, then you must merge Trilio deploy artifact url and your url in a single file.
Then access the Red Hat Director scripts according to the version being used:
RHOSP 13 - cp s3-cert.pem /home/stack/triliovault-cfg-scripts/redhat-director-scripts/rhosp13/puppet/trilio/files/
RHOSP 16.1 - cp s3-cert.pem /home/stack/triliovault-cfg-scripts/redhat-director-scripts/rhosp16.1/puppet/trilio/files/
RHOSP 16.2 - cp s3-cert.pem /home/stack/triliovault-cfg-scripts/redhat-director-scripts/rhosp16.2/puppet/trilio/files/
From this point onwards in the documentation, only the following path will be used for examples: cd triliovault-cfg-scripts/redhat-director-scripts/rhosp16.1/
The following commands upload the Trilio puppet module to the overcloud registry. The upload only happens upon the next deployment.
Step 1: -
The output of the above command looks like the following.
Trilio puppet module is uploaded to overcloud as a swift deploy artifact with heat resource name 'DeployArtifactURLs'.
Step 2: - Check Trilio's Puppet module artifact file and ensure that it looks like the following:
Step 3: -
Firstly, check to make sure that your overcloud deploy environment files uses deploy artifacts. To do this check string DeployArtifactURLs in your environment files (only those mentioned in the overcloud deploy command with -e option). If you find any environment file with the -e option, then your deploy command is using deploy artifacts.
If your deploy command is using deploy artifact, you must merge all deploy artifacts in a single file. For example, if your artifact file path is /home/stack/templates/user-artifacts.yaml, then perform the following steps to merge both urls in single file, which is passed to the overcloud deploy command with the -e option.
Trilio contains multiple services. Add these services to your roles_data.yaml.
In the case of uncustomized roles_data.yaml can the default file be found on the undercloud at:
/usr/share/openstack-tripleo-heat-templates/roles_data.yaml
Add the following services to the roles_data.yaml
All commands need to be run as user 'stack'
This service needs to share the same role as the keystone
and database
service.
In case of the pre-defined roles will these services run on the role Controller
.
In case of custom-defined roles, it is necessary to use the same role where OS::TripleO::Services::Keystone
service installed.
Add the following line to the identified role:
This service needs to share the same role as the nova-compute
service.
In case of the pre-defined roles will the nova-compute
service run on the role Compute
.
In case of custom-defined roles, it is necessary to use the role that the nova-compute
service is using.
Add the following line to the identified role:
All commands need to be run as user 'stack'
Trilio containers are pushed to 'RedHat Container Registry'.
Registry URL: registry.connect.redhat.com
Container pull URLs are given below.
Please note that using the hotfix containers requires that the Trilio Appliance is getting upgraded to the desired hotfix level as well.
Refer to the word <HOTFIX-TAG-VERSION> as 4.1.94-hotfix-16 in the below sections
There are three registry methods available in the RedHat OpenStack Platform.
Remote Registry
Local Registry
Satellite Server
Follow this section when 'Remote Registry' is used.
In this method, container images get downloaded directly on overcloud nodes during overcloud deploy/update command execution. User can set the remote registry to the RedHat registry or any other private registry that he wants to use.
The user needs to provide credentials for the registry in containers-prepare-parameter.yaml
file.
Make sure other OpenStack service images are also using the same method to pull container images. If it's not the case you can not use this method.
Populate containers-prepare-parameter.yaml
with content like following. Important parameters are 'push_destination: false',
ContainerImageRegistryLogin: true and registry credentials.
Trilio container images are published to the registry registry.connect.redhat.com
Credentials of registry 'registry.redhat.io' will work for registry.connect.redhat.com
registry too.
Note: This file - containers-prepare-parameter.yaml
Note: File 'containers-prepare-parameter.yaml' gets created as output of command 'openstack tripleo container image prepare'. Refer above document by RedHat
3. Make sure you have network connectivity to the above registries from all overcloud nodes. Otherwise, image pull operation will fail.
4. Populate the trilio_env.yaml with Trilio container image URLs:
Trilio Datamover container
Trilio Datamover API container
Trilio Horizon Plugin
trilio_env.yaml will be available in
Follow this section when 'local registry' is used on the undercloud.
In this case, it is necessary to push the Trilio containers to the undercloud registry. Trilio provides shell scripts that will pull the containers from 'registry.connect.redhat.com' and push them to the undercloud and update the trilio_env.yaml.
Verify the changes
Verify the changes:
Verify the changes
The changes can be verified using the following commands.
Follow this section when a Satellite Server is used for the container registry.
Populate the trilio_env.yaml with container urls.
Provide backup target details and other necessary details in the provided environment file. This environment file will be used in the overcloud deployment to configure Trilio components. Container image names have already been populated in the preparation of the container images. Still it is recommended to verify the container URLs.
The following information are required additionally:
Network for the datamover api
datamover password
Backup target type {nfs/s3}
In case of NFS
list of NFS Shares
NFS options
In case of S3
S3 type {amazon_s3/ceph_s3}
S3 Access key
S3 Secret key
S3 Region name
S3 Bucket
S3 Endpoint URL
S3 Signature Version
S3 Auth Version
S3 SSL Enabled {true/false}
S3 SSL Cert
Use ceph_s3 for any non-aws S3 backup targets.
The existing default haproxy configuration works fine with most of the environments. Only when timeout issues with the dmapi are observed or other reasons are known, change the configuration as described here.
Following is the haproxy conf file location on haproxy nodes of the overcloud. Trilio datamover api service haproxy configuration gets added to this file.
Trilio datamover haproxy default configuration from the above file looks as follows:
The user can change the following configuration parameter values.
To change these default values, you need to do the following steps. i) On the undercloud node, open the following file for edit (Edit <RHOSP_RELEASE> with your cloud's release information. Valid values are - rhosp13, rhosp16, rhosp16.1)
For RHOSP13
For RHOSP16.0
For RHOSP16.1
For RHOSP16.2
ii) Search the following entries and edit as required
iii) Save the changes.
Use the following heat environment file and roles data file in overcloud deploy command:
trilio_env.yaml
roles_data.yaml
Use the correct Trilio endpoint map file as per available Keystone endpoint configuration
Instead of tls-endpoints-public-dns.yaml
file, use environments/trilio_env_tls_endpoints_public_dns.yaml
Instead of tls-endpoints-public-ip.yaml
file, useenvironments/trilio_env_tls_endpoints_public_ip.yaml
Instead of tls-everywhere-endpoints-dns.yaml
file, useenvironments/trilio_env_tls_everywhere_dns.yaml
To include new environment files use '-e' option and for roles data file use '-r' option. An example overcloud deploy command is shown below:
If the containers are in restarting state or not listed by the following command then your deployment is not done correctly. Please recheck if you followed the complete documentation.
Make sure Trilio dmapi and horizon containers are in a running state and no other Trilio container is deployed on controller nodes. When the role for these containers is not "controller" check on respective nodes according to configured roles_data.yaml.
Verify the haproxy configuration under:
Make sure Trilio datamover container is in running state and no other Trilio container is deployed on compute nodes.
Make sure horizon container is in running state. Please note that 'Horizon' container is replaced with Trilio Horizon container. This container will have latest OpenStack horizon + Trilio's horizon plugin.
In RHOSP, nova
user id on nova-compute docker container is set to '42436'. The 'nova' user id on the Trilio nodes needs to be set the same. Do the following steps on all Trilio nodes:
Download the shell script that will change the user id
Assign executable permissions
Execute the script
Verify that nova
user and group id have changed to '42436'
Trilio components will be deployed using puppet scripts.
Backup target storage is used to store backup images taken by Trilio and details needed for configuration:
Following backup target types are supported by Trilio
a) NFS
Need NFS share path
b) Amazon S3
- S3 Access Key - Secret Key - Region - Bucket name
c) Other S3 compatible storage (Like, Ceph based S3)
- S3 Access Key - Secret Key - Region - Endpoint URL (Valid for S3 other than Amazon S3) - Bucket name
The following steps are to be done on 'undercloud' node on an already installed RHOSP environment. The overcloud-deploy command has to be run successfully already and the overcloud should be available.
All commands need to be run as user 'stack' on undercloud node
The following command clones the triliovault-cfg-scripts github repository.
Please note that the Trilio Appliance needs to get updated to hf3 as well.
If your backup target is ceph S3 with SSL and SSL certificates are self signed or authorized by private CA, then user needs to provide CA chain certificate to validate the SSL requests. For that, user needs to rename his ca chain cert file to 's3-cert.pem' and copy it into directory - 'triliovault-cfg-scripts/redhat-director-scripts/redhat-director-scripts/<RHOSP_RELEASE___Directory/puppet/trilio/files'
Trilio contains multiple services. Add these services to your roles_data.yaml.
In the case of uncustomized roles_data.yaml can the default file be found on the undercloud at:
/usr/share/openstack-tripleo-heat-templates/roles_data.yaml
Add the following services to the roles_data.yaml
All commands need to be run as user 'stack'
This service needs to share the same role as the keystone
and database
service.
In case of the pre-defined roles will these services run on the role Controller
.
In case of custom defined roles, it is necessary to use the same role where 'OS::TripleO::Services::Keystone' service installed.
Add the following line to the identified role:
This service needs to share the same role as the nova-compute
service.
In case of the pre-defined roles will the nova-compute
service run on the role Compute
.
In case of custom defined roles, it is necessary to use the role the nova-compute
service is using.
Add the following line to the identified role:
All commands need to be run as user 'stack'
Refer to the below-mentioned value of the respective placeholder in this document. HOTFIX-TAG-VERSION : 4.1.94-hotfix-12-tripleo
Trilio containers are pushed to 'Dockerhub'. Registry URL: 'docker.io'. Container pull URLs are given below.
There are two registry methods available in TripleO Openstack Platform.
Remote Registry
Local Registry
Follow this section when 'Remote Registry' is used.
For this method it is not necessary to pull the containers in advance. It is only necessary to populate the trilio_env.yaml file with the Trilio container URLs from Dockerhub registry.
Populate the trilio_env.yaml with container URLs for:
Trilio Datamover container
Trilio Datamover api container
Trilio Horizon Plugin
trilio_env.yaml will be available in
__triliovault-cfg-scripts/redhat-director-scripts/tripleo-train/environments
Follow this section when 'local registry' is used on the undercloud.
Run the following script. Script pulls the triliovault containers and updates the triliovault environment file with URLs.
Acceptable values for the below two parameters:
OS_platform: [centos7, centos8]
container_tool_available_on_undercloud: [docker, podman]
The changes can be verified using the following commands.
Fill triliovault details in file - '/home/stack/triliovault-cfg-scripts/redhat-director-scripts/tripleo-train/environments/trilio_env.yaml', triliovault environment file is self explanatory. Fill details of backup target, verify image urls and other details.
Use the following heat environment file and roles data file in overcloud deploy command
trilio_env.yaml: This environment file contains Trilio backup target details and Trilio container image locations
roles_data.yaml: This file contains overcloud roles data with Trilio roles added.
Use the correct trilio endpoint map file as per your keystone endpoint configuration.
- Instead of tls-endpoints-public-dns.yaml
this file, use ‘environments/trilio_env_tls_endpoints_public_dns.yaml’
- Instead of tls-endpoints-public-ip.yaml
this file, use ‘environments/trilio_env_tls_endpoints_public_ip.yaml’
- Instead of tls-everywhere-endpoints-dns.yaml
this file, use ‘environments/trilio_env_tls_everywhere_dns.yaml’
Deploy command with triliovault environment file looks like following.
If the containers are in restarting state or not listed by the following command then your deployment is not done correctly. Please recheck if you followed the complete documentation.
Make sure Trilio dmapi and horizon containers are in a running state and no other Trilio container is deployed on controller nodes. When the role for these containers is not "controller" check on respective nodes according to configured roles_data.yaml.
Verify the haproxy configuration under:
Make sure Trilio datamover container is in running state and no other Trilio container is deployed on compute nodes.
Make sure horizon container is in running state. Please note that 'Horizon' container is replaced with Trilio Horizon container. This container will have latest OpenStack horizon + Trilio's horizon plugin.
The installation of Trilio for Openstack on Kolla Victoria with Trilio 4.1 is following this procedure:
Deplo T4O-4.1 GA Appliance
Upgrade to 4.1 HF5 packages on the appliance
Deploy Trilio components on Openstack Victoria
Update Trilio components on Openstack Victora
Configure the Trilio appliance
Please follow to spin up the base Trilio 4.1GA appliance.
Trilio supports Ansible OpenStack Victoria from 4.1HF5 onwards, so it is recommended to upgrade to the latest available hotfix on 4.1 to make deployment successful. Please follow to upgrade the appliance to the latest 4.1 Hotfix.
Run the deployment of the components following using the following values:
Variable | Value |
---|
Change parameterOPENSTACK_DIST
in the file/etc/openstack_deploy/user_tvault_vars.yml
to victoria
Please ensure that the Trilio Appliance has been updated to the latest hotfix before continuing the installation.
Trilio is by default using the nova user id and group id 997:998 Ansible Openstack is not always 'nova' user id 162 on nova-compute containers. The 'nova' user id on the Trilio nodes need to be set the same as in the nova-compute containers. Do the following steps on all Trilio nodes in case of nova id not being 162:162:
Download the shell script that will change the user-id
Assign executable permissions
Edit script to use the correct nova id
Execute the script
Verify that 'nova' user and group id has changed to the desired value
Clone triliovault-cfg-scripts from github repository on Ansible Host.
Available values for <branch>:
Copy Ansible roles and vars to required places.
In case of installing on OSA Victora edit OPENSTACK_DIST in the file /etc/openstack_/user_tvault_vars.yml to Victoria
Add Trilio playbook to /opt/openstack-ansible/playbooks/setup-openstack.yml
at the end of the file.
Add the following content at the end of the file /etc/openstack_deploy/user_variables.yml
Create the following file /opt/openstack-ansible/inventory/env.d/tvault-dmapi.yml
Edit the file /etc/openstack_deploy/openstack_user_config.yml
according to the example below to set host entries for Trilio components.
Edit the common editable parameter section in the file /etc/openstack_deploy/user_tvault_vars.yml
Append the required details like Trilio Appliance IP address, Trilio package version, Openstack distribution, snapshot storage backend, SSL related information, etc.
The possible package versions are:
GA Trilio 4.1: 4.1.94
Run the following commands to deploy only Trilio components in case of an already deployed Ansible Openstack.
If Ansible Openstack is not already deployed then run the native Openstack deployment commands to deploy Openstack and Trilio Components together. An example for the native deployment command is given below:
Verify triliovault datamover api service deployed and started well. Run the below commands on controller node(s).
Verify triliovault datamover service deployed and started well on compute node(s). Run the following command oncompute node(s).
Verify that triliovault horizon plugin, contegoclient, and workloadmgrclient are installed on the Horizon container.
Run the following command on Horizon container.
Verify that haproxy setting on controller node using below commands.
After the deployment has been verified it is recommended to update to the latest hotfix to ensure the best possible experience.
This page lists all steps required to deploy Trilio components on Kolla-ansible deployed OpenStack cloud.
Please ensure that the Trilio Appliance has been updated to the latest hotfix before continuing the installation.
Refer to the below-mentioned acceptable values for the placeholders in this document as per the Openstack environment: kolla_base_distro : ubuntu / centos triliovault_tag : 4.1.94-hotfix-13-ussuri / 4.1.94-hotfix-12-victoria
Backup target storage is used to store backup images taken by Trilio and details needed for configuration:
Following backup target types are supported by Trilio. Select one of them and get it ready before proceeding to the next step.
a) NFS
Need NFS share path
b) Amazon S3
- S3 Access Key - Secret Key - Region - Bucket name
c) Other S3 compatible storage (Like, Ceph based S3)
- S3 Access Key - Secret Key - Region - Endpoint URL (Valid for S3 other than Amazon S3) - Bucket name
Clone triliovault-cfg-scripts GitHub repository on Kolla ansible server at '/root' or any other directory of your preference. Afterward, copy the Trilio Ansible role into the Kolla-ansible roles directory
Append triliovault_passwords.yml
to /etc/kolla/passwords.yml
. Passwords are empty. Set these passwords manually in the /etc/kolla/passwords.yml
.
Edit /etc/kolla/passwords.yml
, go to the end of the file and set trilio passwords.
Edit /etc/kolla/globals.yml
file to fill Trilio backup target and build details.
You will find the Trilio related parameters at the end of globals.yml
file.
Details like Trilio build version, backup target type, backup target details, etc need to be filled out.
Following is the list of parameters that the usr needs to edit.
In the case of a different registry than docker hub, Trilio containers need to be pulled from docker.io and pushed to preferred registries.
Following are the triliovault container image URLs. Replace kolla_base_distro and triliovault_tag variables with their values.
To enable Trilio's Snapshot mount feature it is necessary to make the Trilio Backup target available to the nova-compute and nova-libvirt containers.
Edit /usr/local/share/kolla-ansible/ansible/roles/nova-cell/defaults/main.yml
and find nova_libvirt_default_volumes
variable. Append the Trilio mount bind /var/trilio:/var/trilio:shared
to the list of already existing volumes.
For a default Kolla installation, will the variable look as follows afterward:
Next, find the variable nova_compute_default_volumes
in the same file and append the mount bind /var/trilio:/var/trilio:shared
to the list.
After the change will the variable look for a default Kolla installation as follows:
In case of using Ironic compute nodes, one more entry needs to be adjusted in the same file.
Find the variable nova_compute_ironic_default_volumes
and append trilio mount /var/trilio:/var/trilio:shared
to the list.
After the changes the variable will look like the following:
Pull the Trilio container images from the Dockerhub based on the existing inventory file. In the example is the inventory file named multinode
.
All that is left, is to run the deploy command using the existing inventory file. In the example is the inventory file named 'multinode'.
This is just an example command. You need to use your cloud deploy command.
Verify on the nodes that are supposed to run the Trilio containers, that those are available and healthy.
To see all TriloVault containers running on a specific node use the docker ps command.
To check the startup logs use the docker logs <container name> command.
Verify that the Trilio Appliance is configured. The Horizon tabs are only shown when a configured Trilio appliance is available.
Verify that the Trilio horizon container is installed and in a running state.
Trilio datamover api service logs on datamover api node
Trilio datamover service logs on datamover node
Note: This step needs to be done on Trilio Appliance node. Not on OpenStack node.
Pre-requisite: You should have already launched Trilio appliance VM
In Kolla OpenStack distribution, nova
user id on nova-compute docker container is set to '42436'. The nova
user id on the Trilio nodes needs to be set the same. Do the following steps on all Trilio nodes:
Download the shell script that will change the user id
Assign executable permissions
Execute the script
Verify that nova
user and group id have changed to '42436'
After this step, you can proceed to the 'Configuring Trilio' section.
deb [trusted=yes] /
Please follow to configure the upgraded Trilio 4.1 appliance.
Redhat document for remote registry method:
Pull the Trilio containers on the Red Hat Satellite using the given
In case if the overcloud deployment is failing, do the following command to provide the list of errors. The following document also provides valuable insights:
Please follow to spin up the base Trilio 4.1GA appliance.
Trilio supports TripleO Train from 4.1HF5 onwards, so it is recommended to upgrade to the latest available hotfix on 4.1 to make deployment successful. Please follow to upgrade the appliance to the latest 4.1 Hotfix.
Please follow to configure the upgraded Trilio 4.1 appliance.
Follow to update the packages on the OpenStack environment.
Please follow to configure the upgraded Trilio 4.1 appliance.
Openstack Version | Branch |
---|
To update the environment follow .
Parameter | Defaults/choices | comments |
---|
libvirt
2.0.0 and above
QEMU
2.0.0 and above
qemu-img
2.6.0 and above
vCPU
8
RAM
24 GB
Ussuri | hotfix-13-TVO/4.1 |
Victoria | hotfix-13-TVO/4.1 |
triliovault_tag | <triliovault_tag> | Container tags. Use ussuri tagged containers for Ussuri and victoria tagged containers for Victoria |
horizon_image_full | Keep Default | By default will the Trilio Horizon container not get deployed. Uncomment this parameter to deploy Trilio Horizon container instead of Openstack Horizon container. |
triliovault_docker_username | triliodocker | default docker user of Trilio (read permission only) |
triliovault_docker_password | triliopassword | password for default docker user of Trilio |
triliovault_docker_registry | Default value: docker.io | Edit this value if a different container registry for Trilio containers is to be used. Containers need to be pulled from docker.io and pushed to chosen registry first. |
triliovault_backup_target |
|
|
triliovault_nfs_shares | <NFS-IP/FQDN>:/<NFS path> | NFS share path example: ‘192.168.122.101:/nfs/tvault’ |
triliovault_nfs_options |
| These parameter set NFS mount options. Keep default values, unless a special requirement exists. |
triliovault_s3_access_key | S3 Access Key | Valid for |
triliovault_s3_secret_key | S3 Secret Key | Valid for |
triliovault_s3_region_name |
| Valid for If s3 storage doesn't have region parameter keep default |
triliovault_s3_bucket_name | S3 Bucket name | Valid for |
triliovault_s3_endpoint_url | S3 Endpoint URL | Valid for |
triliovault_s3_ssl_enabled |
| Valid for Set true for SSL enabled S3 endpoint URL |
triliovault_s3_ssl_cert_file_name | s3-cert.pem | Valid for OR issued by a private authority.
In this case, copy the directory on ansible server. Create this directory if it does not exist already. |
triliovault_copy_ceph_s3_ssl_cert |
| Valid for Set to True when: SSL enabled with self-signed certificates or issued by a private authority. |
Branch | hotfix-13-TVO/4.1 |
Tag | 4.1.94-hotfix-12-victoria |
NFS |
|
Amazon S3 |
|
Other S3 compatible storage, e.g. Ceph-based S3 |
|
Branch | hotfix-13-TVO/4.1 |
It is possible to configure Cinder and Ceph to use different Ceph users for different Ceph pools and Cinder volume types. Or to have the nova boot volumes and cinder block volumes controlled by different users.
In the case of multiple Ceph users, it is required to adopt the keyring extension in the tvault-contego.conf inside the Ceph block.
The following example will try all files with the extension keyring that are located inside /etc/ceph to access the Ceph cluster for a Trilio related task.
The uninstallation of Trilio is depending on the Openstack Distribution it is installed in. The high-level process is the same for all Distributions.
Uninstall the Horizon Plugin or the Trilio Horizon container
Uninstall the datamover-api container
Uninstall the datamover
Delete the Trilio Appliance Cluster
After the installation and configuration of Trilio for Openstack did succeed the following steps can be done to verify that the Trilio installation is healthy.
Trilio is using 4 main services on the Trilio Appliance:
wlm-api
wlm-scheduler
wlm-workloads
wlm-cron
Those can be verified to be up and running using the systemctl status
command.
The second component to check the Trilio Appliance's health is the nginx and pacemaker cluster.
Checking the availability of the Trilio API on the chosen endpoints is recommended.
The following example curl command lists the available workload-types and verifies that the connection is available and working:
Please check the API guide for more commands and how to generate the X-Auth-Token.
The dmapi service has its own Keystone endpoints, which should be checked in addition to the actual service status.
In order to check the dmapi service go to dmapi container which is residing on controller nodes and run below command
The datamover service is running on each compute node. Logging to compute node and run below command
The dmapi service has its own Keystone endpoints, which should be checked in addition to the actual service status.
Run the following command on “nova-api” nodes and make sure “triliovault_datamover_api” container is in started state.
Run the following command on "nova-compute" nodes and make sure the container is in a started state.
Run the following command on horizon nodes and make sure the container is in a started state.
Run the following command on MAAS nodes and make sure all trilio units like trilio-data-mover
, trilio-dm-api
, trilio-horizon-plugin
, trilio-wlm
are in active state
Make sure the Trilio dmapi and horizon containers (shown below) are in a running state and no other Trilio container is deployed on controller nodes. If the containers are in restarting state or not listed by the following command then your deployment is not done correctly. Please note that the 'Horizon' container is replaced with the Trilio Horizon container. This container will have the latest OpenStack horizon + Trilio's horizon plugin.
Make sure the Trilio datamover container (shown below) is in a running state and no other Trilio container is deployed on compute nodes. If the containers are in restarting state or not listed by the following command then your deployment is not done correctly.
Please check dmapi endpoints on overcloud node.
It is possible to configure Cinder to have multiple configurations and keyrings for CEPH.
In this case, the Trilio Datamover file needs to be extended with the CEPH information.
For Trilio to be able to work in such an environment it is required to put copies of each of these configurations and keyrings into a separate directory, which is then made known to the Trilio Datamover inside a [ceph]
block in the tvault-contego.conf.
A tvault-contego.conf file with the extended [ceph] block would look like this.
Ceph is the most common OpenSource solution to provide block storage through OpenStack Cinder.
Ceph is a very flexible solution. The possibilities of Ceph require additional steps to the Trilio solution.
Trilio is not providing the JuJu Charms to deploy Trilio 4.1 in Canonical Openstack. At the time of release are the JuJu Charms not yet updated to Trilio 4.1. We will update this page once the Charms are available.
The following steps need to be run on all nodes, which have the Trilio Datamover API service running. Those nodes can be identified by checking the roles_data.yaml
for the role that contains the entry OS::TripleO::Services::TrilioDatamoverApi
.
Once the role that runs the Trilio Datamover API service has been identified will the following commands clean the nodes from the service.
Run all commands as root or user with sudo permissions.
Stop trilio_dmapi
container.
Remove trilio_dmapi
container.
Clean Trilio Datamover API service conf directory.
Clean Trilio Datamover API service log directory.
The following steps need to be run on all nodes, which have the Trilio Datamover service running. Those nodes can be identified by checking the roles_data.yaml
for the role that contains the entry OS::TripleO::Services::TrilioDatamover
.
Once the role that runs the Trilio Datamover service has been identified will the following commands clean the nodes from the service.
Run all commands as root or user with sudo permissions.
Stop trilio_datamover
container.
Remove trilio_datamover
container.
Unmount Trilio Backup Target on compute host.
Clean Trilio Datamover service conf directory.
Clean log directory of Trilio Datamover service.
The following steps need to be run on all nodes, which have the haproxy service running. Those nodes can be identified by checking the roles_data.yaml
for the role that contains the entry OS::TripleO::Services::HAproxy
.
Once the role that runs the haproxy service has been identified will the following commands clean the nodes from all Trilio resources.
Run all commands as root or user with sudo permissions.
Edit the following file inside the haproxy container and remove all Trilio entries.
/var/lib/config-data/puppet-generated/haproxy/etc/haproxy/haproxy.cfg
An example of these entries is given below.
Restart the haproxy container once all edits have been done.
Trilio registers services and users in Keystone. Those need to be unregistered and deleted.
Trilio creates a database for the dmapi service. This database needs to be cleaned.
Login into the database cluster
Run the following SQL statements to clean the database.
Remove the following entries from roles_data.yaml used in the overcloud deploy command.
OS::TripleO::Services::TrilioDatamoverApi
OS::TripleO::Services::TrilioDatamover
In the case that the overcloud deploy command used prior to the deployment of Trilio is still available, it can directly be used.
Follow these steps to clean the overcloud deploy command from all Trilio entries.
Remove trilio_env.yaml entry
Remove trilio endpoint map file Replace with original map file if existing
Run the cleaned overcloud deploy command.
List all VMs running on the KVM node
Destroy the Trilio VMs
Undefine the Trilio VMs
Delete the TrlioVault VM disk from KVM Host storage
The Trilio Ansible OpenStack playbook can be run to uninstall the Trilio services.
To cleanly remove the Trilio Datamover API container run the following Ansible playbook.
Remove the tvault-dmapi_hosts
and tvault_compute_hosts
entries from /etc/openstack_deploy/openstack_user_config.yml
Remove Trilio Datamover API settings from /etc/openstack_deploy/user_variables.yml
Go inside galera container.
Login as root user in mysql database engine.
Drop dmapi database.
Drop dmapi user
Go inside rabbitmq container.
Delete dmapi user.
Delete dmapi vhost.
Remove /etc/haproxy/conf.d/datamover_service
file.
Remove HAproxy configuration entry from /etc/haproxy/haproxy.cfg
file.
Restart the HAproxy service.
List all VMs running on the KVM node
Destroy the Trilio VMs
Undefine the Trilio VMs
Delete the TrlioVault VM disk from KVM Host storage
Please ensure the following points are met before starting the upgrade process:
No Snapshot or Restore is running
Global job scheduler is disabled
wlm-cron is disabled on the Trilio Appliance
The following sets of commands will disable the wlm-cron service and verify that it is has been completly shut-down.
All commands need to be run as user 'stack' on undercloud node
The Trilio appliance connected to this installation needs to be of version 4.1 HF10
Separate directories are created as per Redhat OpenStack release under 'triliovault-cfg-scripts/redhat-director-scripts/' directory. Use all scripts/templates from respective directory. For ex, if your RHOSP release is 13, then use scripts/templates from 'triliovault-cfg-scripts/redhat-director-scripts/rhosp13' directory only.
Available RHOSP_RELEASE___DIRECTORY values are:
rhosp13 rhosp16.1 rhosp16.2
RHOSP 16.0 is not supported anymore as RedHat has officially stopped supporting it. However, Trilio maintained it for some time and stopped the support from 4.1HF11 onwards. The latest hotfix available for RHOSP16.0 is 41.HF10. Reach out to the Support team for any help.
If your backup target is ceph S3 with SSL and SSL certificates are self signed or authorized by private CA, then user needs to provide CA chain certificate to validate the SSL requests. For that, user needs to rename his ca chain cert file to 's3-cert.pem' and copy it into the puppet directory of the right release.
Trilio has two services as explained below.
You need to add these two services to your roles_data.yaml.
If you do not have customized roles_data file, you can find your default roles_data.yaml file at /usr/share/openstack-tripleo-heat-templates/roles_data.yaml
on undercloud.
You need to find that role_data file and edit it to add the following Trilio services.
i) Trilio Datamover Api Service:
Service Entry in roles_data yaml: OS::TripleO::Services::TrilioDatamoverApi
This service needs to be co-located with database and keystone services. That said, you need to add this service on the same role as of keystone and database service.
Typically this service should be deployed on controller nodes where keystone and database runs.
If you are using RHOSP's pre-defined roles, you need to addOS::TripleO::Services::TrilioDatamoverApi
service to Controller role
.
ii) Trilio Datamover Service:
Service Entry in roles_data yaml: OS::TripleO::Services::TrilioDatamover
This service should be deployed on role where nova-compute
service is running.
If you are using RHOSP's pre-defined roles, you need to add our OS::TripleO::Services::TrilioDatamover
service to Compute role
.
If you have defined your custom roles, then you need to identify the role name where in 'nova-compute' service is running and then you need to add 'OS::TripleO::Services::TrilioDatamover' service to that role.
All commands need to be run as user 'stack'
Refer to the word <HOTFIX-TAG-VERSION> as 4.1.94-hotfix-16 in the below sections
Trilio containers are pushed to 'RedHat Container Registry'. Registry URL is 'registry.connect.redhat.com'. The Trilio container URLs are as follows:
There are three registry methods available in RedHat OpenStack Platform.
Remote Registry
Local Registry
Satellite Server
Please refer to the following overview to see which containers are available.
Follow this section when 'Remote Registry' is used.
For this method, it is not necessary to pull the containers in advance. It is only necessary to populate the trilio_env.yaml file with the Trilio container URLs from Redhat registry.
Populate the trilio_env.yaml with container URLs for:
Trilio Datamover container
Trilio Datamover api container
Trilio Horizon Plugin
trilio_env.yaml will be available in
__triliovault-cfg-scripts/redhat-director-scripts/<RHOSP_RELEASE_DIRECTORY>/environments
Please refer to the this overview to see which containers are available.
Follow this section when 'local registry' is used on the undercloud.
In this case it is necessary to push the Trilio containers to the undercloud registry. Trilio provides shell scripts which will pull the containers from 'registry.connect.redhat.com' and push them to the undercloud and updates the trilio_env.yaml.
Verify the changes
Verify the changes:
Verify the changes
The changes can be verified using the following commands.
Please refer to the following overview to see which containers are available.
Follow this section when a Satellite Server is used for the container registry.
Pull the Trilio containers on the Red Hat Satellite using the given Red Hat registry URLs.
Populate the trilio_env.yaml with container urls.
It is recommended to re-populate the backup target details in the freshly downloaded trilio_env.yaml file. This will ensure that parameters that have been added since the last update/installation of Trilio are available and will be filled out too.
Locations of the trilio_env.yaml:
For more details about the trilio_env.yaml please check here.
Use the following heat environment file and roles data file in overcloud deploy command:
trilio_env.yaml
roles_data.yaml
Use correct Trilio endpoint map file as per available Keystone endpoint configuration
Instead of tls-endpoints-public-dns.yaml
file, use environments/trilio_env_tls_endpoints_public_dns.yaml
Instead of tls-endpoints-public-ip.yaml
file, useenvironments/trilio_env_tls_endpoints_public_ip.yaml
Instead of tls-everywhere-endpoints-dns.yaml
file, useenvironments/trilio_env_tls_everywhere_dns.yaml
To include new environment files use '-e' option and for roles data file use '-r' option. An example overcloud deploy command is shown below:
If the containers are in restarting state or not listed by the following command then your deployment is not done correctly. Please recheck if you followed the complete documentation.
Make sure Trilio dmapi and horizon containers are in a running state and no other Trilio container is deployed on controller nodes. When the role for these containers is not "controller" check on respective nodes according to configured roles_data.yaml.
Make sure Trilio datamover container is in running state and no other Trilio container is deployed on compute nodes.
Make sure horizon container is in running state. Please note that 'Horizon' container is replaced with Trilio Horizon container. This container will have latest OpenStack horizon + Trilio's horizon plugin.
After the Trilio VM has been configured and all components are installed can the license be applied.
The license can be applied either through the admin-tab in Horizon or the CLI
To apply the license through Horizon follow these steps:
Login to Horizon using admin user.
Click on Admin Tab.
Navigate to Backups-Admin
Navigate to Trilio
Navigate to License
Click "Update License"
Click "Choose File"
choose license-file on client system
click "Apply"
The container needs to be cleaned on all nodes where the triliovault_datamover_api
container is running.
The Kolla Openstack inventory file helps to identify the nodes with the service.
Following steps need to be done to clean the triliovault_datamover_api
container:
Stop the triliovault_datamover_api
container.
Remove the triliovault_datamover_api
container.
Clean /etc/kolla/triliovault-datamover-api
directory.
Clean log directory of triliovault_datamover_api
container.
The container needs to be cleaned on all nodes where the triliovault_datamover
container is running.
The Kolla Openstack inventory file helps to identify the nodes with the service.
Following steps need to be done to clean the triliovault_datamover
container:
Stop the triliovault_datamover
container.
Remove the triliovault_datamover
container.
Clean /etc/kolla/triliovault-datamover
directory.
Clean log directory of triliovault_datamover
container.
The Trilio Datamover API entries need to be cleaned on all haproxy
nodes.
The Kolla Openstack inventory file helps to identify the nodes with the service.
Following steps need to be done to clean the haproxy
container:
Delete all Trilio related entries from:
To cross-verify the uninstallation undo all steps done in append Kolla Ansible yml files and clone Trilio Ansible role.
Trilio entries can be found in:
Run deploy command to replace the Trilio Horizon container with original Kolla Ansible Horizon container.
Trilio created a dmapi service with dmapi user.
Trilio Datamover API service has its own database in the Openstack database.
Login to Openstack database as root user or user with similar priviliges.
Delete dmapi database and user.
List all VMs running on the KVM node
Destroy the Trilio VMs
Undefine the Trilio VMs
Delete the TrlioVault VM disk from KVM Host storage
The workloadmgr CLI client is provided as rpm and deb packages.
It got tested against the following operating systems:
CentOS7, CentOS8
Ubuntu 18.04, Ubuntu 20.04
Installing the workloadmgr client will automatically install all required Openstack clients as well.
Further will the installation of the workloadmgr client integrate the client into the global openstack python client, if available.
The required connection strings and package names can be found on the Trilio Dashboard under the Downloads tab.
The Trilio workload manager CLI client has several requirements that need to be met before the client can be installed without dependency issues.
The following steps need to be done to prepare the installation the workloadmgr client:
Add required repositories
epel-release
for CentOS7: centos-release-openstack-stein
for CentOS8: centos-release-openstack-train
install base packages
yum -y install epel-release
for CentOS7: yum -y install centos-release-openstack-stein
for CentOS8: yum -y install centos-release-openstack-train
These repositories are required to fulfill the following dependencies:
On CentOS7 Python2: python-pbr,python-prettytable,python2-requests,python2-simplejson,python2-six,pytz,PyYAML,python2-openstackclient
On CentOS8 Python3: python3-pbr,python3-prettytable,python3-requests,python3-simplejson,python3-six,python3-pyyaml,python3-pytz,python3-openstackclient
There are 2 possibilities for how the workloadmgr client packages can be installed.
The Trilio appliance is shipping the workloadmgr client version, that is matching the Trilio version of the Trilio appliance. These clients will always work with their respective Trilio versions.
The workloadmgr client can be directly downloaded using the following command:
For CentOS7:
wget http://<TVM-IP>:8085/yum-repo/queens/workloadmgrclient-<Trilio-Version>-<Trilio-Release>.noarch.rpm
For CentOS8: http://<TVM-IP>:8085/yum-repo/queens/python3-workloadmgrclient-<Trilio-Version>-<TVault-Release>.noarch.rpm
To identify the Trilio Version and Trilio release login into the Trilio Dashboard and check the upper left corner.
The yum package manager is used to install the workloadmgr client package:
yum install workloadmgrclient-<Trilio-Version>-<Trilio-Release>.noarch.rpm
An example installation can be found below:
To install the latest available workloadmgr package for a Trilio release from the Trilio repository the following steps need to be done:
Create the Trilio yum repository file /etc/yum.repos.d/trilio.repo
Enter the following details into the repository file:
Install the workloadmgr client issuing the following command:
For CentOS7: yum install workloadmgrclient
For CentOS8: yum install python-3-workloadmgrclient-el8
An example installation can be found below:
The Trilio workloadmgr client packages for Ubuntu are only available from the online repository.
There is no preparation required. All dependencies are automatically resolved by the standard repositories provided by Ubuntu.
There are 2 possibilities for how the workloadmgr client packages can be installed.
The Trilio appliance is shipping the workloadmgr client version, that is matching the Trilio version of the Trilio appliance. These clients will always work with their respective Trilio versions.
The workloadmgr client can be directly downloaded using the following command:
For Python2:
curl -Og6 http://<TVM-IP>:8085/deb-repo/deb-repo/python-workloadmgrclient_<Trilio-Version>_all.deb
For Python3:curl -Og6 http://<TVM-IP>:8085/deb-repo/deb-repo/python3-workloadmgrclient_<Trilio-Version>_all.deb
o identify the Trilio Version and Trilio release login into the Trilio Dashboard and check the upper left corner.
The apt package manager is used to install the workloadmgr client package:
For Python2:apt-get install ./python-workloadmgrclient_<Trilio-Version>_all.deb -y
For Python3:apt-get install ./python3-workloadmgrclient_<Trilio-Version>_all.deb -y
An example installation can be found below:
To install the latest available workloadmgr package for a Trilio release from the Trilio repository the following steps need to be done:
Create the Trilio yum repository file /etc/apt/sources.list.d/fury.list
Enter the following details into the repository file:
run apt update
to make the new repository available.
The apt package manager is used to install the workloadmgr client package:
For Python2:apt-get install python-workloadmgrclient
For Python3:apt-get install python3-workloadmgrclient
An example installation can be seen below:
The offline upgrade of the Trilio Appliance is only recommended for hotfix upgrades. For major upgrades in offline environments, it is recommended to download the latest qcow2 image and redeploy the appliance.
Please ensure to complete the upgrade of all the TVault components on the Openstack controller & compute nodes before starting the rolling upgrade of TVO.
The mentioned gemfury repository should be accessible from a VM/Server.
Please ensure the following points before starting the upgrade process:
No snapshot OR restore to be running.
Global job-scheduler should be disabled.
wlm-cron should be disabled & any lingering process should be killed. (This should already have been done during Trilio components upgrade on Openstack)
pcs resource disable wlm-cron
Check: systemctl status wlm-cron OR pcs resource show wlm-cron
Additional step: To ensure that cron is actually stopped, search for any lingering processes against wlm-cron and kill them. [Cmd : ps -ef | grep -i workloadmgr-cron]
VM/Server must have internet connectivity and connectivity to Trilio gemfury repo
Download latest pip package
Export the index URL
Download s3fuse package
Download tvault-configurator dependent package
Download workloadmgr and dependent package
Download workloadmgrclient package
Download contegoclient package
Download oslo.messaging package
All downloaded packages need to be copied from VM/server to all the TVM nodes.
Copy all the downloaded packages(listed below) from the VM/server to all the TVM nodes
pip
s3fuse
tvault-configurator
workloadmgr
workloadmgrclient
contegoclient
If any of the packages are already on the latest, the upgrade won’t happen. Make sure you should be present at the right dir which means run the below commands from where there all packages should be present
Please refer to the versions of the downloaded packages for the placeholder <HF_VERSION> in the below sections.
Take a backup of the configuration files
Activate the virtual environment
Run the following command on all TVM nodes to upgrade the pip package
Run the following command on all TVM nodes to upgrade s3fuse
Run the following command on all TVM nodes to upgrade tvault-configurator
Run the upgrade command on all TVM nodes to upgrade workloadmgr
Run the upgrade command on all TVM nodes to upgrade workloadmgrclient
Run the upgrade command on all TVM nodes to upgrade contegoclient
Using the latest available oslo.messaging version can lead to stuck RPC and API calls.
It is therefore required to fix the oslo.messaging version on the TVM.
Verify if the upgrade successfully completed or not.
And match the versions with the respective latest downloaded versions.
Restore the backed-up configuration files
Restart following services on all node(s) using respective commands
tvault-object-store restart required only if Trilio is configured with S3 backend storage
Enable wlm-cron service on primary node through pcs cmd, if T4O is configured with Openstack
Enable Global Job Scheduler
Verify the status of the services, if T4O is configured with Openstack.
tvault-object-store will run only if TVault is configured with S3 backend storage
Additional check for wlm-cron on the primary node, if T4O is configured with Openstack_._
Check the mount point using the “df -h” command if T4O is configured with Openstack
Due to the new installation method of Trilio for Kolla OpenStack, it is required to reinstall the Trilio components running on the Kolla Openstack nodes when upgrading from Trilio 4.0.
The Trilio appliance can be upgraded as documented.
Trilio 4.1 can be upgraded without reinstallation to a higher version of T4O if available.
Refer to the below-mentioned acceptable values for the placeholders in this document as per the Openstack environment: kolla_base_distro : ubuntu / centos triliovault_tag : 4.1.94-hotfix-13-ussuri / 4.1.94-hotfix-12-victoria
Please ensure the following points are met before starting the upgrade process:
Either 4.1 GA OR any hotfix patch against 4.1 should be already deployed
No Snapshot OR Restore is running
Global job scheduler should be disabled
wlm-cron is disabled on the primary Trilio Appliance
Access to the gemfury repository to fetch new packages
The following sets of commands will disable the wlm-cron service and verify that it is has been completly shut-down.
Before the latest configuration script is loaded it is recommended to take a backup of the existing config scripts' folder & Trilio ansible roles. The following command can be used for this purpose:
Clone the latest configuration scripts of the required branch and access the deployment script directory for Kolla Ansible Openstack. Available branches to upgrade T4O 4.1 are:
Copy the downloaded Trilio ansible role into the Kolla-Ansible roles directory.
This step is not always required. It is recommended to comparetriliovault_globals.yml
with the Trilio entries in the/etc/kolla/globals.yml
file.
In case of no changes, this step can be skipped.
This is required, in case of some variable names changed, some new variables have been added, or old variables removed in the latest triliovault_globals.yml
they need to be updated in /etc/kolla/globals.yml
file.
This step is not always required. It is recommended to comparetriliovault_passwords.yml
with the Trilio entries in the/etc/kolla/passwords.yml
file.
In case of no changes, this step can be skipped.
This step is required, when some password variable names have been added, changed, or removed in the latest triliovault_passwords.yml. In this case, the /etc/kolla/passwords.yml needs to be updated.
This step is not always required. It is recommended to comparetriliovault_site.yml
with the Trilio entries in the/usr/local/share/kolla-ansible/ansible/site.yml
file.
In case of no changes, this step can be skipped.
This is required because, in case of some variable names changed, some new variables have been added, or old variables removed in the latest triliovault_site.yml
they need to be updated in /usr/local/share/kolla-ansible/ansible/site.yml
file.
This step is not always required. It is recommended to comparetriliovault_inventory.yml
ith the Trilio entries in the/root/multinode
file.
In case of no changes, this step can be skipped.
By default, the triliovault-datamover-api service gets installed on ‘control' hosts and the trilio-datamover service gets installed on 'compute’ hosts. You can edit the T4O groups in the inventory file as per your cloud architecture.
T4O group names are ‘triliovault-datamover-api’ and ‘triliovault-datamover’
Edit '/etc/kolla/globals.yml' file to fill triliovault backup target and build details. You will find the triliovault related parameters at the end of globals.yml file. User needs to fill in details like triliovault build version, backup target type, backup target details, etc.
Following is the list of parameters that the user needs to edit.
This step is already part of the 4.1 GA installation procedure and should only be verified.
To enable Trilio's Snapshot mount feature it is necessary to make the Trilio Backup target available to the nova-compute and nova-libvirt containers.
Edit /usr/local/share/kolla-ansible/ansible/roles/nova-cell/defaults/main.yml
and find nova_libvirt_default_volumes
variable. Append the Trilio mount bind /var/trilio:/var/trilio:shared
to the list of already existing volumes.
For a default Kolla installation, will the variable look as follows afterward:
Next, find the variable nova_compute_default_volumes
in the same file and append the mount bind /var/trilio:/var/trilio:shared
to the list.
After the change will the variable look for a default Kolla installation as follows:
In case of using Ironic compute nodes one more entry need to be adjusted in the same file.
Find the variable nova_compute_ironic_default_volumes
and append trilio mount /var/trilio:/var/trilio:shared
to the list.
After the changes the variable will looks like the following:
In case, the user doesn’t want to use the docker hub registry for triliovault containers during cloud deployment, then the user can pull triliovault images before starting cloud deployment and push them to other preferred registries.
Following are the triliovault container image URLs. Replace kolla_base_distro and triliovault_tag variables with their values
Run the below command from the directory with the multinode file tull pull the required images.
Run the below command from the directory with the multinode file to start the upgrade process.
Verify on the nodes that are supposed to run the Trilio containers, that those are available and healthy.
Following are the default haproxy conf parameters set against triliovault datamover api service.
These values work best for triliovault dmapi service. It’s not recommended to change these parameter values. However, in some exceptional cases, If needed to change any of the above parameter values then same can be done on kolla-ansible server in the following file.
After editing, run kolla-ansible deploy command again to push these changes to openstack cloud.
Post kolla-ansible deploy, to verify the changes, please check following file, available on all controller/haproxy nodes.
Added external database-support Added the Openstack distribution for storage (mount path)
Trilio configuration process is using Ansible scripts. Ansible, in the last few years, has grown in popularity as a preferred configuration management tool and Trilio uses ansible playbooks extensively to configure the Trilio cluster. To troubleshoot Trilio configuration issues, the user should have a basic understanding of Ansible playbook output.
Ansible modules are inherently idempotent and hence Trilio configuration can run any number of times to change or reconfigure Trilio cluster.
Once the VM is booted, point your browser (Chrome or Firefox) to Trilio node IP address.
This will bring you to the Trilio Dashboard, which contains the Trilio configurator.
The user is: admin The default password is: password
After the very first login, you are requested to change the admin password.
Unlike previous versions of Trilio, the current version only requires you to configure the cluster once and the Trilio dashboard provides cluster-wide management capability.
OpenStack endpoints can be configured to use TLS. In such a configuration the Trilio appliance needs to trust the certificates provided by the OpenStack endpoints.
To achieve this trust it is required to upload the OpenStack certificate bundle through the OS API certificate tab of the Trilio appliance Dashboard.
The certificate bundle is located on the controller nodes of the OpenStack installation.
The default paths for each distribution are as follows:
The uploaded certificates can be verified on the Trilio appliance at the following location.
Upon login into an unconfigured Trilio Appliance, the shown page is the configurator. The configurator requires some information about the Trilio Appliance, Openstack, and Backup Storage.
The Trilio Cluster needs to be integrated into an existing environment to be able to operate correctly. This block asks for information about the Trilio Cluster operating details.
Controller Nodes
This is the list of Trilio virtual appliance IP addresses along with their hostnames.
Format: comma-separated list with pairs combined through '='
Example: 172.20.4.151=tvault-104-1,172.20.4.152=tvault-104-2,172.20.4.153=tvault-104-3’
The Trilio Cluster supports only 1 node and 3 node clusters.
Virtual IP Address
This is the Trilio cluster IP address which is mandatory
Format: IP/Subnet
Example: 172.20.4.150/24
The Virtual IP is mandatory even for single-node clusters and has to be different from any IP given at the Controller Nodes.
Name Server
List of nameservers, primarily used to resolve OpenStack service endpoints.
Format: comma-separated list
example: 10.10.10.1,172.20.4.1
If defining OpenStack endpoint hostnames in the /etc/hosts file on the VM is preferred over a DNS solution you may set the nameserver to 0.0.0.0, the default gateway.
Domain Search Order
The domain the Trilio Cluster will use.
Format: comma-separated list
example: trilio.io,trilio.demo
NTP Servers
NTP servers the Trilio Cluster will use
format: comma-separated list
example: 0.pool.ntp.org,10.10.10.10
Timezone
Timezone the Trilio Cluster will use internally
format: pre-populated list
example: UTC
The Trilio appliance integrates with one RHV environment. This block asks for the information required to access and connect with the RHV Cluster.
Keystone URL
The Keystone endpoint used to fetch authentication for configuration
format: URL
example: https://keystone.trilio.io:5000/v3
Endpoint Type
Defines which endpoint type will be used to communicate with the Openstack endpoints
format: predefined list of radio buttons
example: Public
When FQDNs are used for the Keystone endpoints it is necessary to configure at least one DNS server before the configuration.
Otherwise, the validation of the Openstack Credentials will fail.
Domain ID
domain the provided user and tenant are located in
format: ID
example: default
Administrator
Username of an account with the domain admin role
format: String
example: admin
Password
password for the prior provided user
format: String
example: password
Trilio requires domain admin role access. To provide domain admin role to a user, the following command can be used:
openstack role add --domain <domain id> --user <username> admin
The Trilio configurator verifies after every entry if it is possible to login into Openstack using the provided credentials.
This verification will fail until all entries are set and correct.
When the verification is successful it is possible to choose the Admin tenant, the Region, and the Trustee role without any error message shown.
Admin Tenant
The tenant to be used together with the provided user
format: a pre-populated list
example: admin
Region
Openstack Region the user and tenant are located in
format: a pre-populated list
example: RegionOne
Trustee Role
The Openstack role required to be able to use Trilio functionalities
format: a pre-populated list
example: _member_
This block is requesting the necessary information about the backup target that the Trilio installation will be used to store and read backups.
Openstack Dist
RHOSP and Kolla Ansible require a special mount point to be used
format: predefined list
example: RHOSP
Backup Storage
Defines the Backup Storage protocol to use
format: predefined list of radio buttons
example: NFS
NFS Export
The path under which the NFS Volumes to be used can be found
format: comma-separated list of NFS Volumes paths
example: 10.10.2.20:/upstream,10.10.5.100:/nfs2
NFS Options
NFS options used by the Trilio Cluster when mounting the NFS Exports
format: NFS options
example: nolock,soft,timeo=180,intr,lookupcache=none
Please use the predefined NFS Options and only change them when it is know that changes are necessary.
Trilio is testing against the predefined NFS options.
S3 Compatible
Switch between Amazon and other S3 compatible storage solutions
format: predefined list
example: Amazon S3
(S3 compatible) Endpoint URL
URL to be used to reach and access the provided S3 compatible storage
format: URL
example: objects.trilio.io
Access Key
Access Key necessary to login into the S3 storage
format: access key
example: SFHSAFHPFFSVVBSVBSZRF
Secret Key
Secret Key necessary to login into the S3 storage
format: secret key
example: bfAEURFGHsnvd3435BdfeF
Region
Configured Region for the S3 Bucket (keep the default for S3 compatible without Region)
format: String
example: us-east-1
Signature Version
S3 signature version to use for signing into the S3 storage
format: string
example: default
Bucket Name
Name of the bucket to be used as Backup target
format: string
example: Trilio-backup
When using secured connection with a non-aws S3 storage like CEPH you have to provide the certificate used for the connection.
To enter this certificate type the https:// based endpoint into the field Endpoint URL.
Once you tab out of the field will the upload certificate button be shown. See picture below.
Check this box in case of reinitialization or reinstallation of the Trilio Appliance to import all matching Workloads located on the Backup Target.
Workloads that are not assigned to an existing tenant will fail to import and need to be reassigned manually once the configuration is done.
At the end of the configurator is the option to activate the advanced settings.
Activating this option does provide the possibility to configure the Keystone endpoints used for the Datamover API and Trilio.
Trilio generates Keystone endpoints for 2 services. The Trilio Datamover API and the Trilio Workloadmanager.
Modern Openstack installation have the endpoint types split over multiple networks. The advanced settings for the Datamover API endpoints and Trilio Workloadmanager endpoints allow configuring Trilio accordingly.
Used IP addresses are added as additional VIPs to the Trilio cluster.
In the case of FQDN used for those endpoints will the Trilio configurator resolve the FQDN to learn of the IPs that are then set as VIPs.
It is recommended to verify the datamover api settings against the ones configured during installation of the Trilio components.
If these endpoints do already exist in Keystone are the values prefilled and can not be changed. In case of a change required, delete the old Keystone endpoints first.
Providing an URL with https activates the TLS enabled configuration, which requires the upload of certificates and the connected private key.
Trilio allows the use of an external MySQL or MariaDB database.
This database needs to be prepared by creating the empty workloadmgr database, creating the workloadmgr user and setting the right permissions. An example command to create this database would be:
Provide the connection string to the Trilio configurator.
This value can only be set upon an initial configuration of the Trilio solution.
When the Cluster has been configured to use the internal database, then the connection string will not be shown in the next configuration attempt.
In case of an external database, will the connection string be shown, but is uneditable.
Trilio is using a service user that is located in the Openstack service project.
The password for this service user will be generated randomly or can be defined in the advanced settings.
Once all entries have been set and all validations are error-free the configurator can be started.
Click Finish
Reconfirm in the pop-up that you want to start the configuration
Wait for the configurator to finish
Some elements of the configurator take time. Even when it looks like the configurator is stuck, please wait till the configurator finishes. Should the configurator have not finished after 6h, please contact Trilio Support for help.
The configurator is using Ansible and a few Trilio internal API calls. After each configuration block or after the configurator finished it is possible to visit the Ansible output.
At the end of a successful configuration does the configurator forward to the set VIP.
Starting Trilio for Openstack 4.0 does Trilio for Openstack allow in-place upgrades.
The following versions can be upgraded to each other:
Old | New |
---|---|
The upgrade process contains upgrading the Trilio appliance and the Openstack components and is dependent on the underlying operating system.
The Upgrade of Trilio for Canonical Openstack is managed through the charms.
Due to the new installation method of Trilio for Kolla OpenStack, it is required to reinstall the Trilio components running on the Kolla OpenStack nodes when upgrading from Trilio 4.0.
The Trilio appliance can be upgraded as documented .
Trilio 4.1 can be upgraded without reinstallation to a higher version of T4O if available.
Please ensure the following points are met before starting the upgrade process:
No Snapshot or Restore is running
Global job scheduler is disabled
wlm-cron is disabled on the Trilio Appliance
Access to the gemfury repository to fetch new packages
The following sets of commands will disable the wlm-cron service and verify that it is has been completly shut down.
Add the Gemfury repository on each dmapi, horizon containers & compute nodes.
Create file /etc/apt/sources.list.d/fury.list
and add the below line to it.
The following commands can be used to verify the connection to the gemfury repository and to check for available packages.
Add trilio repo on each dmapi, horizon containers & compute nodes.
Modify the file /etc/yum.repos.d/trilio.repo
and add below line in it.
The following commands can be used to verify the connection to the Trilio rpm server and to check for available packages.
The following steps represent the best practice procedure to upgrade the dmapi service.
Login to dmapi container
Take a backup of the dmapi configuration in /etc/dmapi/
use apt list --upgradeable
to identify the package used for the dmapi service
Update the dmapi package
restore the backed-up config files into /etc/dmapi/
Restart the dmapi container
Check the status of the dmapi service
These steps are done with the following commands. This example is assuming that the more common python3 packages are used.
The following steps represent the best practice procedure to update the Horizon plugin.
Login to Horizon Container
use apt list --upgradeable
to identify the package the Trilio packages for the workloadmgrclient, contegoclient and Horizon plugin
Install the tvault-horizon-plugin package in the required python version
install the workloadmgrclient package
install the contegoclient
Restart the Horizon webserver
check the installed version of the workloadmgrclient
These steps are done with the following commands. This example is assuming that the more common python3 packages are used.
The following steps represent the best practice procedure to update the tvault-contego service on the compute nodes.
Login into the compute node
Take a backup of the config files in
(NFS and S3) /etc/tvault-contego/
(S3 only) /etc/tvault-object-store
use apt list --upgradeable
to identify the tvault-contego package used
Unmount backup storage
upgrade the tvault-contego package in the required python version
(S3 only) upgrade the s3-fuse-plugin package
restore the config files into /etc/tvault-contego/
(S3 only) Restart the tvault-object-store service
Restart the tvault-contego service
check the status
These steps are done with the following commands. This example is assuming that the more common python3 packages are used.
Following are the haproxy cfg parameters recommended for optimal performance of dmapi service. File location on controller /etc/haproxy/haproxy.cfg
If values were already updated during any of the previous releases, further steps can be skipped.
Remove below content, if present in the file/etc/openstack_deploy/user_variables.yml
on ansible host.
Add the below lines at end of the file /etc/openstack_deploy/user_variables.yml
on the ansible host.
Update Haproxy configuration using the below command on ansible host.
This describes the upgrade process from Trilio 4.0 or Trilio 4.0SP1 to Trilio 4.1 GA or its hotfix releases.
Kolla Ansible Openstack only: The mount point for the Trilio Backup Target has changed in Trilio 4.1. A reconfiguration after the upgrade is required.
The prerequisites should already be fulfilled from upgrading the Trilio components on the Controller and Compute nodes.
Please ensure to complete the upgrade of all the Trilio components on the Openstack controller & compute nodes before starting the rolling upgrade of TVM.
The mentioned Gemfury repository should be accessible from TVault VM.
Please ensure the following points before starting the upgrade process:
No snapshot OR restore to be running.
Global job-scheduler should be disabled.
wlm-cron should be disabled and any lingering process should be killed.
The following sets of commands will disable the wlm-cron service and verify that it has been completely shut down.
Verify if the service is shut down with the below set of commands and expected output:
Take a backup of the conf files on all TVM nodes.
Check if Python 3.8 virtual environment exists on the T4O nodes
If the virtual environment does not exist, perform the below steps on the T4O nodes
Activate the Python3.6 virtual environment on all T4O nodes for wlm services upgrade
Ansible doesn't support the upgrade from previous versions to the latest one (2.10.4) and needs to be uninstalled for that reason
Run the following command on all TVM nodes to upgrade the pip package
Run the following commands on all TVM nodes to upgrade s3fuse and its dependent packages.
Run the following commands on all TVM nodes to upgrade s3fuse packages only.
Post upgrade, the password for T4O configurator will be reset to the default one i.e. 'password' for user 'admin'. Reset T4O configurator password after the upgrade.
Make sure the correct virtual environment(myansible_3.8) has been activated
Run the following command on all TVM nodes to upgrade tvault-configurator and its dependent packages.
Run the following command on all TVM nodes to upgrade tvault-configurator packages only.
During the update of the tvault-configurator the following error might be shown:
This error can be ignored.
Run the upgrade command on all TVM nodes to upgrade workloadmgr and its dependent packages.
Run the upgrade command on all TVM nodes to upgrade workloadmgr packages only.
Run the upgrade command on all TVM nodes to upgrade workloadmgr and its dependent packages.
Run the upgrade command on all TVM nodes to upgrade workloadmgr packages only.
Run the upgrade command on all TVM nodes to upgrade contegoclient and its dependent packages.
Run the upgrade command on all TVM nodes to upgrade contegoclient packages only.
Using the latest available oslo.messaging version can lead to stuck RPC and API calls.
It is therefore required to fix the oslo.messaging version on the TVM.
Delete the wlm-scheduler pcs resource because in 4.1 it is not a part of pcs
Restart the following services on all node(s) using respective commands\
tvault-object-store restart required only if Trilio is configured with S3 backend storage
Enable Global Job Scheduler ****Restart pcs resources only on the primary node
tvault-object-store will run only if TVault configured with S3 backend storage
Additional check for wlm-cron on the primary node
The above command should show only 2 processes running: sample below:
Check the mount point using “df -h” command
Trilio for Openstack 4.1 HF1 is introducing several new config parameters, which will be automatically set upon reconfiguration.
Trilio for Openstack 4.1 is changing the Trilio mount point as follows:
RHOSP 13 & 16.0 & 16.1: /var/lib/nova/triliovault-mounts
Kolla Ansible Ussuri: /var/trilio/triliovault-mounts
Reconfiguring the Trilio Appliance will automatically handle this change.
Trilio for Openstack 4.1 is changing the Trilio mount point as follows:
RHOSP 13 & 16.0 & 16.1: /var/lib/nova/triliovault-mounts
Kolla Ansible Ussuri: /var/trilio/triliovault-mounts
After reconfiguration of the Trilio Appliance, it is necessary to create a mount bind between the old and new mount points to provide full access to the old Trilio backups.
For RHOSP:
For Kolla:
To have this change persistent it is recommended to change the fstab accordingly:
For RHOSP:
For Kolla:
Red Hat OpenStack and Kolla Ansible Openstack are using the nova UID/GID of 42436 inside their containers instead of 162:162 which is the standard in other Openstack environments.
Please verify that the nova UID/GID on the Trilio Appliance is still 42436,
In case of the UID/GID is changed back to 162:162 follow these steps to set it back to 42436:42436.
Download the shell script that will change the user id
Assign executable permissions
Execute the script
Verify that nova
user and group ids have changed to '42436'
The first step is to remove the datamover container and to unmount the old mounts. This is necessary to make sure, that the new datamover container with the new backend target is not getting any interference from the old backup target.
Edit the globals.yml file to contain the new backup target.
For the major upgrade from 4.0 to 4.1 use the JuJu charms upgrade path.
The charms will always install the latest version available of T4O 4.1. This will only work when upgrading from 4.0 to 4.1.
The following charms exist:
Installs and manages Trilio Controller services.
Installs and manages the Trilio Datamover API service.
Installs and manages the Trilio Datamover service.
Installs and manages the Trilio Horizon Plugin.
The documentation of the charms can be found here:
The following steps have been tested and verified within Trilio environments. There have been cases where these steps updated all packages inside the LXC containers, leading to failures in basic OpenStack services.
It is recommended to run each of these steps in dry-run first.
When any other packages but Trilio packages are getting updated, stop the upgrade procedure and contact your Trilio customer success manager.
Trilio is releasing hotfixes, which require updating the packages inside the containers. These hotfixes can not be installed using the Juju charms as they don't require an update to the charms.
Either 4.1 GA OR any hotfix patch against 4.1 should be already deployed for performing upgrades mentioned in the current document.
No snapshot OR restore to be running.
Global job scheduler should be disabled.
wlm-cron should be disabled ( Following commands are to be run on MAAS node)
If trilio-wlm is HA enabled, set the cluster configuration to maintenance mode ( this command will fail for single node deployment)
juju exec [-m <model>] --unit trilio-wlm/leader "sudo crm configure property maintenance-mode=true"
Stop wlm-cron service
juju exec [-m <model>] --application trilio-wlm "sudo systemctl stop wlm-cron"
Ensure that no stale wlm-cron processes are there
juju exec [-m <model>] --application trilio-wlm "sudo ps -ef | grep [w]orkloadmgr-cron"
If any stale process is found, that needs to be killed manually.
The mentioned gemfury repository should be accessible from trilio units.
The deployed Trilio version is controlled by the triliovault-pkg-source
charm configuration option.
For each trilio charm it should be pointing to below gemfury repository.
This can be checked via juju [-m ] config triliovault-pkg-source
command output.
This is the preferred, recommended and tested method to update the packages is through the Juju command line.
Run below commands form MASS node
Check the trilio units status in juju status [-m ] | grep trilio
output. All the trilio units will be with new package.
Run the below command to update the schema
Check the schema head with below command. It should point to latest schema head.
Run below command to restart the apache2 service on horizon container
If the trilio-wlm nodes are HA enabled:
Make sure the wlm-cron services are down after the pkg upgrade.
Run the following command for the same:juju exec [-m <model>] --application trilio-wlm "sudo systemctl stop wlm-cron"
Unset the cluster maintenance modejuju exec [-m <model>] --unit trilio-wlm/leader "sudo crm configure property maintenance-mode=false"
Make sure the wlm-cron service up and running on any one node.juju exec [-m <model>] --application trilio-wlm "sudo systemctl status wlm-cron"
Set the Global Job Scheduler to the original state.
If any trilio unit get into error state with message
hook failed: "update-status"
Follow below steps
<license_file>
path to the license file
/usr/local/share/kolla-ansible/ansible/roles/
There is a role triliovault
/etc/kolla/globals.yml
Trilio entries had been appended at the end of the file
/etc/kolla/passwords.yml
Trilio entries had been appended at the end of the file
/usr/local/share/kolla-ansible/ansible/site.yml
Trilio entries had been appended at the end of the file
/root/multinode
Trilio entries had been appended at the end of this example inventory file
Download the latest available version of the below-mentioned packages. To know more about the latest releases, check out the latest release note under section.
Parameter | Defaults/choices | comments |
---|
Follow the to learn about the globals.yml Trilio variables.
Follow with the new backup target.
4.0 GA (4.0.92)
4.0 SP1 (4.0.115)
4.0 GA (4.0.92)
4.1 GA (4.1.94)
4.1 GA (4.1.94)
4.1 HF1 (4.1.94-hotfix1)
4.1 GA (4.1.94)
4.1 HF2 (4.1.94-hotfix2)
4.1 HF1 (4.1.94-hotfix1)
4.1 HF2 (4.1.94-hotfix2)
Trilio is using a base64 hash for the mount point of NFS Backup targets. This hash makes sure, that multiple NFS Shares can be used with the same Trilio installation.
This base64 hash is part of the Trilio incremental backups as an absolute path of the backing files. This requires the usage of mount bind during a DR scenario or quick migration scenario.
In the case that there is time for a thorough migration there is another option to change the backing file and make the Trilio backups available on a different NFS Share. This option is updating the backing file to the new NFS Share mount point.
Trilio is providing a shell script for the purpose of changing the backing file. This script is used after the Trilio appliance has been reconfigured to use the new NFS share.
The Shell script is publicly available at:
The following requirements need to be met before the change of the backing file can be attempted.
The Trilio Appliance has been reconfigured with the new NFS Share
Please check here for reconfiguring the Trilio Appliance
The workloads are available on the new NFS Share
The workloads are owned by nova:nova user
The shell script is changing one workload at a time.
The shell script has to run as nova user, otherwise the owner will get changed and the backup can not be used by Trilio.
Run the following command:
with
/var/triliovault-mounts/<base64>/
being the new NFS mount path
workload_<workload_id>
being the workload to rebase
The shell script is generating the following log file at the following location:
The log file will not get overwritten when the script is run multiple times. Each run of the script will append the available log file.
By default is the Trilio GUI available on all NICs on port 443.
To limit this to only one IP the following steps need to be applied.
The Trilio Appliance provides by default the possibility of 4 VIPs.
A general VIP which can be used for everything
A public VIP for the public endpoint
An internal VIP for the internal endpoint
An admin VIP for the admin endpoint
Should an additional VIP be required to restrict the access of the Trilio Dashboard to this VIP the new VIP needs to be created as a new resource inside the PCS cluster.
When the new dashboard_ip has been created or decided, then the next step is to set up the proxy forwarding inside Nginx, which will make the Trilio GUI available through port 8000.
All of the following steps need to be done all Trilio appliances of the cluster.
Create new conf file at /etc/nginx/conf.d/tvault-dashboard.conf
. Replace variables dashboard_ip
and virtual_ip
as configured or decided.
edit /etc/nginx/nginx.conf
and uncomment line
#include /etc/nginx/conf.d/*.conf;
check nginx syntax: nginx -t
reload nginx conf: nginx -s reload
Verify if the new cluster resource is visible or not using pcs resource
command and
by accessing the dashboard_ip.
The configured dashboard_ip will always end on the nginx service on port 8000 and will then be forwarded to the local dashboard service on port 443.
This configuration limits the required access to the local dashboard service to the Trilio appliance cluster itself. All other connections on port 443 can be dropped.
The following commands will set the required iptable rules.
At this point is the Trilio GUI only reachable on the dashboard_ip on port 8000. Accessing the Trilio GUI through any other IP or on port 443 is not allowed.
To configure the banner shown upon accessing the Trilio Appliance GUI do the following.
Login into Trilio Appliance console
edit the banner.yaml at /etc/tvault-config/banner.yaml
restart tvault-config service
The content of the banner.yaml looks as follows and can be edited as required:
In case of the Trilio Dashboard being lost it can be resetted as long as SSH access to the appliance is available.
To reset the password to its default do the following:
The dashboard login will be reset to:
The Trilio appliance can be reconfigured at any time to adjust the Trilio cluster to any changes in the Openstack environment or the general backup solution.
To reconfigure the Trilio Cluster go to the "Configure". The configure page shows the current configuration of the TriloVault cluster.
The configuration page also gives access to the ansible playbooks of the last successful configuration.
To start the reconfiguration of the Trilio Cluster click "Reconfigure" at the end of the table.
Follow the Configuring Trilio guide afterwards.
Once the Trilio configurator has started, it needs to run through successfully to continue to use Trilio.
The cluster will not roll back to its last working state in case of any errors.
When the reconfiguration is required to switch to an external database it is necessary to reinitialize the Trilio appliance and configure it from scratch.
It is possible to download the Trilio logs directly through the Trilio web gui.
To download logs throught the Trilio web gui:
Login into the Trilio web gui
Go to "Logs"
Choose the log to be downloaded
Each log for every Trilio Appliance can be downloaded seperatly
or a zip of all logfiles can be created and downloaded
This will download the current log files. Already rotated logs need to be downloaded through SSH from the Trilio appliance directly. All logs, including rotated old logs, can be found at:
/var/logs/workloadmgr/
The Trilio Appliance can be reinitialized, which will delete all workload related values from the Trilio database.
To reinitialize the Trilio Appliance do:
Login into the Trilio Dashboard
Click on "admin" in the upper right corner to open the submenu
Choose "Reinitialize"
Verify that you want to reinitialize the Trilio
In complex environments it is sometimes necessary to restart a single service or the complete solution. Rarely is restarting the complete node, where a service is running possible or even the ideal solution.
This page describes the services running by Trilio and how to restart those.
The Trilio Appliance is the controller of Trilio. Most services on the Appliance are running in a High Availability mode on a 3-node cluster.
The wlm-api service takes the API calls against the Trilio Appliance. It is running in active-active mode on all nodes of the Trilio cluster.
To restart the wlm-api service run on each Trilio node:
The wlm-scheduler service is taking job requests and identifies which Trilio node should take the request. It is running in active-active mode on all nodes of the Trilio cluster.
To restart the wlm-scheduler service run on each Trilio node:
The wlm-workloads service is the task worker of Trilio executing all jobs given to the Trilio node. It is running in active-active mode on all nodes of the Trilio cluster.
To restart the wlm-workloads service run on each Trilio node:
The wlm-cron service is responsible for starting scheduled Backups according to the configurtation of Tenant Workloads. It is running in active-passive mode and controlled by the pacemaker cluster.
To restart the wlm-workloads service run on the Trilio node with VIP assigned:
The Trilio appliance is running 1 to 4 virtual IPs on the Trilio cluster. These are controlled by the pacemaker cluster and provided through NGINX.
To restart these resources the pacemaker NGINX resource is getting restarted:
The Trilio cluster is using RabbitMQ as messaging service. It is running in active-active mode on all nodes of the Trilio cluster.
RabbitMQ is a complex system in itself. This guide will only provide the basic commands to do a restart of a node and check the health of the cluster afterward. For complete documentation of how to restart RabbitMQ, please follow the official RabbitMQ documentation.
To restart a RabbitMQ node run on each Trilio node:
It is recommended to wait for the node to rejoin and sync with the cluster before restarting another RabbitMQ node.
When the complete cluster is getting stopped and restarted it is important to keep the order of nodes in mind. The last node to be stopped needs to be the first node to be started.
The Galera Cluster is managing the Trilio MariaDB database. It is running in active-active mode on all nodes of the Trilio cluster.
Galera Cluster is a complex system in itself. This guide will only provide the basic commands to do a restart of a node and check the health of the cluster afterward. For complete documentation of how to restart Galera clusters, please follow the official Galera documentation.
When restarting Galera two different use-cases need to be considered:
Restarting a single node
Restarting the whole cluster
A single node can be restarted without any issues. It will automatically rejoin the cluster and sync against the remaining nodes.
The following commands will gracefully stop and restart the mysqld service.
After a restart will the cluster start the syncing process. Don't restart node after node to reach a complete cluster restart.
Check the cluster health after the restart.
Restarting a complete cluster requires some additional steps as the Galera cluster is basically destroyed once all nodes have been shut down. It needs to be rebuild afterwards.
First gracefully shutdown the Galera cluster on all nodes:
The second step is to identify the Galera node with the latest dataset. This can be achieved by reading the grastate.dat
file on the Trilio nodes.
When this documentation is followed the last mysqld service that got shut down will be the one with the latest dataset.
The value to check for are the seqno
.
The node with the highest seqno is the node that contains the latest data. This node will also contain safe_to_bootstrap: 1
to indicate that the Galera cluster can be rebuild from this node.
On the identified node the new cluster is getting generated with the following command:
Running galera_new_cluster on the wrong node will lead to data loss as this command will set the node the command is issued on as the first node of the cluster. All nodes which join afterward will sync against the data of this first node.
After the command has been issued is the mysqld service running on this node. Now the other nodes can be restarted one by one. The started nodes will automatically rejoin the cluster and sync against the master node. Once a synced status has been reached is each node a primary node in the cluster.
Check the Cluster health after all services are up again.
Verify the cluster health by running the following commands inside each Trilio MariaDB. The values returned from these statements have to be the same for each node.
Canonical Openstack is not using the Trilio Appliance. In Canonical environments is the Trilio controller unit part of the JuJu deployment as workloadmgr container.
To restart the services inside this container the following commands are to be issued.
On all nodes:
On a single node:
The Trilio dmapi service is running on the Openstack controller nodes. Depending on the Openstack Distribution Trilio is installed on different commands are issued to restart the dmapi service.
RHOSP13 is running the Trilio services as docker containers. The dmapi service can be restarted by issuing the following command on the host running the dmapi service.
RHOSP16 is running the Trilio services as docker containers. The dmapi service can be restarted by issuing the following command on the host running the dmapi service.
Canonical is running the Trilio services in JuJu controlled LXD containers. The dmapi service can be restarted by issuing the following command from the MASS node.
Kolla-Ansible Openstack is running the Trilio services as docker containers. The dmapi service can be restarted by issuing the following command on the host running the dmapi service.
Ansible Openstack is running the Trilio services as LXD containers. The dmapi service can be restarted by issuing the following command on the host running the dmapi service.
The Trilio datamover service is running on the Openstack compute nodes. Depending on the Openstack Distribution Trilio is installed on different commands are issued to restart the datamover service.
RHOSP13 is running the Trilio services as docker containers. The datamover service can be restarted by issuing the following command on the compute node.
RHOSP16 is running the Trilio services as docker containers. The datamover service can be restarted by issuing the following command on the compute node.
Canonical is running the Trilio services in JuJu controlled LXD containers. The datamover service can be restarted by issuing the following command from the MASS node.
Kolla-Ansible Openstack is running the Trilio services as docker containers. The dmapi service can be restarted by issuing the following command on the host running the dmapi service.
Ansible Openstack is running the Trilio datamover service directly on the compute node. The datamover service can be restarted by issuing the following command on.
triliovault_tag | <triliovault_tag> | Trilio Build Version |
horizon_image___full | commented out | Uncomment to install Trilio Horizon Container instead of previous installed container |
triliovault_docker___username | triliodocker |
triliovault_docker___password | triliopassword |
triliovault_docker___registry | Default: docker.io |
triliovault_backup___target | nfs amazon_s3 ceph_s3 | 'nfs': If the backup target is NFS 'amazon_s3': If the backup target is Amazon S3 'ceph_s3': If the backup target type is S3 but not amazon S3. |
dmapi_workers | Default: 16 | If dmapi_workers field is not present in config file. The Default value will be equals to number of cores present on the node |
triliovault_nfs___shares | Only with nfs for triliovault_backup_target User needs to provide NFS share path, e.g.: 192.168.122.101:/opt/tvault |
triliovault_nfs___options | Default: nolock, soft, timeo=180, intr, lookupcache=none | Only with nfs for triliovault_backup_target Keep default values if unclear |
triliovault_s3___access_key | Only with amazon_s3 or cephs3 for triliovault_backuptarget Provide S3 access key |
triliovault_s3___secret_key | Only with amazon_s3 or cephs3 for triliovault_backuptarget Provide S3 secret key |
triliovault_s3___region_name | Default: us-east-1 | Only with amazon_s3 or cephs3 for triliovault_backuptarget Provide S3 region or keep default if no region required |
triliovault_s3___bucket_name | Only with amazon_s3 or cephs3 for triliovault_backuptarget Provide S3 bucket |
triliovault_s3___endpoint_url | Only with cephs3 for triliovault_backuptarget Provide S3 endpoint URL |
triliovault_s3___ssl_enabled | True False | Only with ceph_s3 for triliovault_backup_target Set to true if endpoint is on HTTPS |
triliovault_s3__ssl_cert__file_name | s3-cert-pem | Only with ceph_s3 for triliovault_backup_target and if SSL is enabled on S3 endpoint URL and SSL certificates are self-signed OR issued by a private authority user needs to copy the 'ceph s3 ca chain file' to "/etc/kolla/config/triliovault/" directory on ansible server. Create this directory if it does not exist already. |
triliovault_copy__ceph_s3__ssl_cert | True False | Set to true if: ceph_s3 for triliovault_backup_target and if SSL is enabled on S3 endpoint URL and SSL certificates are self-signed OR issued by a private authority |
A workload is a backup job that protects one or more Virtual Machines according to a configured policy. There can be as many workloads as needed. But each VM can only be part of one Workload.
To view all available workloads of a project inside Horizon do:
Login to Horizon
Navigate to Backups
Navigate to Workloads
The overview in Horizon lists all workloads with the following additional information:
Creation time
Workload Name
Workload description
Total amount of Snapshots inside this workload
Total amount of succeeded Snapshots
Total amount of failed Snapshots
Workload Type
Status of the Workload
To create a workload inside Horizon do the following steps:
Login to Horizon
Navigate to the Backups
Navigate to Workloads
Click "Create Workload"
Provide Workload Name and Workload Description on the first tab "Details"
Choose between Serial or Parallel workload on the first tab "Details"
Choose the Policy if available to use on the first tab "Details"
Choose the VMs to protect on the second Tab "Workload Members"
Decide for the schedule of the workload on the Tab "Schedule"
Provide the Retention policy on the Tab "Policy"
Choose the Full Backup Interval on the Tab "Policy"
If required check "Pause VM" on the Tab "Options"
Click create
The created Workload will be available after a few seconds and starts to take backups according to the provided schedule and policy.
A workload contains many information, which can be seen in the workload overview.
To enter the workload overview inside Horizon do the following steps:
Login to Horizon
Navigate to the Backups
Navigate to Workloads
Identify the workload to show the details on
Click the workload name to enter the Workload overview
The Workload Details tab provides you with the general most important information about the workload:
Name
Description
Availability Zone
List of protected VMs including the information of qemu guest agent availability
The status of the qemu-guest-agent just shows, whether the necessary Openstack configuration has been done for this VM to provide qemu guest agent integration. It does not check, whether the qemu guest agent is installed and configured on the VM.
It is possible to navigate to the protected VM directly from the list of protected VMs.
The Workload Snapshots Tab shows the list of all available Snapshots in the chosen Workload.
From here it is possible to work with the Snapshots, create Snapshots on demand and start Restores.
The Workload Policy Tab gives an overview of the current configured scheduler and retention policy. The following elements are shown:
Scheduler Enabled / Disabled
Start Date / Time
End Date / Time
RPO
Time till next Snapshot run
Retention Policy and Value
Full Backup Interval policy and value
The Workload Filesearch Tab provides access to the powerful search engine, which allows to find files and folders on Snapshots without the need of a restore.
Please refer to the File Search User Guide to learn more about this feature.
The Workload Miscellaneous Tab shows the remaining metadata of the Workload. The following information are provided:
Creation time
last update time
Workload ID
Workload Type
Workloads can be modified in all components to match changing needs.
Editing a Workload will set the User, who edits the Workload, as the new owner.
To edit a workload in Horizon do the following steps:
Login to the Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload to be modified
Click the small arrow next to "Create Snapshot" to open the sub-menu
Click "Edit Workload"
Modify the workload as desired - All parameters except workload type can be changed
Click "Update"
Once a workload is no longer needed it can be safely deleted.
All Snapshots need to be deleted before the workload gets deleted. Please refer to the Snapshots User Guide to learn how to delete Snapshots.
To delete a workload do the following steps:
Login to Horizon
Navigate to the Backups
Navigate to Workloads
Identify the workload to be deleted
Click the small arrow next to "Create Snapshot" to open the sub-menu
Click "Delete Workload"
Confirm by clicking "Delete Workload" yet again
Workloads that are actively taking backups or restores are locked for further tasks. It is possible to unlock a workload by force if necessary.
It is highly recommend to use this feature only as last resort in case of backups/restores being stuck without failing or a restore is required while a backup is running.
Login to the Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload to unlock
Click the small arrow next to "Create Snapshot" to open the sub-menu
Click "Unlock Workload"
Confirm by clicking "Unlock Workload" yet again
In rare cases it might be necessary to start a backup chain all over again, to ensure the quality of the created backups. To not recreate a Workload in such cases is it possible to reset a Workload.
The Workload reset will:
Cancel all ongoing tasks
Delete all existing Openstack Trilio Snapshots from the protected VMs
recalculate the next Snapshot time
take a full backup at the next Snapshot
To reset a Workload do the following steps:
Login to the Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload to reset
Click the small arrow next to "Create Snapshot" to open the sub-menu
Click "Reset Workload"
Confirm by clicking "Reset Workload" yet again
A Snapshot is a single Trilio backup of a workload including all data and metadata. It contains the information of all VM's that are protected by the workload.
Login to Horizon
Navigate to the Backups
Navigate to Workloads
Identify the workload to show the details on
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
The List of Snapshots for the chosen Workload contains the following additional information:
Creation Time
Name of the Snapshot
Description of the Snapshot
Total amount of Restores from this Snapshot
Total amount of succeeded Restores
Total amount of failed Restores
Snapshot Type
Snapshot Size
Snapshot Status
Snapshots are automatically created by the Trilio scheduler. If necessary or in case of deactivated scheduler is it possible to create a Snapshot on demand.
There are 2 possibilities to create a snapshot on demand.
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that shall create a Snapshot
Click "Create Snapshot"
Provide a name and description for the Snapshot
Decide between Full and Incremental Snapshot
Click "Create"
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that shall create a Snapshot
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Click "Create Snapshot"
Provide a name and description for the Snapshot
Decide between Full and Incremental Snapshot
Click "Create"
Each Snapshot contains a lot of information about the backup. These information can be seen in the Snapshot overview.
To reach the Snapshot Overview follow these steps:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to show
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshot in the Snapshot list
Click the Snapshot Name
The Snapshot Details Tab shows the most important information about the Snapshot.
Snapshot Name / Description
Snapshot Type
Time Taken
Size
Which VMs are part of the Snapshot
for each VM in the Snapshot
Instance Info - Name & Status
Security Group(s) - Name & Type
Flavor - vCPUs, Disk & RAM
Networks - IP, Networkname & Mac Address
Attached Volumes - Name, Type, size (GB), Mount Point & Restore Size
Misc - Original ID of the VM
The Snapshot Restores Tab shows the list of Restores that have been started from the chosen Snapshot. It is possible to start Restores from here.
The Snapshot Miscellaneous Tab provides the remaining metadata information about the Snapshot.
Creation Time
Last Update time
Snapshot ID
Workload ID of the Workload containing the Snapshot
Once a Snapshot is no longer needed, it can be safely deleted from a Workload.
The retention policy will automatically delete the oldest Snapshots according to the configure policy.
You have to delete all Snapshots to be able to delete a Workload.
Deleting a Trilio Snapshot will not delete any Openstack Cinder Snapshots. Those need to be deleted separately if desired.
There are 2 possibilities to delete a Snapshot.
To delete a single Snapshot through the submenu follow these steps:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to delete
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshot in the Snapshot list
Click the small arrow in the line of the Snapshot next to "One Click Restore" to open the submenu
Click "Delete Snapshot"
Confirm by clicking "Delete"
To delete one or more Snapshots through the Snapshot overview do the following:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to show
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshots in the Snapshot list
Check the checkbox for each Snapshot that shall be deleted
Click "Delete Snapshots"
Confirm by clicking "Delete"
Ongoing Snapshots can be canceled.
Canceled Snapshots will be treated like errored Snapshots
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to cancel
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshot in the Snapshot list
Click "Cancel" on the same line as the identified Snapshot
Confirm by clicking "Cancel"
The file search functionality allows the user to search for files and folders located on a chosen VM in a workload in one or more Backups.
The file search tab is part of every workload overview. To reach it follow these steps:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload a file search shall be done in
Click the workload name to enter the Workload overview
Click File Search to enter the file search tab
A file search runs against a single virtual machine for a chosen subset of backups using a provided search string.
To run a file search the following elements need to be decided and configured
Under VM Name/ID choose the VM that the search is done upon. The drop down menu provides a list of all VMs that are part of any Snapshot in the Workload.
VMs that are no longer activly protected by the Workload but are still part of an existing Snapshot are listed in red.
The File Path defines the search string that is run against the chosen VM and Snapshots. This search string does support basic RegEx.
The File Path has to start with a '/'
Windows partitions are fully supported. Each partition is its own Volume with its own root. Use '/Windows' instead of 'C:\Windows'
The file search does not go into deeper directories and always searches on the directory provided in the File Path
Example File Path for all files inside /etc : /etc/*
"Filter Snapshots by" is the third and last component that needs to be set. This defines which Snapshots are going to be searched.
There are 3 possibilities for a pre-filtering:
All Snapshots - Lists all Snapshots that contain the chosen VM from all available Snapshots
Last Snapshots - Choose between the last 10, 25, 50, or custom Snapshots and click Apply to get the list of the available Snapshots for the chosen VM that match the criteria.
Date Range - Set a start and end date and click apply to get the list of all available Snapshots for the chosen VM within the set dates.
After the pre-filtering is done all matching Snapshots are automatic prechosen. Uncheck any Snapshot that shall not be searched.
When no Snapshot is chosen the file search will not start.
To start a File Search the following elements need to be set:
A VM to search in has to be chosen
A valid File Path provided
At least one Snapshot to search in selected
Once those have been set click "Search" to start the file search.
Do not navigate to any other Horizon tab or website after starting the File Search. Results are lost and the search has to be repeated to regain them.
After a short time the results will be presented. The results are presented in a tabular format grouped by Snapshots and Volumes inside the Snapshot.
For each found file or folder the following information are provided:
POSIX permissions
Amount of links pointing to the file or folder
User ID who owns the file or folder
Group ID assigned to the file or folder
Actual size in Bytes of the file or folder
Time of creation
Time of last modification
Time of last access
Full path to the found file or folder
Once the Snapshot of interest has been identified it is possible to go directly to the Snapshot using the "View Snapshot" option at the top of the table. It is also possible to directly mount the Snapshot using the "Mound Snapshot" Button at the end of the table.
Trilio can notify users via E-Mail upon the completion of backup and restore jobs.
The E-Mail will be sent to the owner of the Workload.
To use the E-mail notifications, two requirements need to be met.
Both requirements need to be set or configured by the Openstack Administrator. Please contact your Openstack Administrator to verify the requirements.
As the E-Mail will be sent to the owner of the Workload does the Openstack User, who created the workload, require to have an E-Mail address associated.
Trilio needs to know which E-Mail server to use, to send the E-mail notifications. Backup Administrators can do this in the "Backup Admin" area.
E-Mail notifications are activated tenant wide. To activate the E-Mail notification feature for a tenant follow these steps:
Login to Horizon
Navigate to the Backups
Navigate to Settings
Check/Uncheck the box for "Enable Email Alerts"
The following screenshots show example E-mails send by Trilio.
Every Workload has its own schedule. Those schedules can be activated, deactivated and modified.
A schedule is defined by:
Status (Enabled/Disabled)
Start Day/Time
End Day
Hrs between 2 snapshots
To disable the scheduler of a single Workload in Horizon do the following steps:
Login to the Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload to be modified
Click the small arrow next to "Create Snapshot" to open the sub-menu
Click "Edit Workload"
Navigate to the tab "Schedule"
Uncheck "Enabled"
Click "Update"
To disable the scheduler of a single Workload in Horizon do the following steps:
Login to the Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload to be modified
Click the small arrow next to "Create Snapshot" to open the sub-menu
Click "Edit Workload"
Navigate to the tab "Schedule"
check "Enabled"
Click "Update"
To modify a schedule the workload itself needs to be modified.
This system is used during all backup and restore features.
As a trust is bound to a specific user for each Workload does the Trilio Horizon plugin show the status of the Scheduler on the Workload list page.
A Restore is the workflow to bring back the backed up VMs from a Trilio Snapshot.
To reach the list of Restores for a Snapshot follow these steps:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to show
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshot in the Snapshot list
Click the Snapshot Name
Navigate to the Restores tab
To reach the detailed Restore overview follow these steps:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to show
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshot in the Snapshot list
Click the Snapshot Name
Navigate to the Restores tab
Identify the restore to show
Click the restore name
The Restore Details Tab shows the most important information about the Restore.
Name
Description
Restore Type
Status
Time taken
Size
Progress Message
Progress
Host
Restore Options
The Restore Options are the restore.json provided to Trilio.
List of VMs restored
restored VM Name
restored VM Status
restored VM ID
The Misc tab provides additional Metadata information.
Creation Time
Restore ID
Snapshot ID containing the Restore
Workload
Once a Restore is no longer needed, it can be safely deleted from a Workload.
Deleting a Restore will only delete the Trilio information about this Restore. No Openstack resources are getting deleted.
There are 2 possibilities to delete a Restore.
To delete a single Restore through the submenu follow these steps:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to delete
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshot in the Snapshot list
Click the Snapshot Name
Navigate to the Restore tab
Click "Delete Restore" in the line of the restore in question
Confirm by clicking "Delete Restore"
To delete one or more Restores through the Restore list do the following:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to show
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshots in the Snapshot list
Enter the Snapshot by clicking the Snapshot name
Navigate to the Restore tab
Check the checkbox for each Restore that shall be deleted
Click "Delete Restore" in the menu above
Confirm by clicking "Delete Restore"
Ongoing Restores can be canceled.
To cancel a Restore in Horizon follow these steps:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to delete
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshot in the Snapshot list
Click the Snapshot Name
Navigate to the Restore tab
Identify the ongoing Restore
Click "Cancel Restore" in the line of the restore in question
Confirm by clicking "Cancel Restore"
The One Click Restore will bring back all VMs from the Snapshot in the same state as they were backed up. They will:
be located in the same cluster in the same datacenter
use the same storage domain
connect to the same network
have the same flavor
The user can't change any Metadata.
The One Click Restore requires, that the original VM's that have been backed up are deleted or otherwise lost. If even one VM is still existing, will the One Click Restore fail.
The One Click Restore will automatically update the Workload to protect the restored VMs.
There are 2 possibilities to start a One Click Restore.
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to be restored
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the Snapshot to be restored
Click "One Click Restore" in the same line as the identified Snapshot
(Optional) Provide a name / description
Click "Create"
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to be restored
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the Snapshot to be restored
Click the Snapshot Name
Navigate to the "Restores" tab
Click "One Click Restore"
(Optional) Provide a name / description
Click "Create"
The Selective Restore is the most complex restore Trilio has to offer. It allows to adapt the restored VMs to the exact needs of the User.
With the selective restore the following things can be changed:
Which VMs are getting restored
Name of the restored VMs
Which networks to connect with
Which Storage domain to use
Which DataCenter / Cluster to restore into
Which flavor the restored VMs will use
The Selective Restore is always available and doesn't have any prerequirements.
The Selective Restore will automatically update the Workload to protect the created instance in the case that the original instance is no longer existing.
There are 2 possibilities to start a Selective Restore.
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to be restored
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the Snapshot to be restored
Click on the small arrow next to "One Click Restore" in the same line as the identified Snapshot
Click on "Selective Restore"
Configure the Selective Restore as desired
Click "Restore"
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to be restored
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the Snapshot to be restored
Click the Snapshot Name
Navigate to the "Restores" tab
Click "Selective Restore"
Configure the Selective Restore as desired
Click "Restore"
The Inplace Restore covers those use cases, where the VM and its Volumes are still available, but the data got corrupted or needs to a rollback for other reasons.
It allows the user to restore only the data of a selected Volume, which is part of a backup.
The Inplace Restore only works when the original VM and the original Volume are still available and connected. Trilio is checking this by the saved Object-ID.
The Inplace Restore will not create any new RHV resources. Please use one of the other restore options if new Volumes or VMs are required.
There are 2 possibilities to start an Inplace Restore.
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to be restored
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the Snapshot to be restored
Click on the small arrow next to "One Click Restore" in the same line as the identified Snapshot
Click on "Inplace Restore"
Configure the Inplace Restore as desired
Click "Restore"
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to be restored
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the Snapshot to be restored
Click the Snapshot Name
Navigate to the "Restores" tab
Click "Inplace Restore"
Configure the Inplace Restore as desired
Click "Restore"
The workloadmgr client CLI is using a restore.json file to define the restore parameters for the selective and the inplace restore.
An example for a selective restore of this restore.json is shown below. A detailed analysis and explanation is given afterwards.
Before the exact details of the restore are to be provided it is necessary to provide the general metadata for the restore.
openstack
starts the exact definition of the restore
The Selective Restore requires a lot of information to be able to execute the restore as desired.
Those information are divided into 3 components:
instances
restore_topology
networks_mapping
This part contains all information about all instances that are part of the Snapshot to restore and how they are to be restored.
Even when VMs are not to be restored are they required inside the restore.json to allow a clean execution of the restore.
Each instance requires the following information
All further information are only required, when the instance is part of the restore.
To use the next free IP available in the set Nics to an empty list [ ]
Using an empty list for Nics combined with the Network Topology Restore, will the restore automatically restore the original IP address of the instance.
The root disk needs to be at least as big as the root disk of the backed up instance was.
The following example describes a single instance with all values.
Do not mix network topology restore together with network mapping.
To activate a network topology restore set:
To activate network mapping set:
When the network mapping is activated it is used, it is necessary to provide the mapping details, which are part of the networks_mapping block:
The Inplace Restore requires less information thana selective restore. It only requires the base file with some information about the Instances and Volumes to be restored.
When the boot disk is at the same time a Cinder Disk, both values need to be set true.
There are no network information required, but the field have to exist as empty value for the restore to work.
The Trilio Appliance Dashboard gives an overview of the running services and their Status inside the Cluster. This dashboard is accessible through the virtual IP.
If service status panels on the dashboard page are not visible then access the virtual IP on port 3001 (https://<TVO-VIP>:3001/) and accept the SSL exception, and then refresh the dashboard page.
It shows for each Trilio Appliance the Status of the following Trilio services:
wlm-workloads
wlm-scheduler
wlm-api
wlm-cron
The wlm-cron
service runs on only one Trilio appliance at all times. That they are shown inactive on other nodes is not an error
To give administrators an overview of the HA status, does the dashboard also show the service status for:
Pacemaker
RabbitMQ
MySQL Galera Cluster \
The following Trilio services are providing certificates for secured access to the Trilio solution.
Service | Port used | Description |
---|---|---|
The TVault-Config service and the Nginx Resource for the Grafana Dashboard are using the same certificate.
The certificate used is a symlink to a host-specific certificate. Each Trilio VM has its own self-signed certificate by default which is getting recreated every time the TVault-Config service is restarted.
When the certificate for the TVault-Config and Nginx (Grafana) is to be changed to a customer chosen certificate it is required to deactivate the recreation of the certificates upon service restart.
Trilio is planning to change this behavior to make it easier for customers to change the certificate in the future.
Login into the Trilio VM via SSH
Edit the following file:
/home/stack/myansible/lib/python3.6/site-packages/tvault_configurator/tvault_config_bottle.py
Look for create_ssl_certificates() in the main function
Comment out create_ssl_certificates()
Repeat for all nodes of the Trilio cluster
The resulting main function will look like this:
Afterward, the certificates can be replaced manually by overwriting the files.
Once the certificates have been replaced by the desired ones restart the TVault-Config service and the Nginx pcs resource.
The certificate provided by the Nginx for the wlm-api service is set during configuration when HTTPS endpoints are configured for the Trilio appliance. This certificate is provided to the end-user or Openstack every time an API call to the Trilio solution is sent.
To change the certificate through the configurator make sure to create HTTPS endpoints and upload the certificate and key using the advanced options of the configurator.
The certificates can be changed manually if necessary.
They are located under /opt/stack/data/cert/
These certificates can be replaced manually and the Nginx resource restarted afterward.
Trilio allows you to view or download a file from the snapshot. Any changes to the files or directories when snapshot is mounted are temporary and are discarded when the snapshot is unmounted. Mounting is a faster way to restore a single or multiple files. To mount a snapshot follow these steps.
It is recommended to do these steps once to the chosen cloud-Image and then upload the modified cloud image to Glance.
Create an Openstack image using a Linux based cloud-image like Ubuntu, CentOS or RHEL with the following metadata parameters.
Spin up an instance from that image It is recommended to have at least 8GB RAM for the mount operation. Bigger Snapshots can require more RAM.
install and activate qemu-guest-agent
Edit /etc/sysconfig/qemu-ga
and remove the following from BLACKLIST_RPC section
Disable SELINUX in /etc/sysconfig/selinux
Install python3 and lvm2
Reboot the Instance
install and activate qemu-guest-agent
Verify the loaded path of qemu-guest-agent
Follow this path when systemctl returns the following loaded path
Edit /etc/init.d/qemu-guest-agent
and add Freeze-Hook file path in daemon args
Follow this path when systemctl returns the following loaded path
Edit qemu-guest-agent systemd
file
Add the following lines
Restart qemu-guest-agent service
Install Python3
Reboot the VM
Mounting a Snapshot to a File Recovery Manager provides read access to all data that is located on the in the mounted Snapshot.
It is possible to run the mounting process against any Openstack instance. During this process will the instance be rebooted.
Always mount Snapshots to File Recovery Manager instances only.
To be able to successfully mount Windows (NTFS) Snapshots the ntfs filesystem support is required on the File Recovery Manager instance.
Unmount any mounted Snapshot once there is no further need to keep it mounted. Mounted Snapshots will not be purged by the Retention policy.
There are 2 possibilities to mount a Snapshot in Horizon.
To mount a Snapshot through the Snapshot list follow these steps:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to mount
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Identify the searched Snapshot in the Snapshot list
Click the small arrow in the line of the Snapshot next to "One Click Restore" to open the submenu
Click "Mount Snapshot"
Choose the File Recovery Manager instance to mount to
Confirm by clicking "Mount"
Should all instances of the project be listed and there is a File Recovery Manager instance existing verify together with the administrator that the File Recovery Manager image has the following property set:
tvault_recovery_manager=yes
To mount a Snapshot through the File Search results follow these steps:
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to mount
Click the workload name to enter the Workload overview
Navigate to the File Search tab
Identify the Snapshot to be mounted
Click "Mount Snapshot" for the chosen Snapshot
Choose the File Recovery Manager instance to mount to
Confirm by clicking "Mount"
Should all instances of the project be listed and there is a File Recovery Manager instance existing verify together with the administrator that the File Recovery Manager image has the following property set:
tvault_recovery_manager=yes
The File Recovery Manager is a normal Linux based Openstack instance.
It can be accessed via SSH or SSH based tools like FileZila or WinSCP.
SSH login is often disabled by default in cloud-images. Enable SSH login if necessary.
The mounted Snapshot can be found at the following path:
/home/ubuntu/tvault-mounts/mounts/
Each VM in the Snapshot has its own directory using the VM_ID as the identifier.
Sometimes a Snapshot is mounted for a longer time and it needs to be identified, which Snapshots are mounted.
There are 2 possibilities to identify mounted Snapshots inside Horizon.
Login to Horizon
Navigate to Compute
Navigate to Instances
Identify the File Recovery Manager Instance
Click on the Name of the File Recovery Manager Instance to bring up its details
On the Overview tab look for Metadata
Identify the value for mounted_snapshot_url
The mounted_snapshot_url
contains the Snapshot ID of the Snapshot that has been mounted last.
This value only gets updated, when a new Snapshot is mounted.
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to mount
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Search for the Snapshot that has the option "Unmount Snapshot"
Once a mounted Snapshot is no longer needed it is possible and recommended to unmount the snapshot.
Unmounting a Snapshot frees the File Recovery Manager instance to mount the next Snapshot and allows Trilio retention policy to purge the former mounted Snapshot.
Deleting the File Recovery Manager instance will not update the Trilio appliance. The Snapshot will be considered mounted until an unmount command has been received.
Login to Horizon
Navigate to Backups
Navigate to Workloads
Identify the workload that contains the Snapshot to mount
Click the workload name to enter the Workload overview
Navigate to the Snapshots tab
Search for the Snapshot that has the option "Unmount Snapshot"
Click "Unmount Snapshot"
If users want to use a different container registry for the triliovault containers, then the user can edit this value. In that case, the user first needs to manually pull triliovault containers from and push them to the other registry.
--all {True,False}
List all workloads of all projects (valid for admin user only)
--nfsshare <nfsshare>
List all workloads of nfsshare (valid for admin user only)
--display-name
Optional workload name. (Default=None)
--display-description
Optional workload description. (Default=None)
--workload-type-id
Workload Type ID is required
--source-platform
Workload source platform is required. Supported platforms is 'openstack'
--instance
Specify an instance to include in the workload. Specify option multiple times to include multiple instances. instance-id: include the instance with this UUID
--jobschedule
Specify following key value pairs for jobschedule Specify option multiple times to include multiple keys. 'start_date' : '06/05/2014' 'end_date' : '07/15/2014' 'start_time' : '2:30 PM' 'interval' : '1 hr' 'snapshots_to_retain' : '2'
--metadata
Specify a key value pairs to include in the workload_type metadata Specify option multiple times to include multiple keys. key=value
--policy-id <policy_id>
ID of the policy to assign to the workload
<workload_id>
ID/name of the workload to show
--verbose
option to show additional information about the workload
--display-name
Optional workload name. (Default=None)
--display-description
Optional workload description. (Default=None)
--instance <instance-id=instance-uuid>
Specify an instance to include in the workload. Specify option multiple times to include multiple instances. instance-id: include the instance with this UUID
--jobschedule <key=key-name>
Specify following key value pairs for jobschedule Specify option multiple times to include multiple keys. If don't specify timezone, then by default it takes your local machine timezone 'start_date' : '06/05/2014' 'end_date' : '07/15/2014' 'start_time' : '2:30 PM' 'interval' : '1 hr' 'retention_policy_type' : 'Number of Snapshots to Keep' or 'Number of days to retain Snapshots' 'retention_policy_value' : '30'
--metadata <key=key-name>
Specify a key value pairs to include in the workload_type metadata Specify option multiple times to include multiple keys. key=value
--policy-id <policy_id>
ID of the policy to assign
<workload_id>
ID of the workload to edit
<workload_id>
ID/name of the workload to delete
--database_only <True/False>
Keep True if want to delete from database only.(Default=False)
<workload_id>
ID of the workload to unlock
<workload_id>
ID/name of the workload to reset
--workload_id <workload_id>
Filter results by workload_id
--tvault_node <host>
List all the snapshot operations scheduled on a tvault node(Default=None)
--date_from <date_from>
From date in format 'YYYY-MM-DDTHH:MM:SS' eg 2016-10-10T00:00:00, If don't specify time then it takes 00:00 by default
--date_to <date_to>
To date in format 'YYYY-MM-DDTHH:MM:SS'(defult is current day), Specify HH:MM:SS to get snapshots within same day inclusive/exclusive results for date_from and date_to
--all {True,False}
List all snapshots of all the projects(valid for admin user only)
<workload_id>
ID of the workload to snapshot.
--full
Specify if a full snapshot is required.
--display-name <display-name>
Optional snapshot name. (Default=None)
--display-description <display-description>
Optional snapshot description. (Default=None)
Please refer to the User Guide to learn more about Restores.
<snapshot_id>
ID of the snapshot to be shown
--output <output>
Option to get additional snapshot details, Specify --output metadata for snapshot metadata, Specify --output networks for snapshot vms networks, Specify --output disks for snapshot vms disks
<snapshot_id>
ID of the snapshot to be deleted
<snapshot_id>
ID of the snapshot to be canceled
<vm_id>
ID of the VM to be searched
<file_path>
Path of the file to search for
--snapshotids <snapshotid>
Search only in specified snapshot ids snapshot-id: include the instance with this UUID
--end_filter <end_filter>
Displays last snapshots, example , last 10 snapshots, default 0 means displays all snapshots
--start_filter <start_filter>
Displays snapshots starting from , example , snapshot starting from 5, default 0 means starts from first snapshot
--date_from <date_from>
From date in format 'YYYY-MM-DDTHH:MM:SS' eg 2016-10-10T00:00:00, If time isn't specified then it takes 00:00 by default
--date_to <date_to>
To date in format 'YYYY-MM-DDTHH:MM:SS'(defult is current day),Specify HH:MM:SS to get snapshots within same day inclusive/exclusive results for date_from and date_to
--workloadid <workloadid>
Requires at least one workloadid, Specify an ID of the workload whose scheduler disables. Specify option multiple times to include multiple workloads. --workloadids <workloadid> --workloadids <workloadid>
--workloadid <workloadid>
Requires at least one workloadid, Specify an ID of the workload whose scheduler disables. Specify option multiple times to include multiple workloads. --workloadids <workloadid> --workloadids <workloadid>
Please follow this procedure to .
Trilio is using the which enables the Trilio service user to act in the name of another Openstack user.
<workload_id>
ID of the workload to validate
--snapshot_id <snapshot_id>
ID of the Snapshot to show the restores of
<restore_id>
ID of the restore to be shown
--output <output>
Option to get additional restore details, Specify --output metadata for restore metadata,--output networks --output subnets --output routers --output flavors
<restore_id>
ID of the restore to be deleted
<restore_id>
ID of the restore to be deleted
<snapshot_id>
ID of the snapshot to restore.
--display-name <display-name>
Optional name for the restore.
--display-description <display-description>
Optional description for restore.
<snapshot_id>
ID of the snapshot to restore.
--display-name <display-name>
Optional name for the restore.
--display-description <display-description>
Optional description for restore.
--filename <filename>
Provide file path(relative or absolute) including file name , by default it will read file: /usr/lib/python2.7/site-packages/workloadmgrclient/input-files/restore.json .You can use this for reference or replace values into this file.
<snapshot_id>
ID of the snapshot to restore.
--display-name <display-name>
Optional name for the restore.
--display-description <display-description>
Optional description for restore.
--filename <filename>
Provide file path(relative or absolute) including file name , by default it will read file: /usr/lib/python2.7/site-packages/workloadmgrclient/input-files/restore.json .You can use this for reference or replace values into this file.
The restore.json requires many information about the backed up resources. All required information can be gathered in the .
name
the name of the restore
description
the description of the restore
oneclickrestore <True/False>
If the restore is a oneclick restore. Setting this to True will override all other settings and a One Click Restore is started.
restore_type <oneclick/selective/inplace>
defines the restore that is intended
type openstack
defines that the restore is into an openstack cloud.
id
original id of the instance
include <True/False>
Set True when the instance shall be restored
name
new name of the instance
availability_zone
Nova Availability Zone the instance shall be restored into. Leave empty for "Any Availability Zone"
Nics
list of openstack Neutron ports that shall be attached to the instance. Each Neutron Port consists of:
id
ID of the Neutron port to use
mac_address
Mac Address of the Neutron port
ip_address
IP Address of the Neutron port
network
network the port is assigned to. Contains the following information:
id
ID of the network the Neutron port is part of
subnet
subnet the port is assigned to. Contains the following information:
id
ID of the network the Neutron port is part of
vdisks
List of all Volumes that are part of the instance. Each Volume requires the following information:
id
Original ID of the Volume
new_volume_type
The Volume Type to use for the restored Volume. Leave empty for Volume Type None
availability_zone
The Cinder Availability Zone to use for the Volume. The default Availability Zone of Cinder is Nova
flavor
Defines the Flavor to use for the restored instance. Contains the following information:
ram
How much RAM the restored instance will have (in MB)
ephemeral
How big the ephemeral disk of the instance will be (in GB)
vcpus
How many vcpus the restored instance will have available
swap
How big the Swap of the restored instance will be (in MB). Leave empty for none.
disk
Size of the root disk the instance will boot with
id
ID of the flavor that matches the provided information
networks
list of snapshot_network and target_network pairs
snapshot_network
the network backed up in the snapshot, contains the following:
id
Original ID of the network backed up
subnet
the subnet of the network backed up in the snapshot, contains the following:
id
Original ID of the subnet backed up
target_network
the existing network to map to, contains the following
id
ID of the network to map to
subnet
the subnet of the network backed up in the snapshot, contains the following:
id
ID of the subnet to map to
id
ID of the instance inside the Snapshot
restore_boot_disk
Set to True if the boot disk of that VM shall be restored.
include
Set to True if at least one Volume from this instance shall be restored
vdisks
List of disks, that are connected to the instance. Each disk contains:
id
Original ID of the Volume
restore_cinder_volume
set to true if the Volume shall be restored
<snapshot_id>
ID of the Snapshot to be mounted
<mount_vm_id>
ID of the File Recovery Manager instance to mount the Snapshot to.
--workloadid <workloadid>
Restrict the list to snapshots in the provided workload
<snapshot_id>
ID of the snapshot to unmount.
TVault-Config
443
Webservice providing the TrilIoVault Dashboard
Nginx (wlm-api)
8780
provides the VIP for wlm-api service
Nginx (Grafana)
3001
VIP for the dashboard of Grafana service running on TrilIioVault VM
Trilio Workloads are designed to allow a Desaster Recovery without the need to backup the Trilio database.
As long as the Trilio Workloads are existing on the Backup Target Storage and a Trilio installation has access to them, it is possible to restore the Workloads.
Notify users to of Workloads being available
This procedure is designed to be applicable to all Openstack installations using Trilio. It is to be used as a starting point to develop the exact Desaster Recovery process of a specific environment.
In case that instead of noticing the users, the workloads shall be restored is it necessary to have an User in each Project, that has the necessary privileges to restore.
Trilio incremental Snapshots involve a backing file to the prior backup taken, which makes every Trilio incremental backup a synthetic full backup.
Trilio is using qcow2 backing files for this feature:
As can be seen in the example is the backing file an absolute path, which makes it necessary, that this path exists so the backing files can be accessed.
Trilio is using the base64 hashing algorithm for the NFS mount-paths, to allow the configuration of multiple NFS Volumes at the same time. The hash value is calculated using the provided NFS path.
When the path of the backing file is not available on the Trilio appliance and Compute nodes, will the restores of incremental backups fail.
The tested and recommended method to make the backing files available is creating the required directory path and using mount --bind
to make the path available for the backups.
Running the mount --bind command will make the necessary path available until the next reboot. If it is required to have access to the path beyond a reboot is it necessary to edit the fstab.
Troubleshooting inside a complex environment like Openstack can be very time-consuming. The following tipps will help to speed up the troubleshooting process to identify root causes.
Openstack and Trilio are divided into multiple services. Each service has a very specific purpose that is called during a backup or recovery procedure. Knowing which service is doing what helps to understand where the error is happening, allowing more focused troubleshooting.
The Trilio Cluster is the Controller of Trilio. It receives all Workload related requests from the users.
Every task of a backup or restore process is triggered and managed from here. This includes the creation of the directory structure and initial metadata files on the Backup Target.
During a backup process is the Trilio cluster also responsible to gather the metadata about the backed up VMs and networks from the Openstack environment. It is sending API calls towards the Openstack endpoints on the configured endpoint type to fetch this information. Once the metadata has been received does the Trilio Cluster write it as json files on the Backup Target.
The Trilio cluster is also sending the Cinder Snapshot command.
During restore process is the Trilio cluster reading the VM metadata from its Database and is using the metadata to create the Shell for the restore. It is sending API calls to the Openstack environment to create the necessary resources.
The dmapi service is the connector between the Trilio cluster and the datamover running on the compute nodes.
The purpose of the dmapi service is to identify which compute node is responsible for the current backup or restore task. To do so is the dmapi service connecting to the nova api database requesting the compute hose of a provided VM.
Once the compute host has been identified is the dmapi forwarding the command from the Trilio Cluster to the datamover running on the identified compute host.
The datamover is the Trilio service running on the compute nodes.
Each datamover is responsible for the VMs running on top of its compute node. A datamover can not work with VMs running on a different compute node.
The datamover is controlling the freeze and thaw of VMs as well as the actual movement of the data.
Trilio is reading and writing on the Backup Target as nova:nova.
The POSIX user-id and group-id of nova:nova need to be aligned between the Trilio Cluster and all compute nodes. Otherwise backup or restores may fail with permission or file not found issues.
Alternativ ways to achieve the goal are possible, as long as all required nodes can fully write and read as nova:nova on the Backup Target.
It is recommended to verify the required permissions on the Backup Target in case of any errors during the data transfer phase or in case of any file permission errors.
Trilio is using RBAC to allow the usage of Trilio features to users.
This trustee role is absolutely required and can not be overwritten using the admin role.
It is recommended to verify the assignment of the Trilio Trustee Role in case of any permission errors from Trilio during creation of Workloads, backups or restores.
Trilio is creating Cinder Snapshots and temporary Cinder Volumes. The Openstack Quotas need to allow that.
Every disk that is getting backed up requires one temporary Cinder Volumes.
Every Cinder Volume that is getting backup requires two Cinder Snapshots. The second Cinder Snapshot is temporary to calculate the incremental.
Once Trilio is configured use virtual IP to access its dashboard. If service status panels on the dashboard page is not visible then access virtual IP on port 3001 (https://<TVO-VIP>:3001/) by accepting the exception and again refresh the dashboard page.
The Trilio Cluster contains multiple log files.
The main log is workloadmgr-workloads.log, which contains all logs about ongoing and past Trilio backup and restore tasks. It can be found at:
/var/log/workloadmgr/workloadmgr-workloads.log
The next important log is the workloadmgr-api.log, which contains all logs about API calls received by the Trilio Cluster. It can be found at:
/var/log/workloadmgr/workloadmgr-api.log
The log for the third service is the workloadmgr-scheduler.log, which contains all logs about the internal job scheduling between Trilio nodes in the Trilio Cluster.
/var/log/workloadmgr/workloadmgr-scheduler.log
The last but not least service running on the Trilio Nodes is the wlm-cron service, which is controlling the scheduled automated backups.
/var/log/workloadmgr/workloadmgr-workloads.log
In the case of using S3 as a backup target is there also a log file that keeps track of the S3-Fuse plugin used to connect with the S3 storage.
/var/log/workloadmgr/s3vaultfuse.py.log
Canonical Openstack is having these logs inside the workloadmgr container.
The log for the Trilio Datamover API service is located on the nodes, typically controller, where the Trilio Datamover API container is running under:
/var/log/containers/trilio-datamover-api/dmapi.log
The log for the Trilio Datamover service is located on the nodes, typically compute, where the Trilio Datamover container is running under:
/var/log/containers/trilio-datamover/tvault-contego.log
In case of S3 being used in the log for the S3 Fuse plugin located on the same nodes under:
/var/log/containers/trilio-datamover/tvault-object-store.log
The log for the Trilio Datamover API service is located on the nodes, typically controller, where the Trilio Datamover API container is running under:
/var/log/kolla/trilio-datamover-api/dmapi.log
The log for the Trilio Datamover service is located on the nodes, typically compute, where the Trilio Datamover container is running under:
/var/log/kolla/triliovault-datamover/tvault-contego.log
In case of S3 being used in the log for the S3 Fuse plugin located on the same nodes under:
/var/log/kolla/trilio-datamover/tvault-object-store.log
The log for the Trilio Datamover API service is located on the nodes, typically controller, where the Trilio Datamover API container is running.
Log into the dmapi container using lxc-attach
command (example below).
lxc-attach -n controller_dmapi_container-a11984bf
The log file is then located under:
/var/log/dmapi/dmapi.log
The log for the Trilio Datamover service is typically located on the compute nodes and the logs can be found here:
/var/log/tvault-contego/tvault-contego.log
In case of S3 being used in the log for the S3 Fuse plugin located on the same nodes under:
/var/log/tvault-object-store/tvault-object-store.log
Trilio enables Openstack administrators to set Project Quotas against the usage of Trilio.
The following Quotas can be set:
Number of Workloads a Project is allowed to have
Number of Snapshots a Project is allowed to have
Number of VMs a Project is allowed to protect
Amount of Storage a Project is allowed to use on the Backup Target
The Trilio Quota feature is available for all supported Openstack versions and distributions, but only Train and higher releases include the Horizon integration of the Quota feature.
Workload Quotas are managed like any other Project Quotas.
Login into Horizon as user with admin role
Navigate to Identity
Navigate to Projects
Identify the Project to modify or show the quotas on
Use the small arrow next to "Manage Members" to open the submenu
Choose "Modify Quotas"
Navigate to "Workload Manager"
Edit Quotas as desired
Click "Save"
Trilio is providing several different Quotas. The following command allows listing those.
Trilio 4.1 do not yet have the Quota Type Volume integrated. Using this will not generate any Quotas a Tenant has to apply to.
The following command will show the details of a provided Quota Type.
The following command will create a Quota for a given project and set the provided value.
The high watermark is automatically set to 80% of the allowed value when set via Horizon.
A created Quota will generate an allowed_quota_object with its own ID. This is ID is needed when continuing to work with the created Quota.
The following command lists all Trilio Quotas set for a given project.
The following command shows the details about a provided allowed Quota.
The following command shows how to update the value of an already existing allowed Quota.
The following command will delete an allowed Quota and sets the value of the connected Quota Type back to unlimited for the affected project.
Trilio provides Backup-as-a-Service, which allows Openstack Users to manage and control their backups themselves. This doesn't eradicate the need for a Backup Administrator, who has an overview of the complete Backup Solution.
To provide Backup Administrators with the tools they need does Trilio for Openstack provide a Backup-Admin area in Horizon in addition to the API and CLI.
To access the Backups-Admin area follow these steps:
Login to Horizon using admin user.
Click on Admin Tab.
Navigate to Backups-Admin Tab.
Navigate to Trilio page.
The Backups-Admin area provides the following features.
It is possible to reduce the shown information down to a single tenant. That way seeing the exact impact the chosen Tenant has.
The status overview is always visible in the Backups-Admin area. It provides the most needed information on a glance, including:
Storage Usage (nfs only)
Number of protected VMs compared to number of existing VMs
Number of currently running Snapshots
Status of TVault Nodes
Status of Contego Nodes
The status of nodes is filled when the services are running and in good status.
This tab provides information about all currently existing Workloads. It is the most important overview tab for every Backup Administrator and therefor the default tab shown when opening the Backup-Admins area.
The following information are shown:
User-ID that owns the Workload
Project that contains the Workload
Workload name
Workload Type
Availability Zone
Amount of protected VMs
Performance information about the last 30 backups
How much data was backed up (green bars)
How long did the Backup take (red line)
Piechart showing amount of Full (Blue) Backups compared to Incremental (Red) Backups
Number of successful Backups
Number of failed Backups
Storage used by that Workload
Which Backup target is used
When is the next Snapshot run
What is the general intervall of the Workload
Scheduler Status including a Switch to deactivate/activate the Workload
Administrators often need to figure out, where a lot of resources are used up, or they need to quickly provide usage information to a billing system. This tab helps in these tasks by providing the following information:
Storage used by a Tenant
VMs protected by a Tenant
It is possible to drill down to see the same information per workload and finally per protected VM.
The Usage tab includes workloads and VMs that are no longer actively used by a Tenant, but exist on the backup target.
This tab displays information about Trilio cluster nodes. The following information are shown:
Node name
Node ID
Trilio Version of the node
IP Address
Active Controller Node (True/False)
Status of the Node
The Virtual IP is shown as it's own node. It is typically shown directly below the current active Controller Node.
This tab displays information about Trilio contego service. The following information are shown:
Service-Name
Compute Node the service is running on
Zone
Service Status from Openstack perspective (enabled/disabled)
Version of the Service
General Status
last time the Status was updated
This tab displays information about the backup target storage. It contains the following information:
Storage Name
Clicking on the Storage name provides an overview of all workloads stored on that storage.
Capacity of the storage
Total utilization of the storage
Status of the storage
Statistic information
Percentage all storages are used
Percentage how much storage is used for full backups
Amount of Full backups versus Incremental backups
Audit logs provide the sequence of workload related activities done by users, like workload creation, snapshot creation, etc. The following information are shown:
Time of the entry
What task has been done
Project the task has performed in
User that performed the task
The Audit log can be searched for strings to find for example only entries down by a specific user.
Additionally, can the shown timeframe be changed as necessary.
The license tab provides an overview over the current license and allows to upload new licenses, or validate the current license.
A license validation is automatically done, when opening the tab.
The following information about an active license are shown:
Organization (License name)
License ID
Purchase date - when was the license created
License Expiry Date
Maintenance Expiry Date
License value
License Edition
License Version
License Type
Description of the License
Evaluation (True/False)
Trilio will stop all activities once a license is no longer valid or expired.
The policy tab gives Administrators the possibility to work with workload policies.
Please use Workload Policies in the Admin guide to learn more about how to create and use Workload Policies.
This tab manages all global settings for the whole cloud. Trilio has two types of settings:
Email settings
Job scheduler settings.
These settings will be used by Trilio to send email reports of snapshots and restores to users.
Configuring the Email settings is a must-have to provide Email notification to Openstack users.
The following information are required to configure the email settings:
SMTP Server
SMTP username
SMTP password
SMTP port
SMTP timeout
Sender email address
A test email can be sent directly from the configuration page.
To work with email settings through CLI use the following commands:
To set an email setting for the first time or after deletion use:
To update an already set email setting through CLI use:
To show an already set email setting use:
To delete a set email setting use:
The Global Job Scheduler can be used to deactivate all scheduled workloads without modifying each one of them.
To activate/deactivate the Global Job Scheduler through the Backups-Admin area:
Login to Horizon using admin user.
Click on Admin Tab.
Navigate to Backups-Admin Tab.
Navigate to Trilio page.
Navigate to the Settings tab
Click "Disable/Enable Job Scheduler"
Check or Uncheck the box for "Job Scheduler Enabled"
Confirm by clicking on "Change"
The Global Job Scheduler can be controlled through CLI as well.
To get the status of the Global Job Scheduler use:
To deactivate the Global Job Scheduler use:
To activate the Global Job Scheduler use:
Trilio’s tenant driven backup service gives tenants control over backup policies. However, sometimes it may be too much control to tenants and the cloud admins may want to limit what policies are allowed by tenants. For example, a tenant may become overzealous and only uses full backups every 1 hr interval. If every tenant were to pursue this backup policy, it puts a severe strain on cloud infrastructure. Instead, if cloud admin can define predefined backup policies and each tenant is only limited to those policies then cloud administrators can exert better control over backup service.
Workload policy is similar to nova flavor where a tenant cannot create arbitrary instances. Instead, each tenant is only allowed to use the nova flavors published by the admin.
To see all available Workload policies in Horizon follow these steps:
Login to Horizon using admin user.
Click on Admin Tab.
Navigate to Backups-Admin
Navigate to Trilio
Navigate to Policy
The following information are shown in the policy tab for each available policy:
Creation time
name
description
status
set interval
set retention type
set retention value
To create a policy in Horizon follow these steps:
Login to Horizon using admin user.
Click on Admin Tab.
Navigate to Backups-Admin
Navigate to Trilio
Navigate to Policy
Click new policy
provide a policy name on the Details tab
provide a description on the Details tab
provide the RPO in the Policy tab
Choose the Snapshot Retention Type
provide the Retention value
Choose the Full Backup Interval
Click create
To edit a policy in Horizon follow these steps:
Login to Horizon using admin user.
Click on Admin Tab.
Navigate to Backups-Admin
Navigate to Trilio
Navigate to Policy
identify the policy to edit
click on "Edit policy" at the end of the line of the chosen policy
edit the policy as desired - all values can be changed
Click "Update"
To assign or remove a policy in Horizon follow these steps:
Login to Horizon using admin user.
Click on Admin Tab.
Navigate to Backups-Admin
Navigate to Trilio
Navigate to Policy
identify the policy to assign/remove
click on the small arrow at the end of the line of the chosen policy to open the submenu
click "Add/Remove Projects"
Choose projects to add or remove by using the plus/minus buttons
Click "Apply"
To delete a policy in Horizon follow these steps:
Login to Horizon using admin user.
Click on Admin Tab.
Navigate to Backups-Admin
Navigate to Trilio
Navigate to Policy
identify the policy to assign/remove
click on the small arrow at the end of the line of the chosen policy to open the submenu
click "Delete Policy"
Confirm by clicking "Delete"
Trilio is using the Openstack Keystone Trust system which enables the Trilio service user to act in the name of another Openstack user.
This system is used during all backup and restore features.
Openstack Administrators should never have the need to directly work with the trusts created.
The cloud-trust is created during the Trilio configuration and further trusts are created as necessary upon creating or modifying a workload.
Trusts can only be worked with via CLI
<trust_id> ID of the trust to show
<role_name>
Name of the role that trust is created for
--is_cloud_trust {True,False}
Set to true if creating cloud admin trust. While creating cloud trust use same user and tenant which used to configure Trilio and keep the role admin.
Each Trilio Workload has a dedicated owner. The ownership of a Workload is defined by:
Openstack User - The Openstack User-ID is assigned to a Workload
Openstack Project - The Openstack Project-ID is assigned to a Workload
Openstack Cloud - The Trilio Serviceuser-ID is assigned to a Workload
Openstack Users can update the User ownership of a Workload by modifying the Workload.
This ownership secures, that only the owners of a Workload are able to work with it.
Openstack Administrators can reassign Workloads or reimport Workloads from older Trilio installations.
Workload import allows to import Workloads existing on the Backup Target into the Trilio database.
The Workload import is designed to import Workloads, which are owned by the Cloud.
It will not import or list any Workloads that are owned by a different cloud.
To get a list of importable Workloads use the following CLI command:
--project_id <project_id>
List workloads belongs to given project only.
To import Workloads into the Trilio database use the following CLI command:
--workloadids <workloadid>
Specify workload ids to import only specified workloads. Repeat option for multiple workloads.
The definition of an orphaned Workload is from the perspective of a specific Trilio installation. Any workload that is located on the Backup Target Storage, but not known to the TrilioVualt installation is considered orphaned.
Further is to divide between Workloads that were previously owned by Projects/Users in the same cloud or are migrated from a different cloud.
The following CLI command provides the list of orphaned workloads:
Running this command against a Backup Target with many Workloads can take a bit of time. Trilio is reading the complete Storage and verifies every found Workload against the Workloads known in the database.
Openstack administrators are able to reassign a Workload to a new owner. This involves the possibility to migrate a Workload from one cloud to another or between projects.
Reassigning a workload only changes the database of the target Trilio installation. When the Workload was managed before by a different Trilio installation, will that installation not be updated.
Use the following CLI command to reassign a Workload:
A sample mapping file with explanations is shown below:
To use the workloadmgr CLI tool on the Trilio appliance it is only necessary to activate the virtual environment of the workloadmgr
An rc-file to authenticate against Openstack is required.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/workloads
Provides the list of all workloads for the given tenant/project id
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
POST
https://$(tvm_address):8780/v1/$(tenant_id)/workloads
Creates a workload in the provided Tenant/Project with the given details.
Workload create requires a Body in json format, to provide the requested information.
Using a policy-id will pull the following information from the policy. Values provided in the Body will be overwritten with the values from the Policy.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/<workload_id>
Shows all details of a specified workload
PUT
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/<workload_id>
Modifies a workload in the provided Tenant/Project with the given details.
Workload modify requires a Body in json format, to provide the information about the values to modify.
All values in the body are optional.
Using a policy-id will pull the following information from the policy. Values provided in the Body will be overwritten with the values from the Policy.
DELETE
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/<workload_id>
Deletes the specified Workload.
POST
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/<workload_id>/unlock
Unlocks the specified Workload
POST
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/<workload_id>/reset
Resets the defined workload
Trilio is composed of multiple services, which can be checked in case of any errors.
Trilio is using 4 main services on the Trilio Appliance:
wlm-api
wlm-scheduler
wlm-workloads
wlm-cron
Those can be verified to be up and running using the systemctl status
command.
The second component to check the Trilio Appliance's health is the nginx and pacemaker cluster.
Checking the availability of the Trilio API on the chosen endpoints is recommended.
The following example curl command lists the available workload-types and verifies that the connection is available and working:
Please check the API guide for more commands and how to generate the X-Auth-Token.
The dmapi service has its own Keystone endpoints, which should be checked in addition to the actual service status.
In order to check the dmapi service go to dmapi container which is residing on controller nodes and run below command
The datamover service is running on each compute node. Logging to compute node and run below command
The dmapi service has its own Keystone endpoints, which should be checked in addition to the actual service status.
Run the following command on “nova-api” nodes and make sure “triliovault_datamover_api” container is in started state.
Run the following command on "nova-compute" nodes and make sure the container is in a started state.
Run the following command on horizon nodes and make sure the container is in a started state.
Run the following command on MAAS nodes and make sure all trilio units like trilio-data-mover
, trilio-dm-api
, trilio-horizon-plugin
, trilio-wlm
are in active state
Make sure the Trilio dmapi and horizon containers (shown below) are in a running state and no other Trilio container is deployed on controller nodes. If the containers are in restarting state or not listed by the following command then your deployment is not done correctly. Please note that the 'Horizon' container is replaced with the Trilio Horizon container. This container will have the latest OpenStack horizon + Trilio's horizon plugin.
Make sure the Trilio datamover container (shown below) is in a running state and no other Trilio container is deployed on compute nodes. If the containers are in restarting state or not listed by the following command then your deployment is not done correctly.
Please check dmapi endpoints on overcloud node.
<quota_type_id>
ID of the Quota Type to show
<quota_type_id>
ID of the Quota Type to be created
<allowed_value>
Value to set for this Quota Type
<high_watermark>
Value to set for High Watermark warnings
<project_id>
Project to assign the quota to
<project_id>
Project to list the Quotas from
<allowed_quota_id>
ID of the allowed Quota to show.
<allowed_value>
Value to set for this Quota Type
<high_watermark>
Value to set for High Watermark warnings
<project_id>
Project to assign the quota to
<allowed_quota_id>
ID of the allowed Quota to update
<allowed_quota_id>
ID of the allowed Quota to delete
--description
Optional description (Default=None) Not required for email settings
--category
Optional setting category (Default=None) Not required for email settings
--type
settings type set to email_settings
--is-public
sets if the setting can be seen publicly set to False
--is-hidden
sets if the setting will always be hidden set to False
--metadata
sets if the setting can be seen publicly Not required for email settings
<name>name of the setting Take from the list below
<value>value of the setting Take value type from the list below
--description
Optional description (Default=None) Not required for email settings
--category
Optional setting category (Default=None) Not required for email settings
--type
settings type set to email_settings
--is-public
sets if the setting can be seen publicly set to False
--is-hidden
sets if the setting will always be hidden set to False
--metadata
sets if the setting can be seen publicly Not required for email settings
<name>
name of the setting Take from the list below
<value>
value of the setting Take value type from the list below
--get_hidden
show hidden settings (True) or not (False) Not required for email settings, use False
if set
<setting_name>
name of the setting to show Take from the list below
<setting_name>
name of the setting to delete Take from the list below
Setting name | Value type | example |
---|---|---|
<policy_id>
Id of the policy to show
--policy-fields <key=key-name>
Specify following key value pairs for policy fields Specify option multiple times to include multiple keys. 'interval' : '1 hr' 'retention_policy_type' : 'Number of Snapshots to Keep' or 'Number of days to retain Snapshots' 'retention_policy_value' : '30' 'fullbackup_interval' : '-1' (Enter Number of incremental snapshots to take Full Backup between 1 to 999, '-1' for 'NEVER' and '0' for 'ALWAYS')For example --policy-fields interval='1 hr' --policy-fields retention_policy_type='Number of Snapshots to Keep '--policy-fields retention_policy_value='30' --policy- fields fullbackup_interval='2'
--display-description <display_description>
Optional policy description. (Default=No description)
--metadata <key=keyname>
Specify a key value pairs to include in the workload_type metadata Specify option multiple times to include multiple keys. key=value
<display_name>
the name the policy will get
--display-name <display-name>
Name of the policy
--display-description <display_description>
Optional policy description. (Default=No description)
--policy-fields <key=key-name>
Specify following key value pairs for policy fields Specify option multiple times to include multiple keys. 'interval' : '1 hr' 'retention_policy_type' : 'Number of Snapshots to Keep' or 'Number of days to retain Snapshots' 'retention_policy_value' : '30' 'fullbackup_interval' : '-1' (Enter Number of incremental snapshots to take Full Backup between 1 to 999, '-1' for 'NEVER' and '0' for 'ALWAYS')For example --policy-fields interval='1 hr' --policy-fields retention_policy_type='Number of Snapshots to Keep '--policy-fields retention_policy_value='30' --policy- fields fullbackup_interval='2'
--metadata <key=keyname>
Specify a key value pairs to include in the workload_type metadata Specify option multiple times to include multiple keys. key=value
<policy_id>
the name the policy will get
--add_project <project_id>
ID of the project to assign policy to. Use multiple times to assign multiple projects.
--remove_project <project_id>
ID of the project to remove policy from. Use multiple times to remove multiple projects.
<policy_id>
policy to be assigned or removed
<policy_id>
ID of the policy to be deleted
<trust_id>
ID of the trust to be deleted
--migrate_cloud {True,False}
Set to True if you want to list workloads from other clouds as well. Default is False.
--generate_yaml {True,False}
Set to True if want to generate output file in yaml format, which would be further used as input for workload reassign API.
--old_tenant_ids <old_tenant_id>
Specify old tenant ids from which workloads need to reassign to new tenant. Specify multiple times to choose Workloads from multiple tenants.
--new_tenant_id <new_tenant_id>
Specify new tenant id to which workloads need to reassign from old tenant. Only one target tenant can be specified.
--workload_ids <workload_id>
Specify workload_ids which need to reassign to new tenant. If not provided then all the workloads from old tenant will get reassigned to new tenant. Specifiy multiple times for multiple workloads.
--user_id <user_id>
Specify user id to which workloads need to reassign from old tenant. only one target user can be specified.
--migrate_cloud {True,False}
Set to True if want to reassign workloads from other clouds as well. Default if False
--map_file
Provide file path(relative or absolute) including file name of reassign map file. Provide list of old workloads mapped to new tenants. Format for this file is YAML.
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
Name | Type | Description |
---|
smtp_default___recipient
String
admin@example.net
smtp_default___sender
String
admin@example.net
smtp_port
Integer
587
smtp_server_name
String
Mailserver_A
smtp_server_username
String
admin
smtp_server_password
String
password
smtp_timeout
Integer
10
smtp_email_enable
Boolean
True
tvm_address | string | IP or FQDN of Trilio Service |
tenant_id | string | ID of the Tenant/Project to create the workload in |
X-Auth-Project-Id | string | Project to run the authentication against |
X-Auth-Token | string | Authentication token to use |
Content-Type | string | application/json |
Accept | string | application/json |
User-Agent | string | python-workloadmgrclient |
tvm_address | string | IP or FQDN of Trilio Service |
tenant_id | string | ID of the Project/Tenant where to find the Workload |
workload_id | string | ID of the Workload to show |
X-Auth-Project-Id | string | Project to run the authentication against |
X-Auth-Token | string | Authentication token to use |
Accept | string | application/json |
User-Agent | string | python-workloadmgrclient |
tvm_address | string | IP or FQDN of Trilio Service |
tenant_id | string | ID of the Tenant/Project where to find the workload in |
workload_id | string | ID of the Workload to modify |
X-Auth-Project-Id | string | Project to run the authentication against |
X-Auth-Token | string | Authentication token to use |
Content-Type | string | application/json |
Accept | string | application/json |
User-Agent | string | python-workloadmgrclient |
tvm_address | string | IP or FQDN of Trilio Service |
tenant_id | string | ID of the Tenant where to find the Workload in |
workload_id | string | ID of the Workload to delete |
database_only | boolean | True leaves the Workload data on the Backup Target |
X-Auth-Project-Id | string | Project to run the authentication against |
X-Auth-Token | string | Authentication Token to use |
Accept | string | application/json |
User-Agent | string | python-workloadmgrclient |
tvm_address | string | IP or FQDN of Trilio Service |
tenant_id | string | ID of the Tenant where to find the Workload in |
workload_id | string | ID of the Workload to unlock |
X-Auth-Project-Id | string | Project to run the authentication against |
X-Auth-Token | string | Authentication Token to use |
Accept | string | application/json |
User-Agent | string | python-workloadmgrclient |
tvm_address | string | IP or FQDN of Trilio Service |
tenant_id | string | ID of the Tenant where to find the Workload in |
workload_id | string | ID of the Workload to reset |
X-Auth-Project-Id | string | Project to run the authentication against |
X-Auth-Token | string | Authentication Token to use |
Accept | string | application/json |
User-Agent | string | python-workloadmgrclient |
tvm_address | string | IP or FQDN of Trilio Service |
tenant_id | string | ID of the Tenant/Project to fetch the workloads from |
nfs_share | string | lists workloads located on a specific nfs-share |
all_workloads | boolean | admin role required - True lists workloads of all tenants/projects |
X-Auth-Project-Id | string | project to run the authentication against |
X-Auth-Token | string | Authentication token to use |
Accept | string | application/json |
User-Agent | string | python-workloadmgrclient |
This runbook will demonstrate how to set up Disaster Recovery with Trilio for a given scenario.
The chosen scenario is following an actively used Trilio customer environment.
There are two Openstack clouds available "Openstack Cloud A" and Openstack Cloud B". "Openstack Cloud B" is the Disaster Recovery restore point of "Openstack Cloud A" and vice versa. Both clouds have an independent Trilio installation integrated. These Trilio installations are writing their Backups to NFS targets. "Trilio A" is writing to "NFS A1" and "Trilio B" is writing to "NFS B1". The NFS Volumes used are getting synced against another NFS Volume on the other side. "NFS A1" is syncing with "NFS B2" and "NFS B1" is syncing with "NFS A2". The syncing process is set up independently from Trilio and will always favor the newer dataset.
This scenario will cover the Disaster Recovery of a single Workload and a complete Cloud. All processes are done be the Openstack administrator.
This runbook will assume that the following is true:
"Openstack Cloud A" and "Openstack Cloud B" both have an active Trilio installation with a valid license
"Openstack Cloud A" and "Openstack Cloud B" have free resources to host additional VMs
"Openstack Cloud A" and "Openstack Cloud B" have Tenants/Projects available that are the designated restore points for Tenant/Projects of the other side
Access to a user with the admin role permissions on domain level
One of the Openstack clouds is down/lost
For ease of writing will this runbook assume from here on, that "Openstack Cloud A" is down and the Workloads are getting restored into "Openstack Cloud B".
In the case of the usage of shared Tenant networks, beyond the floating IP, the following additional requirement is needed: All Tenant Networks, Routers, Ports, Floating IPs, and DNS Zones are created
A single Workload can do a Disaster Recovery in this Scenario, while both Clouds are still active. To do so the following high-level process needs to be followed:
Copy the Workload directories to the configured NFS Volume
Make the right Mount-Paths available
Reassign the Workload
Restore the Workload
Clean up
This process only shows how to get a Workload from "Openstack Cloud A" to "Openstack Cloud B". The vice versa process is similar.
As only a single Workload is to be recovered it is more efficient to copy the data of that single Workload over to the "NFS B1" Volume, which is used by "Trilio B".
It is recommended to use the Trilio VM as a connector between both NFS Volumes, as the nova user is available on the Trilio VM.
Trilio Workloads are identified by their ID und which they are stored on the Backup Target. See below example:
In the case that the Workload ID is not known can available Metadata inside the Workload directories be used to identify the correct Workload.
The identified workload needs to be copied with all subdirectories and files. Afterward, it is necessary to adjust the ownership to nova:nova with the right permissions.
Trilio backups are using qcow2 backing files, which make every incremental backup a full synthetic backup. These backing files can be made visible using the qemu-img tool.
The MTAuMTAuMi4yMDovdXBzdHJlYW0=
part of the backing file path is the base64 hash value, which will be calculated upon the configuration of a Trilio installation for each provided NFS-Share.
This hash value is calculated based on the provided NFS-Share path: <NFS_IP>/<path> If even one character in the NFS-Share path is different between the provided NFS-Share paths a completely different hash value is generated.
Workloads, that have been moved between NFS-Shares, require that their incremental backups can follow the same path as on their original Source Cloud. To achieve this it is necessary to create the mount path on all compute nodes of the Target Cloud.
Afterwards a mount bind is used to make the workloads data accessible over the old and the new mount path. The following example shows the process of how to successfully identify the necessary mount points and create the mount bind.
The used hash values can be calculated using the base64 tool in any Linux distribution.
Based on the identified base64 hash values the following paths are required on each Compute node.
/var/triliovault-mounts/MTAuMTAuMi4yMDovdXBzdHJlYW1fc291cmNl
and
/var/triliovault-mounts/MTAuMjAuMy4yMjovdXBzdHJlYW1fdGFyZ2V0
In the scenario of this runbook is the workload coming from the NFS_A1 NFS-Share, which means the mount path of that NFS-Share needs to be created and bound to the Target Cloud.
To keep the desired mount past a reboot it is recommended to edit the fstab of all compute nodes accordingly.
Trilio workloads have clear ownership. When a workload is moved to a different cloud it is necessary to change the ownership. The ownership can only be changed by Openstack administrators.
To fulfill the required tasks an admin role user is used. This user will be used until the workload has been restored. Therefore, it is necessary to provide this user access to the desired Target Project on the Target Cloud.
Each Trilio installation maintains a database of workloads that are known to the Trilio installation. Workloads that are not maintained by a specific Trilio installation, are from the perspective of that installation, orphaned workloads. An orphaned workload is a workload accessible on the NFS-Share, that is not assigned to any existing project in the Cloud the Trilio installation is protecting.
The identified orphaned workloads need to be assigned to their new projects. The following provides the list of all available projects viewable by the used admin-user in the target_domain.
To allow project owners to work with the workloads as well will they get assigned to a user with the backup trustee role that is existing in the target project.
Now that all informations have been gathered the workload can be reassigned to the target project.
After the workload has been assigned to the new project it is recommended to verify the workload is managed by the Target Trilio and is assigned to the right project and user.
The reassigned workload can be restored using Horizon following the procedure described here.
This runbook will continue on the CLI only path.
To be able to do the necessary selective restore a few pieces of information about the snapshot to be restored are required. The following process will provide all necessary information.
List all Snapshots of the workload to restore to identify the snapshot to restore
Get Snapshot Details with network details for the desired snapshot
Get Snapshot Details with disk details for the desired Snapshot
The selective restore is using a restore.json file for the CLI command. This restore.json file needs to be adjusted according to the desired restore.
To do the actual restore use the following command:
To verify the success of the restore from a Trilio perspective the restore status is checked.
After the Desaster Recovery Process has been successfully completed it is recommended to bring the TVM installation back into its original state to be ready for the next DR process.
Delete the workload that got restored.
The Trilio database is following the Openstack standard of not deleting any database entries upon deletion of the cloud object. Any Workload, Snapshot or Restore, which gets deleted, is marked as deleted only.
To allow the Trilio installation to be ready for another disaster recovery it is necessary to completely delete the entries of the Workloads, which have been restored.
Trilio does provide and maintain a script to safely delete workload entries and all connected entities from the Trilio database.
This script can be found here: https://github.com/trilioData/solutions/tree/master/openstack/CleanWlmDatabase
After all restores for the target project have been achieved it is recommended to remove the used admin user from the project again.
This Scenario will cover the Disaster Recovery of a full cloud. It is assumed that the source cloud is down or lost completly. To do the disaster recovery the following high-level process needs to be followed:
Reconfigure the Target Trilio installation
Make the right Mount-Paths available
Reassign the Workload
Restore the Workload
Reconfigure the Target Trilio installation back to the original one
Clean up
Before the Desaster Recovery Process can start is it necessary to make the backups to be restored available for the Trilio installation. The following steps need to be done to completely reconfigure the Trilio installation.
During the reconfiguration process will all backups of the Target Region be on hold and it is not recommended to create new Backup Jobs until the Desaster Recovery Process has finished and the original Trilio configuration has been restored.
To add the NFS-Vol2 to the Trilio Appliance cluster the Trilio can either be fully reconfigured to use both NFS Volumes or it is possible to edit the configuration file and then restart all services. This procedure describes how to edit the conf file and restart the services. This needs to be repeated on every Trilio Appliance.
Edit the workloadmgr.conf
Look for the line defining the NFS mounts
Add NFS B2 to that as comma-seperated list. Space is not necessary, but can be set.
Write and close the workloadmgr.conf
Restart the wlm-workloads service
Trilio is integrating natively into the Openstack deployment tools. When using the Red Hat director or JuJu charms it is recommended to adapt the environment files for these orchestrators and update the Datamovers through them.
To add the NFS B2 to the Trilio Datamovers manually the tvault-contego.conf file needs to be edited and the service restarted.
Edit the tvault-contego.conf
Look for the line defining the NFS mounts
Add NFS B2 to that as comma-seperated list. Space is not necessary, but can be set.
Write and close the tvault-contego.conf
Restart the tvault-contego service
Trilio backups are using qcow2 backing files, which make every incremental backup a full synthetic backup. These backing files can be made visible using the qemu-img tool.
The MTAuMTAuMi4yMDovdXBzdHJlYW0=
part of the backing file path is the base64 hash value, which will be calculated upon the configuration of a Trilio installation for each provided NFS-Share.
This hash value is calculated based on the provided NFS-Share path: <NFS_IP>/<path> If even one character in the NFS-Share path is different between the provided NFS-Share paths a completely different hash value is generated.
Workloads, that have been moved between NFS-Shares, require that their incremental backups can follow the same path as on their original Source Cloud. To achieve this it is necessary to create the mount path on all compute nodes of the Target Cloud.
Afterwards a mount bind is used to make the workloads data accessible over the old and the new mount path. The following example shows the process of how to successfully identify the necessary mount points and create the mount bind.
The used hash values can be calculated using the base64 tool in any Linux distribution.
Based on the identified base64 hash values the following paths are required on each Compute node.
/var/triliovault-mounts/MTAuMTAuMi4yMDovdXBzdHJlYW1fc291cmNl
and
/var/triliovault-mounts/MTAuMjAuMy4yMjovdXBzdHJlYW1fdGFyZ2V0
In the scenario of this runbook is the workload coming from the NFS_A1 NFS-Share, which means the mount path of that NFS-Share needs to be created and bound to the Target Cloud.
To keep the desired mount past a reboot it is recommended to edit the fstab of all compute nodes accordingly.
Trilio workloads have clear ownership. When a workload is moved to a different cloud it is necessary to change the ownership. The ownership can only be changed by Openstack administrators.
To fulfill the required tasks an admin role user is used. This user will be used until the workload has been restored. Therefore, it is necessary to provide this user access to the desired Target Project on the Target Cloud.
Each Trilio installation maintains a database of workloads that are known to the Trilio installation. Workloads that are not maintained by a specific Trilio installation, are from the perspective of that installation, orphaned workloads. An orphaned workload is a workload accessible on the NFS-Share, that is not assigned to any existing project in the Cloud the Trilio installation is protecting.
The identified orphaned workloads need to be assigned to their new projects. The following provides the list of all available projects viewable by the used admin-user in the target_domain.
To allow project owners to work with the workloads as well will they get assigned to a user with the backup trustee role that is existing in the target project.
Now that all informations have been gathered the workload can be reassigned to the target project.
After the workload has been assigned to the new project it is recommended to verify the workload is managed by the Target Trilio and is assigned to the right project and user.
The reassigned workload can be restored using Horizon following the procedure described here.
This runbook will continue on the CLI only path.
To be able to do the necessary selective restore a few pieces of information about the snapshot to be restored are required. The following process will provide all necessary information.
List all Snapshots of the workload to restore to identify the snapshot to restore
Get Snapshot Details with network details for the desired snapshot
Get Snapshot Details with disk details for the desired Snapshot
The selective restore is using a restore.json file for the CLI command. This restore.json file needs to be adjusted according to the desired restore.
To do the actual restore use the following command:
To verify the success of the restore from a Trilio perspective the restore status is checked.
After the Desaster Recovery Process has finished it is necessary to return the Trilio installation to its original configuration. The following steps need to be done to completely reconfigure the Trilio installation.
During the reconfiguration process will all backups of the Target Region be on hold and it is not recommended to create new Backup Jobs until the Desaster Recovery Process has finished and the original Trilio configuration has been restored.
To add the NFS-Vol2 to the Trilio Appliance cluster the Trilio can either be fully reconfigured to use both NFS Volumes or it is possible to edit the configuration file and then restart all services. This procedure describes how to edit the conf file and restart the services. This needs to be repeated on every Trilio Appliance.
Edit the workloadmgr.conf
Look for the line defining the NFS mounts
Delete NFS B2 from the comma-seperated list
Write and close the workloadmgr.conf
Restart the wlm-workloads service
Trilio is integrating natively into the Openstack deployment tools. When using the Red Hat director or JuJu charms it is recommended to adapt the environment files for these orchestrators and update the Datamovers through them.
To add the NFS B2 to the Trilio Datamovers manually the tvault-contego.conf file needs to be edited and the service restarted.
Edit the tvault-contego.conf
Look for the line defining the NFS mounts
Add NFS B2 to that as comma-seperated list. Space is not necessary, but can be set.
Write and close the tvault-contego.conf
Restart the tvault-contego service
After the Desaster Recovery Process has been successfully completed and the Trilio installation reconfigured to its original state, it is recommended to do the following additional steps to be ready for the next Disaster Recovery process.
The Trilio database is following the Openstack standard of not deleting any database entries upon deletion of the cloud object. Any Workload, Snapshot or Restore, which gets deleted, is marked as deleted only.
To allow the Trilio installation to be ready for another disaster recovery it is necessary to completely delete the entries of the Workloads, which have been restored.
Trilio does provide and maintain a script to safely delete workload entries and all connected entities from the Trilio database.
This script can be found here: https://github.com/trilioData/solutions/tree/master/openstack/CleanWlmDatabase
After all restores for the target project have been achieved it is recommended to remove the used admin user from the project again.
POST
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/<workload_id>/pause
Disables the scheduler of a given Workload
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
POST
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/<workload_id>/resume
Enables the scheduler of a given Workload
GET
https://$(tvm_address):8780/v1/$(tenant_id)/trusts/validate/<workload_id>
Validates the Scheduler trust for a given Workload
All following API commands require an Authentication token against a user with admin-role in the authentication project.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/global_job_scheduler
Requests the status of the Global Job Scheduler
POST
https://$(tvm_address):8780/v1/$(tenant_id)/global_job_scheduler/disable
Requests disabling the Global Job Scheduler
POST
https://$(tvm_address):8780/v1/$(tenant_id)/global_job_scheduler/enable
Requests enabling the Global Job Scheduler
E-Mail Notification Settings are done through the settings API. Use the values from the following table to set Email Notifications up through API.
Setting name | Settings Type | Value type | example |
---|---|---|---|
POST
https://$(tvm_address):8780/v1/$(tenant_id)/settings
Creates a Trilio setting.
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Setting create requires a Body in json format, to provide the requested information.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/settings/<setting_name>
Shows all details of a specified setting
PUT
https://$(tvm_address):8780/v1/$(tenant_id)/settings
Modifies the provided setting with the given details.
Workload modify requires a Body in json format, to provide the information about the values to modify.
DELETE
https://$(tvm_address):8780/v1/$(tenant_id)/settings/<setting_name>
Deletes the specified Workload.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots
Lists all Snapshots.
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
POST
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/<workload_id>
When creating a Snapshot it is possible to provide additional information
This Body is completely optional
GET
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots/<snapshot_id>
Shows the details of a specified Snapshot
DELETE
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots/<snapshot_id>
Deletes a specified Snapshot
GET
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots/<snapshot_id>/cancel
Cancels the Snapshot process of a given Snapshot
POST
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots/<snapshot_id>/mount
Mounts a Snapshot to the provided File Recovery Manager
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
GET
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots/mounted/list
Provides the list of all Snapshots mounted in a Tenant
GET
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/<workload_id>/snapshots/mounted/list
Provides the list of all Snapshots mounted in a specified Workload
POST
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots/<snapshot_id>/dismount
Unmounts a Snapshot of the provided File Recovery Manager
GET
https://$(tvm_address):8780/v1/$(tenant_id)/restores/detail
Lists Restores with details
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
GET
https://$(tvm_address):8780/v1/$(tenant_id)/restores/<restore_id>
Provides all details about the specified Restore
DELETE
https://$(tvm_address):8780/v1/$(tenant_id)/restores/<restore_id>
Deletes the specified Restore
GET
https://$(tvm_address):8780/v1/$(tenant_id)/restores/<restore_id>/cancel
Cancels an ongoing Restore
POST
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots/<snapshot_id>
Starts a restore according to the provided information
The One-Click restore requires a body to provide all necessary information in json format.
POST
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots/<snapshot_id>
Starts a restore according to the provided information.
The One-Click restore requires a body to provide all necessary information in json format.
POST
https://$(tvm_address):8780/v1/$(tenant_id)/snapshots/<snapshot_id>
Starts a restore according to the provided information
The One-Click restore requires a body to provide all necessary information in json format.
POST
https://$(tvm_address):8780/v1/$(tenant_id)/search
Starts a File Search with the given parameters
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
POST
https://$(tvm_address):8780/v1/$(tenant_id)/search/<search_id>
Starts a filesearch with the given parameters
Openstack Administrators should never have the need to directly work with the trusts created.
The cloud-trust is created during the Trilio configuration and further trusts are created as necessary upon creating or modifying a workload.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/trusts
Provides the lists of trusts for the given Tenant.
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
POST
https://$(tvm_address):8780/v1/$(tenant_id)/trusts
Creates a workload in the provided Tenant/Project with the given details.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/trusts/<trust_id>
Shows all details of a specified trust
DELETE
https://$(tvm_address):8780/v1/$(tenant_id)/trusts/<trust_id>
Deletes the specified trust.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/trusts/validate/<workload_id>
Validates the Trust of a given Workload.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/workload_policy
Requests the list of available Workload Policies
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
GET
https://$(tvm_address):8780/v1/$(tenant_id)/workload_policy/<policy_id>
Requests the details of a given policy
GET
https://$(tvm_address):8780/v1/$(tenant_id)/workload_policy/assigned/<project_id>
Requests the lists of Policies assigned to a Project.
POST
https://$(tvm_address):8780/v1/$(tenant_id)/workload_policy
Creates a Policy with the given parameters
PUT
https://$(tvm_address):8780/v1/$(tenant_id)/workload_policy/<policy-id>
Updates a Policy with the given information
POST
https://$(tvm_address):8780/v1/$(tenant_id)/workload_policy/<policy-id>
Updates a Policy with the given information
DELETE
https://$(tvm_address):8780/v1/$(tenant_id)/workload_policy/<policy_id>
Deletes a given Policy
GET
https://$(tvm_address):8780/v1/$(tenant_id)/projects_quota_types
Lists all available Quota Types
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
GET
https://$(tvm_address):8780/v1/$(tenant_id)/projects_quota_types/<quota_type_id>
Requests the details of a Quota Type
POST
https://$(tvm_address):8780/v1/$(tenant_id)/project_allowed_quotas/<project_id>
Creates an allowed Quota with the given parameters
GET
https://$(tvm_address):8780/v1/$(tenant_id)/project_allowed_quotas/<project_id>
Lists all allowed Quotas for a given project.
GET
https://$(tvm_address):8780/v1/$(tenant_id)/project_allowed_quota/<allowed_quota_id>
Shows details for a given allowed Quota
PUT
https://$(tvm_address):8780/v1/$(tenant_id)/update_allowed_quota/<allowed_quota_id>
Updates an allowed Quota with the given parameters
DELETE
https://$(tvm_address):8780/v1/$(tenant_id)/project_allowed_quotas/<allowed_quota_id>
Deletes a given allowed Quota
GET
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/get_list/import_workloads
Provides the list of all importable workloads
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
GET
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/orphan_workloads
Provides the list of all orphaned workloads
POST
https://$(tvm_address):8780/v1/$(tenant_id)/workloads/import_workloads
Imports all or the provided workloads
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
Name | Type | Description |
---|---|---|
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project the Workload is located in
workload_id
string
ID of the Workload to disable the Scheduler in
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project the Workload is located in
workload_id
string
ID of the Workload to disable the Scheduler in
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project the Workload is located in
workload_id
string
ID of the Workload to disable the Scheduler in
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project the Workload is located in
workload_id
string
ID of the Workload to disable the Scheduler in
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project the Workload is located in
workload_id
string
ID of the Workload to disable the Scheduler in
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project the Workload is located in
workload_id
string
ID of the Workload to disable the Scheduler in
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
smtp_default___recipient
email_settings
String
admin@example.net
smtp_default___sender
email_settings
String
admin@example.net
smtp_port
email_settings
Integer
587
smtp_server_name
email_settings
String
Mailserver_A
smtp_server_username
email_settings
String
admin
smtp_server_password
email_settings
String
password
smtp_timeout
email_settings
Integer
10
smtp_email_enable
email_settings
Boolean
True
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Tenant/Project to work with
X-Auth-Project-Id
string
Project to run the authentication against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Project/Tenant where to find the Workload
setting_name
string
Name of the setting to show
X-Auth-Project-Id
string
Project to run the authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Tenant/Project to work with w
X-Auth-Project-Id
string
Project to run the authentication against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Tenant where to find the Workload in
setting_name
string
Name of the setting to delete
X-Auth-Project-Id
string
Project to run the authentication against
X-Auth-Token
string
Authentication Token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Projects to fetch the Snapshots from
host
string
host name of the TVM that took the Snapshot
workload_id
string
ID of the Workload to list the Snapshots off
date_from
string
starting date of Snapshots to show
\
Format: YYYY-MM-DDTHH:MM:SS
string
ending date of Snapshots to show
\
Format: YYYY-MM-DDTHH:MM:SS
all
boolean
admin role required - True lists all Snapshots of all Workloads
X-Auth-Project-Id
string
project to run the authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of the Trilio Service
tenant_id
string
ID of the Tenant/Project to take the Snapshot in
workload_id
string
ID of the Workload to take the Snapshot in
full
boolean
True creates a full Snapshot
X-Auth-Project-Id
string
Project to run authentication against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of the Trilio Service
tenant_id
string
ID of the Tenant/Project to take the Snapshot from
snapshot_id
string
ID of the Snapshot to show
X-Auth-Project-Id
string
Project to run authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to find the Snapshot in
snapshot_id
string
ID of the Snapshot to delete
X-Auth-Project-Id
string
Project to run authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to find the Snapshot in
snapshot_id
string
ID of the Snapshot to cancel
X-Auth-Project-Id
string
Project to run authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project the Snapshot is located in
snapshot_id
string
ID of the Snapshot to mount
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant to search for mounted Snapshots
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgr
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant to search for mounted Snapshots
workload_id
string
ID of the Workload to search for mounted Snapshots
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgr
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project the Snapshot is located in
snapshot_id
string
ID of the Snapshot to dismount
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to fetch the Restores from
snapshot_id
string
ID of the Snapshot to fetch the Restores from
X-Auth-Project-Id
string
Project to run the authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to fetch the restore from
restore_id
string
ID of the restore to show
X-Auth-Project-Id
string
Project to run authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to fetch the Restore from
restore_id
string
ID of the Restore to be deleted
X-Auth-Project-Id
string
Project to run authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of the Trilio service
tenant_id
string
ID of the Tenant/Project to fetch the Restore from
restore_id
string
ID of the Restore to cancel
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to do the restore in
snapshot_id
string
ID of the snapshot to restore
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to do the restore in
snapshot_id
string
ID of the snapshot to restore
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to do the restore in
snapshot_id
string
ID of the snapshot to restore
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to run the search in
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Tenant/Project to run the search in
search_id
string
ID of the File Search to get
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_name
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Tenant / Project to fetch the trusts from
is_cloud_admin
boolean
true/false
X-Auth-Project-Id
string
project to run the authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Tenant/Project to create the Trust for
X-Auth-Project-Id
string
Project to run the authentication against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Project/Tenant where to find the Workload
workload_id
string
ID of the Workload to show
X-Auth-Project-Id
string
Project to run the authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Tenant where to find the Trust in
trust_id
string
ID of the Trust to delete
X-Auth-Project-Id
string
Project to run the authentication against
X-Auth-Token
string
Authentication Token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Project/Tenant where to find the Workload
workload_id
string
ID of the Workload to validate the Trust of
X-Auth-Project-Id
string
Project to run the authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project
policy_id
string
ID of the Policy to show
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project
project_id
string
ID of the Project to fetch assigned Policies from
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to do the restore in
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to do the restore in
policy_id
string
ID of the Policy to update
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to do the restore in
policy_id
string
ID of the Policy to assign
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project
policy_id
string
ID of the Policy to delete
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of Tenant/Project to work in
quota_type_id
string
ID of the Quota Type to show
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to work in
project_id
string
ID of the Tenant/Project to create the allowed Quota in
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to work in
project_id
string
ID of the Tenant/Project to list allowed Quotas from
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to work in
<allowed_quota_id>
string
ID of the allowed Quota to show
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to work in
<allowed_quota_id>
string
ID of the allowed Quota to update
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio service
tenant_id
string
ID of the Tenant/Project to work in
<allowed_quota_id>
string
ID of the allowed Quota to delete
X-Auth-Project-Id
string
Project to authenticate against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Tenant/Project to work in
project_id
string
restricts the output to the given project
X-Auth-Project-Id
string
project to run the authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of Trilio Service
tenant_id
string
ID of the Tenant/Project to work in
migrate_cloud
boolean
True also shows Workloads from different clouds
X-Auth-Project-Id
string
project to run the authentication against
X-Auth-Token
string
Authentication token to use
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
tvm_address
string
IP or FQDN of the Trilio Service
tenant_id
string
ID of the Tenant/Project to take the Snapshot in
X-Auth-Project-Id
string
Project to run authentication against
X-Auth-Token
string
Authentication token to use
Content-Type
string
application/json
Accept
string
application/json
User-Agent
string
python-workloadmgrclient
To gracefully shutdown/restart the Trilio cluster the following steps are recommended.
It is recommended to verify that no snapshots or restores are running on the Trilio Cluster.
Stopping or restarting the Trilio cluster will cancel all running actively running backup or restore jobs. These jobs will be marked as errored after the system has come up again.
This can be verified using the following two commands:
The Trilio cluster is using the pacemaker service for setting the VIP(s) of the cluster and controlling the active node for the wlm-cron service. The identified node will be the last to shut down in case that the whole cluster gets shut down.
This can be checked using the following command:
In the following example is the master node the tvm1
A single node in the cluster can be shut down or restarted without issues. All services will come up and the RabbitMQ and Galeera service will rejoin the remaining cluster.
When the master node gets shutdown or restarted the VIP(s) and the wlm-cron service will switch to one of the remaining cluster nodes.
To speed up the shutdown/restart process it is recommended to stop the Trilio services, the RabbitMQ service, and the MariaDB service on the node.
The wlm-cron service and the VIP(s) are not getting stopped when only the master node gets rebooted or shut down. The pacemaker will automatically move the wlm-cron service and the VIP(s) to one of the remaining nodes.
After the services have been stopped the node can be restarted or shut down using standard Linux commands.
Restarting the whole cluster node by node follows the same procedure as restarting a single node, with the difference that each restarted node needs to be fully started again before the next node can be restarted.
When the complete cluster needs to get stopped and restarted at the same time the following procedure needs to be completed.
The procedure on a high level is:
Shutdown the two slave nodes
Shutdown the master node
Start the master node
Enable the Galeera cluster
Start the two slave nodes
Before shutting down the two slave nodes it is recommended to stop running Trilio services, the RabbitMQ server, and the MariaDB on the nodes.
Afterward, the nodes can be shut down.
Before shutting down the master node it is recommended to stop running Trilio services, the RabbitmQ server, the MariaDB, the wlm-cron and the VIP(s) resource in Pacemaker.
Afterward, the node can be shut down.
The first server that is getting booted will be the master node. It is highly recommended that the old master node will be booted first again.
Not booting the old mater node first again can lead to data loss when the Galeera Cluster is restarted.
Login into the freshly started master node and run the following command. This will restart the Galeera cluster with this node as master.
After the master node has been booted and the Galeera cluster started the remaining nodes can be started and will automatically rejoin the Trilio cluster.