Best Practices for Cloud Backup
Best Practices for Cloud Backup
- Purpose of this Guide
- Document Conventions
- Key Concepts
- Choosing What to Backup
- Backup and Restore Best Practices
- Frequently Encountered Issues
This Guide shows you how to get the most out of Rackspace Cloud Backup well and addresses frequently encountered scenarios. Working through this guide, you will better understand the key backup concepts, how to make smart choices about what to backup (and how often), how to make the most of data restoration, and how to resolve the most commonly encountered Cloud Backup issues.
|Best Practice. Look to these when you need to make good backup decisions.|
|Caution. Consider the potential outcomes before you do these actions.|
|Do Not Attempt. These are warnings against possible future trouble.|
Knowing the language of backup goes a long way towards helping you make informed decisions about your backup operations.
- Backup—a spare copy of data to be used in the event of a failure or loss of the original.
- Backup Configuration—instructions to the backup agent that indicate 1) what you want to backup and 2) how often you'd like to save that information.
- Block—a chunk of data from a file.
- Bundle—several blocks packaged together.
- Cloud Backup Agent—a program installed on your server that knows how to perform backups and restores.
- Data Churn—how often a file changes.
- Encryption—a method of scrambling the contents of data using a key or a password so that only those that have the password or key can read the data.
- Restore—to bring your system back to a previously saved state, usually using a backup as the checkpoint.
- Snapshot—a checkpoint of system data/state, usually a backup.
- The Vault—disk space that contains information about every block of data that you've ever stored. It is shared between all of your backup configurations.
Our best guidance is not what to backup, but what not to backup. The three items that should not be backed up through the Cloud Backup Agent:
- Caches and session files
- Frequently changing files, such as logs
The Cloud Backup Agent tries to be helpful, and skips the below types of files automatically. You may, however, manually add them to your backup.
- Memory-only file systems (Linux: /proc, etc. )
- Cloud Backup agent data directories
- Recovery information (Windows)
- Recycle Bin (Windows)
- desktop.ini and thumbs.db (Windows)
These file types either change too rapidly (databases, logs, caches) or don't exist long enough to be backed up (session files). Session files should be avoided entirely. And if the information is valuable to your business, log files should track it. Caches should also be avoided, as their data is meant to be discarded.
If you do need to back up these files, our suggested workarounds are:
- Databases—Take a snapshot of the database (e.g., a database dump) and back up the dump.
- Log files—Take snapshots of your log files and back them up.
- To avoid running out of disk space, rotate your log files periodically.
|Do not back up session files or caches at all. Don't back up databases or log files directly; use a snapshot instead.|
There are many ways to configure backups and restores. To make Cloud Backup work best for you, it helps to understand some of the tradeoffs you make when you configure the many options available to you.
NOTE: With Performance Cloud Servers, Cloud Monitoring will only monitor the system disk. The data disk is not monitored. For data disk backup, Rackspace Cloud Backup or Cloud Block Storage should be used.
First off, choose your contact email carefully when you configure your backup. If anything goes wrong, you will be alerted there first.
When trying to determine how often to back up a file, there are three variables to consider:
- Criticality—How important is tracking changes in the file to your business processes?
- Size—How large is the file?
- Data Churn—How often does the file change?
Criticality is the most important factor to consider when making backup decisions. The more critical the file is to your business operations, the more often you'll want to back this file up.
Size is an important consideration if the speed of backups/restores is important to you. Large files take longer to backup and to restore. It may be wise to backup large files less frequently.
Data churn is the last variable to consider, and perhaps the trickiest to handle. Files that change often invalidate blocks that have been stored previously. Depending on criticality, it may still be wise to snapshot files with high data churn and backup those snapshots.
|Back up less often files that have...||Back up more often files that have...|
|Lower Criticality||Higher Criticality|
|High Data Churn||Low Data Churn|
|Larger Size||Smaller Size|
|Do not make decisions about backup frequency lightly. If you try to backup or restore files more frequently than the backup engine can keep up with, anomalous behavior may result.|
It is a good idea to periodically test your restores to make sure that they are working as you expect. You do not want to get in the situation where needed backups aren’t available because you haven't been configured as you expected.
Test your restores periodically to make sure that your data is saved as you expect it to be.
Another point to consider is the restore destination. Restoring to the original location and overwriting saves on storage space, but you risk accidentally overwriting important existing files. Proceed with caution when restoring.
Encryption is important for keeping your data confidential. However, encryption has its costs. It takes significantly longer to backup and restore encrypted data. Consider if the data you are storing must be encrypted. If not, then it is best to proceed without encryption. Encryption applies across the board, and once you encrypt a backup server, you may not remove it.
|Once you've set a server to encrypt data, all backups performed on that server will be encrypted.|
This section is similar to an FAQ. If you encounter a problem and need immediate help, look it up here first. There's a good chance a solution already exists.
To minimize your chances of issues, keep your Backup Agent updated. Many errors and issues are fixed between releases.
Issue—My backups are getting randomly corrupted. Why?
- Does your server have a backup agent and did you clone it to create a new backup system? This means that two backup agents exist with the same credentials writing to the same Vault.
Solution—Ensure the agent on the cloned backup server is re-registered before any backups are run.
Issue—My backup is experiencing network errors.
Solution—Make sure that your backup server has a connection to both Service Net and public net. If it is on an isolated network, the backup agent will not be able to function properly.
Issue—Sometimes, my backups fail.
- This is most commonly caused by either a failure to communicate with Cloud Files, running out of disk space, or a failure to communicate with Cloud Backup.
- Sometimes, the agent may fail to backup a particular file because of a permissions error. Either the file is in use when the agent attempts to save it or the file in question cannot be read by the agent. Consider permissions when hunting for the root cause of a backup failure.
Solution—Ensure you're running the latest agent release. Then, attempt to determine the cause of the error. Try the backup/restore again if it is an intermittent error. We're always working on making Cloud Backup more robust.
Issue—My backup/restore is slow. What can I do?
- If your backup or restore is encrypted, it will be especially slow. Encryption comes at a cost. Otherwise, look to the section above, “What Should I Not Save?”. The less there is to save/restore, the faster it will be.
- We're always working on optimizing the agent to make backup and restore faster.
Carry on the conversation in the Rackspace Community.
© 2015 Rackspace US, Inc.
Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License
See license specifics and DISCLAIMER