If you want your backup strategy to save your money instead of wasting it, setting up a proper backup retention policy will be the core element to focus on.
We witness the following mistake so often: businesses back up everything they have and keep it for as long as possible. As a result, they end up with enormous bills for storage or compliance violations.
A good backup retention policy solves this problem by determining what to keep and to delete. Thanks to this, you can find and restore the necessary files much quicker while automatically getting rid of the data clutter.
But how do you determine what data to keep and delete, and when? As a data protection company, we’ve composed a list of data backup retention policy best practices and tried-and-true tips. We’ve saved the most useful bits of advice for the end, so be sure to stick with us!
Table of Contents
So, What Is a Retention Policy?
A retention policy is a protocol that defines the lifecycle of data in an organization.
This lifecycle describes the following things:
- For how long the organization will retain a piece of information;
- How this information will be stored;
- What data should be stored and why;
- When to dispose of the particular data.
A retention policy is crucial for businesses of every size. It helps you manage your data and backups, allowing you to control your records’ growth. Not having this policy will, at the very least, result in you spending lots of money on the storage for unnecessary files. In the worst-case scenario, not having a retention policy may lead you to break the law by not keeping some data long enough or keeping it for no good reason.
Also, a thorough backup retention policy helps you to quickly find the information you need so you can restore it or present it as evidence in a legal case.
Two Things That Define Your Retention Policy
To create a data retention policy, you need to know two things:
- Business needs the retention policy must solve for your organization;
- Compliance regulations regarding data applied to your organization.
How do you find out these things? Simply by seeking assistance from the law department and C-level management of your company.
Compliance regulations that apply to your organization
There are different compliance regulations that can be applied to your data. There are regional, country-specific, or industry-related laws. To comply with these, you may need to keep some data sets for years in data centers located in specific regions.
Sometimes, these regulations can even be mutually exclusive. In these cases, you need to address these issues with your law department or outsource compliance experts.
Read more about the most common data compliance regulations here→
Company business needs
To function properly, most companies rely on operational day-to-day data flow like emails, spreadsheets, text documents, etc. By backing up this business-critical information, you secure it from data losses and reduce the potential downtime due to disruptions that usually cost businesses a fortune.
The time for which you have to store this data depends solely on your business goals. The mistake many companies make is keeping this type of data for as long as possible. It feels like a safe choice, but it will only take up space and computing resources by piling up your storage with useless information.
To avoid this mistake, answer the following questions:
- What data to keep?
- Why do we need to keep it?
- For how long do we need to keep it?
If you aren’t sure about the answers, direct these questions to your company’s management/business department.
Backup Retention Policy: Best Practices to Follow
1. Classify data by type and needs
As we pointed out earlier, your backup retention policy will be defined largely by external (law) and internal (company) needs. By this division, you understand the ultimate goal of every data piece and what the company needs they cover.
You may never need to use some data, but it has to stay put in a secure place for legal purposes. Meanwhile, the need for business continuity might dictate you to come back (recover) other data ASAP in the event of an incident.
When you categorize data this way, your retention policy’s other components will naturally fall into place. Plus, you get to understand the Recovery Time Objective (RTO) and Recovery Point Objective (RPO) for your data, which are the core of your disaster recovery strategy.
Here is how you need to classify data:
- What data is valuable from the point of compliance regulations;
- What data is valuable from the point of your business needs;
- What data refers to public, proprietary, or confidential information.
2. Categorize data by lifecycle
Again, the lifecycle of backed up data is strictly determined by the compliance requirements and business goals.
Not all data should be stored for the same period. Moreover, some of the data simply can’t be stored and must be erased after a certain period of time. For one set of data, you may need to archive it for ten years and delete it right after this period has expired. For the other set of data, it must be is more reasonable and cost-effective to delete it after some time because it’s outdated and not susceptible to any regulations.
By defining this, you can create a separate backup plan for each dataset depending on the time they should be kept and their purposes.
Here is how you can categorize data by their lifecycle:
- Records that should be retained for up to six months;
- Records that should be retained for one year;
- Records that should be retained for up to three years, and so on.
3. Decide what and when to delete
Deleting data in time is one of those critical rules companies often fail to follow because it may feel counterintuitive. And yet, holding onto data when you should’ve deleted it potentially puts your company in hot water.
Here are the possible implications you may experience for not deleting data in time:
- Putting your company at risk of legal proceedings and penalties for non-compliance;
- Risking your client’s data security and your reputation;
- Cluttering and overburdening your hardware/software with unnecessary data;
- Spending money on extra storage occupied with data that has no value for the company;
- Making data navigation too complicated.
Some compliance regulations make it obligatory to schedule the secure deletion of data that falls under certain categories and matches certain criteria. For example, article 5 of the General Data Protection Regulation (GDPR) states that organizations must destroy personal data that are proven to be no longer required for business or legal purposes.
By defining what data should be deleted and setting up the Purge function, you automatically erase files that match certain criteria. But we recommend keeping at least one last version of the file on the off-chance unless the law determines otherwise.
4. Define the number and type of versions to store
The backup version is a copy of the original file that contains all changes that have been made to a document.
Use these parameters of the versioned backup:
By the number of versions to store:
- Additional (inactive) versions
- Last versions of files that have been deleted
By the amount of time to store:
- Existing data
- Deleted data
5. Decide about the types of backup and their frequency
There are three types of backup:
- Full backup – a full copy of all existing files;
- Differential backup – a copy of all changes made from the last full backup;
- Incremental backup – a copy of all changes since the last backup of any kind (full, differential, or incremental).
To understand which backup type will be the most beneficial for your business, read our guide to Full vs. Differential vs. Incremental Backups.
6. Choose a compliant and cost-effective data backup service
Unless you have already set up on-premises systems to back up your data or chosen a third-party service, you’ll need to select a reliable backup service. There are many secure backup providers on the market, and you need to choose the one that meets your needs perfectly.
Here are the key things to consider when choosing a perfect backup provider:
1. Cloud-to-cloud or on-premises. There are many pros and cons to both options, but we are of the opinion that if you keep your data in the cloud services like Google or Microsoft, the cloud-to-cloud backup is the best option for your business. We justify our opinion in this article.
2. Scalability and flexibility. By these words, we mean the ability to start from the minimum number of licenses and scale it when your business grows. Some backup services have prefixed numbers of licenses you have to buy to start using the service, which may be a waste of money for small businesses.
The SaaS subscription model with monthly payments and a minimum number of licenses to start from is usually the most cost-effective option for most businesses due to its flexibility.
3. Type of backup and restore. For small-to-medium businesses that keep data in the cloud, a backup service with an incremental-based backup model, granular restore, and version control will be the best fit.
To get the full list, read our guide →How to Choose a Perfect Backup Service in 10 Steps
P.S.: To see how professional backup service works: