Population of Global Cache
You can manually pre-populate the global cache so that it is not empty when a remote job runs for the first time. By the time the remote job starts, the global cache will already contain data blocks that can be used for data deduplication.
Population of the global cache can be helpful in the following scenarios:
- First run of a remote job. When you run the first session of a remote job, the global cache is empty, and all VM data must be transferred over the WAN. It is recommended that you populate the global cache before you start a remote job for the first time.
- Global cache corruption. If the global cache becomes corrupted, Veeam Backup & Replication needs to perform at least one remote job session to replace corrupted data blocks with valid data blocks. In this situation, you can clean the global cache and populate it with valid data before a remote job begins.
Limitations for Population of Global Cache
The global cache population task has the following limitations:
- Veeam Backup & Replication does not use encrypted backups for global cache population.
- You can start the global cache population task only for a target WAN accelerator that is not currently used by any remote job.
- While the global cache population task is running, the corresponding target WAN accelerator is locked, and you cannot start any remote job that uses this target WAN accelerator.
- [For global cache corruption scenario] You must clean the global cache before you populate it with valid data. If the global cache contains data blocks, Veeam Backup & Replication will fail to perform the population task.
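The limitations above can be read as preconditions. The following is a minimal Python sketch of those checks, not Veeam code: the `TargetAcceleratorState` class, its fields, and the function names are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass
class TargetAcceleratorState:
    """Hypothetical snapshot of a target WAN accelerator (not a Veeam type)."""
    active_remote_jobs: int = 0       # remote jobs currently using the accelerator
    population_running: bool = False  # True while a population task holds the lock
    cache_corrupted: bool = False     # True in the global cache corruption scenario
    cached_blocks: int = 0            # data blocks currently in the global cache


def population_blockers(state: TargetAcceleratorState) -> list[str]:
    """Return the reasons why the population task cannot start (empty if none)."""
    blockers = []
    if state.active_remote_jobs > 0:
        blockers.append("target WAN accelerator is in use by a remote job")
    # Corruption scenario: the cache must be cleaned before it is populated.
    if state.cache_corrupted and state.cached_blocks > 0:
        blockers.append("corrupted global cache must be cleaned first")
    return blockers


def can_start_remote_job(state: TargetAcceleratorState) -> bool:
    """A running population task locks the target WAN accelerator."""
    return not state.population_running


def usable_backups(backups: list[dict]) -> list[dict]:
    """Encrypted backups are not used for global cache population."""
    return [b for b in backups if not b["encrypted"]]
```

For example, `population_blockers(TargetAcceleratorState(active_remote_jobs=1))` returns a single reason, and `can_start_remote_job` returns `False` while `population_running` is set.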
How Population of Global Cache Works
Global cache population is a manual operation performed by the user. When you run the global cache population task, Veeam Backup & Replication creates a ‘default cache’ on the target WAN accelerator. The default cache is used as a basic, universal cache for every new remote job.
To populate the default cache, Veeam Backup & Replication uses backup files stored on backup repositories as a source of data. Veeam Backup & Replication writes only operating system (OS) data blocks to the default cache. Application data blocks are not written to the cache.
The procedure of global cache population includes the following steps:
- The user starts the global cache population task and selects backup repositories from which data blocks should be retrieved.
- Veeam Backup & Replication scans backup repositories and compiles a list of OSes whose data blocks are available in backup files.
- Veeam Backup & Replication copies data blocks from backup repositories and populates the default cache with these data blocks.
When a remote job starts, Veeam Backup & Replication renames the default cache to a folder for the source WAN accelerator engaged in the job. As a result, Veeam Backup & Replication can use data blocks in this folder for deduplication starting from the very first session of a remote job.
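The procedure for an empty global cache can be modeled roughly as follows. This is a conceptual Python sketch under stated assumptions, not the product's implementation: the global cache is represented as a dictionary of folders, each backup as a dictionary with hypothetical `os`, `encrypted`, `os_blocks`, and `app_blocks` keys, and the folder name `"default"` stands in for the default cache.

```python
def populate_empty_cache(repositories, cache):
    """Fill cache["default"] with OS data blocks found in the selected repositories.

    repositories: list of repositories, each a list of backup dicts (assumed shape).
    cache: dict mapping folder name -> {os_name: set of data blocks}.
    Returns the sorted list of OSes found during the scan.
    """
    # Scan the repositories and list the OSes whose blocks are available.
    os_blocks = {}
    for repository in repositories:
        for backup in repository:
            if backup["encrypted"]:
                continue  # encrypted backups are not used for population
            # Only OS blocks are collected; "app_blocks" is deliberately ignored.
            os_blocks.setdefault(backup["os"], set()).update(backup["os_blocks"])

    # Copy the collected blocks into the default cache.
    cache["default"] = {name: set(blocks) for name, blocks in os_blocks.items()}
    return sorted(os_blocks)


def start_remote_job(cache, source_accelerator):
    """Rename the default cache to the folder of the source WAN accelerator."""
    if "default" in cache and source_accelerator not in cache:
        cache[source_accelerator] = cache.pop("default")
    # The job deduplicates against whatever this folder already holds.
    return cache.setdefault(source_accelerator, {})
```

In this model, a job that starts after population sees a non-empty folder for its source WAN accelerator, which is what allows deduplication from the first session.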
 
 
In many-to-one WAN accelerator deployment scenarios, the global cache may have folders for other source WAN accelerators, and these folders may contain data blocks for some OSes. If the global cache already contains some data, the procedure of global cache population includes the following steps:
- The user starts the global cache population task and selects backup repositories from which data blocks should be retrieved.
- Veeam Backup & Replication scans backup repositories and compiles a list of OSes whose data blocks are available in backup files.
- Veeam Backup & Replication scans folders for other source WAN accelerators in the global cache and compiles a list of OSes whose data blocks are available there.
- Veeam Backup & Replication compares the list of OSes in the global cache with the list of OSes on backup repositories. This way, it determines which OSes have no data blocks in the global cache.
- In the global cache, Veeam Backup & Replication detects a folder with the maximum amount of data. This folder is used as a basis for the default cache.
- Veeam Backup & Replication copies data blocks only for missing OSes from backup repositories and populates the default cache with these blocks. Data blocks for OSes available in folders for other source WAN accelerators are not copied to the default cache during the population task. Veeam Backup & Replication copies these data blocks on the fly, when a remote job runs. For more information, see Many to One WAN Acceleration.
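The many-to-one procedure above can be sketched in the same style. This is an illustrative Python model only: the dictionary layout, the function name, and the choice to measure a folder by its block count are assumptions, and "used as a basis" is modeled here as copying the largest folder's contents into the default cache.

```python
def populate_with_existing_cache(repo_os_blocks, cache):
    """Build cache["default"] when folders for other source accelerators exist.

    repo_os_blocks: {os_name: set of blocks} found on the backup repositories.
    cache: {folder_name: {os_name: set of blocks}}, assumed to be non-empty.
    Returns (basis_folder, sorted list of OSes copied from the repositories).
    """
    folders = {name: data for name, data in cache.items() if name != "default"}

    # List the OSes already present in folders for other source accelerators.
    cached_oses = set()
    for contents in folders.values():
        cached_oses.update(contents)

    # Compare both lists to find the OSes missing from the global cache.
    missing = set(repo_os_blocks) - cached_oses

    # Pick the folder holding the most data as the basis for the default cache.
    def folder_size(name):
        return sum(len(blocks) for blocks in folders[name].values())

    basis = max(folders, key=folder_size)
    default = {name: set(blocks) for name, blocks in folders[basis].items()}

    # Copy blocks only for the missing OSes; blocks for OSes held in other
    # folders are left to be copied on the fly when a remote job runs.
    for os_name in missing:
        default[os_name] = set(repo_os_blocks[os_name])

    cache["default"] = default
    return basis, sorted(missing)
```

With folders `src-a` (three Windows blocks) and `src-b` (one Ubuntu block), and repositories that hold Windows and Debian blocks, this sketch picks `src-a` as the basis and copies only the Debian blocks from the repositories.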
 
 