HP (Hewlett-Packard) D2D manual

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131

Go to page of

A good user manual

The rules should oblige the seller to give the purchaser an operating instrucion of HP (Hewlett-Packard) D2D, along with an item. The lack of an instruction or false information given to customer shall constitute grounds to apply for a complaint because of nonconformity of goods with the contract. In accordance with the law, a customer can receive an instruction in non-paper form; lately graphic and electronic forms of the manuals, as well as instructional videos have been majorly used. A necessary precondition for this is the unmistakable, legible character of an instruction.

What is an instruction?

The term originates from the Latin word „instructio”, which means organizing. Therefore, in an instruction of HP (Hewlett-Packard) D2D one could find a process description. An instruction's purpose is to teach, to ease the start-up and an item's use or performance of certain activities. An instruction is a compilation of information about an item/a service, it is a clue.

Unfortunately, only a few customers devote their time to read an instruction of HP (Hewlett-Packard) D2D. A good user manual introduces us to a number of additional functionalities of the purchased item, and also helps us to avoid the formation of most of the defects.

What should a perfect user manual contain?

First and foremost, an user manual of HP (Hewlett-Packard) D2D should contain:
- informations concerning technical data of HP (Hewlett-Packard) D2D
- name of the manufacturer and a year of construction of the HP (Hewlett-Packard) D2D item
- rules of operation, control and maintenance of the HP (Hewlett-Packard) D2D item
- safety signs and mark certificates which confirm compatibility with appropriate standards

Why don't we read the manuals?

Usually it results from the lack of time and certainty about functionalities of purchased items. Unfortunately, networking and start-up of HP (Hewlett-Packard) D2D alone are not enough. An instruction contains a number of clues concerning respective functionalities, safety rules, maintenance methods (what means should be used), eventual defects of HP (Hewlett-Packard) D2D, and methods of problem resolution. Eventually, when one still can't find the answer to his problems, he will be directed to the HP (Hewlett-Packard) service. Lately animated manuals and instructional videos are quite popular among customers. These kinds of user manuals are effective; they assure that a customer will familiarize himself with the whole material, and won't skip complicated, technical information of HP (Hewlett-Packard) D2D.

Why one should read the manuals?

It is mostly in the manuals where we will find the details concerning construction and possibility of the HP (Hewlett-Packard) D2D item, and its use of respective accessory, as well as information concerning all the functions and facilities.

After a successful purchase of an item one should find a moment and get to know with every part of an instruction. Currently the manuals are carefully prearranged and translated, so they could be fully understood by its users. The manuals will serve as an informational aid.

Table of contents for the manual

  • Page 1

    1 HP D2D Backup Systems Best practices f or VTL, NAS and Replication impl ementations Table of contents Abstract .............................................................................................................................................. 4 Related products ...........................................................................[...]

  • Page 2

    2 Fibre Channel configuration ............................................................. 19 Fibre Channel topologies ................................................................................................................ 19 Switched fabric ..................................................................................................[...]

  • Page 3

    3 Seeding and why it is required ........................................................................................................ 55 Seeding methods in more detail ....................................................................................................... 57 Replication and other D2D operations ..................................[...]

  • Page 4

    4 Abstract The HP StorageWorks D2D Backup Sy stem products with Dynamic Data Deduplic ation are Virtual Tape library and NAS appliances designed to provide a cost- effective, consolidated backup sol ution for business data and fast restore of data in the event of loss. In order to get th e best performance from a D2D Backup System there are so me c[...]

  • Page 5

    5 Executive summ ary This document contains detailed information on best practices to get good performanc e from an HP D2D Backup System with HP StoreOnce Deduplication Technology. HP StoreOnce Technology is designed to increase the amount of historical backup d ata that can be stored without increasing the disk space needed. A backup p roduct usin[...]

  • Page 6

    6 NAS best pract ices at a glan ce Configure multiple shares and separate data types into their own shares. Adhere to the suggested maximum number of concurrent operations per share/appliance. Choose disk backup file sizes in backup software to meet the maximum backup size. Disable software compression, deduplication and synthetic full backups. Do [...]

  • Page 7

    7 HP StoreOnce Techn ology A basic understanding of t he way that HP StoreOnce Tec hnology works is necessary in order to understand factors th at may impact performance of the overall system a n d to ensure optimal performan ce of your backup solution. HP StoreOnce Technology is an “inline” data deduplication process. It uses hash- based chunk[...]

  • Page 8

    8 Appended backups need to “clone” the cartridge on the target side, so performance of a ppended tape replication will not be significantly faster than replicating the whole cartridge. If a lot of similar data exists on remote office D2D libraries, replicating these into a single target library will give a better deduplication ratio on the targ[...]

  • Page 9

    9 hold-off to prevent impacting the performance of other operations. It is, however, important to note that the hold- off is not binary, (i.e. on or off) so, even if backup jobs are in process, some low level of housekeeping will still take place which may have a slight impact on backup performance. Housekeeping is an important process in order to [...]

  • Page 10

    10 The following graph illustrates only the relationship between the number of active data streams and performance. It is not based on real data. Data compressio n and encryption b ackup appl ication featu res Both software compression and encryption will randomize the source data and will, therefore, not result in a high deduplication ratio for th[...]

  • Page 11

    11 Network configu ration Al l D2D appliances have two 1GBit Ethernet ports, the D2D4312 and D2D4324 appliances also have two 10GBit Ethernet ports. The Ethernet ports are used for data transfer to iSCSI VTL devices and CIFS/NFS shares and also for management access to the Web Management In terface. In order to deliver best performance when backing[...]

  • Page 12

    12 Single Port mod e 1Gbit ports: use this mode only if no other ports are available on the switch network or if the appliance is being used to transfer data over fibre channel ports only. On an HP D2D4312 or D2D4324 with 10Gbit ports it is possible that a single 10Gbit port will deliver good performance in most environments. Network configuration,[...]

  • Page 13

    13 Dual Port mode Use this mode if: Servers to be backed up are split across two physical networks which need independent access to the D2D appliance. In this case virtual libraries and shares will be a vailable on b oth network ports; the host configuration defines which port is used. Separate data (“Network SAN”) and management LANs are being[...]

  • Page 14

    14 High availability p ort mode (Port f a ilover) In this mode, no special switch configuration is required other than to ensure that both Ethernet ports in the pair from the D2D appliance are connected to the same switch. This mode sets up “bonded” network ports, where both network ports are connected to the same physical switch and behave as [...]

  • Page 15

    15 10Gbit Ethernet ports on the 4312/43 24 appliances 10Gbit Ethernet is provided as a viable alternative to the Fibre Channel interface for providing maximum VTL performance and also comparable NAS performance. When using 10Gbit Ethernet it is common to configure a “Network SAN” , which is a dedicated network for backup that is separate to the[...]

  • Page 16

    16 Broadly there are two possible configurations which allow both: Access to the Active Directory server for AD authentication and Separation of Corporate LAN and Network SAN traffic Option 1: HP D2D Backup System on Corporate SAN and Network SAN In this option, the D2D device has a port in the Corporate SAN which has access to the Active Directory[...]

  • Page 17

    17 Option 2: HP D2D Backup System on Network SAN only with Gateway In this option the D2D has connections only to the Network SAN, but there is a network router or Gateway server providing access to the Active Directory domain controller on the Corporate LA N. In order to ensure two -way communication between the Network SAN and Corp orate LA N the[...]

  • Page 18

    18 Backup server networking It is important to consider the whole network when considering backu p performance, any server acting as a backup server should be configured where possible with multiple network ports that are teamed / bonded in order to provide a fast connection to the LAN. Client servers (those that back up via a backup server) may be[...]

  • Page 19

    19 Fibre Channel config uration Fibre Channel topol ogies The D2D appliances support both switched fabric and direct attach (private loop) topologies. Direct Attach (point to point) topology is not supported. Switched fabric using NPIV (N Port ID Virtualisation) offers a number of ad vantages and is the preferred topology for D2D appliances. Switch[...]

  • Page 20

    20 Direct Attach (Priv ate Loop) A direct attach (private loop) topology is implemented by connecting the D2D appliance ports directly to a Host Bus Adapter (HBA). In this configuration the Fibre Channel private loop protocol must be used. Fibre Channel, direct attach (private loop) topology Virt ual Lib rary 1 (e.g . D2D G eneric ) Virt ual Lib ra[...]

  • Page 21

    21 Zoning may not always be required for configurations that are alrea dy small or simple. Typically the larger the SAN, the more zoning is needed. Use the following guidelines to determine how and when to use zoning. Small fabric (16 ports or less) — may not need zoning. Small to medium fabric (16 - 128 ports) — use host-centric zoning. Host-c[...]

  • Page 22

    22 Another page on the Configuration – Fibre Channel page of the Web Management interface shows the status for all the FC devices that are configured on the D2D appliance. It lists the connection state, port ID, Port type and number of logins for each virtual library and drive connection. This page is mainly for in f ormation and is useful in tro[...]

  • Page 23

    23 A mixture of iSCSI and FC port virtual libraries and NAS shares can be configured on the same D2D appliance to balance performance needs. Sizing solutions The following diagram provides a simple sizing guide for the HP D2D Generation 2 product family for backups and backups and replication. HP D2D Backup System Gen 2 sizing guide Daily backup: t[...]

  • Page 24

    24 The use of thi s tool enables accurate capacity sizing , retention period decisions and replica tion link sizing and performance for the most complex D2D environments. A fully worked example using the Sizing Tool and best practices is contained later in the document, see Appendix B.[...]

  • Page 25

    25 VTL best practic es Summary of best practices Tape drive emulation types have no effect on performance or functionality. Configuring multiple tape drives per library enables multi-streaming operation per library for good aggregate performance. Do not exceed the recommended maximum concurrent backup streams per libra ry and appliance if maximum p[...]

  • Page 26

    26 cartridges that can be configured per library has also increased compared to G1 products. The table below lists the key parameters for both G1 and G2 products. To achieve best performance the recommended maximum concurrent backu p streams per library and appliance in the table should be followed. As an example, while it is possible to configure [...]

  • Page 27

    27 A similar limitation exists for Fibre Channel. Although there is a theoretical limit of 255 devices per FC port on a host or switch, the actual limit appears to be 128 for many switches and HBAs. You should either balance drives across FC ports or configure less than 128 drives per library . Some backup applications will deliver less than optimu[...]

  • Page 28

    28 Some minor setting changes to upstream infrastructure might be required to allow backups with greater than 256 KB block size to be performed. For example, Microsoft’s iSCSI initiator implementation, by defau lt, does not allow block sizes that are greater than 256 KB. To use a block size greater than this you need to modify the following regis[...]

  • Page 29

    29 Overwrite versus append of media Overwriting and appending to cartridges is also a concept where virtual tape has a benefit. With physical media it is often sensible to append multiple backup jobs to a single cartridge in order to redu ce media costs; the downside of this is that cartridges cannot be overwritten unt il the retention policy for t[...]

  • Page 30

    30 D2D NAS best pra ctices Introduction to D2D N AS backup tar ge ts The HP StorageWorks D2D Backup System now supports the ability to create a NAS (CIFS or NFS) share to be used as a target for backup applications. The NAS shares provide data deduplication in order to make efficient use of the physical disk capacity when performing backup workload[...]

  • Page 31

    31 Shares and dedu plication stor es Each NAS shar e created on the D2D system has its own deduplication “store”; any data backed up to a share will be deduplicated against all of the other data in that store, there is no option to create non-deduplicating NAS shares and there is no deduplication between differen t shares on the same D2D. Once [...]

  • Page 32

    32 The number of concurrently open files in the table above do not guarantee that the D2D will perform optimally with this number of concurrent backups, nor do they take into account the fact that host systems may report a file as having been closed before the actual close takes place, this means that the limits provided in the table could be excee[...]

  • Page 33

    33 If a write- in -place operation does occur, the D2D will create a new backup item that is not deduplicated, a pointer to this new item is then created so that when the file is read the new write - in -place item will be accessed instead of the original data within the backup file. Backup size of data file with write- in -place item If a backup a[...]

  • Page 34

    34 Backup job time, assuming no housekeeping or replication windows are set Disk space pre-allocation Some backup applications allow the user to choose whether to “pre - al locate” the disk space for each file at creation time, i.e. as soon as a backup file is created an empty file is created of the maximum size that the backup file can reach. [...]

  • Page 35

    35 below ensures that multiple backups or streams can run concurrently whilst remaining within the concurrent file limits for each D2D share. Multiple servers, single stream backup Multiple servers, multi-stream backup Multiple servers, multiple single-stream backups The table below shows the recommended maximum number of backup strea ms or jobs pe[...]

  • Page 36

    36 If backing up using application agents (e.g. Exchange, SQL, Oracle) it is recommended that only one backup per share is run concurrently because these application agents frequently open more concurrent files than standard file type backups. D2D2502 G1 D2D2503 G1 D2D2504 G1 D2D400X G1 D2D4112 G1 Suggested maximum concurrent operations per share 4[...]

  • Page 37

    37 Verify By default most backup applications will perform a verify pass on each back job, in which they read the backup data from the D2D and check against the original data. Due to the nature of deduplication the process of reading data is slower than writing as data needs to be re- hydrated. Thus running a verify will more than double the overal[...]

  • Page 38

    38 operations. Housekeeping remains an important part of the data deduplication solution and enough time must be allowed for it to complete in order to make best use of available storage capacity. See Housekeeping monitoring and control on page 71 for more information on best practices for applying blackout windows and how to monitor the housekeepi[...]

  • Page 39

    39 A local user with the same username and password must be created on the media server that will be using the D2D CIFS share. The backup application services must be configured to run as the local user (how this is configu red varies by backup application). The best practice when using User authentication is to create a “ backup ” user account[...]

  • Page 40

    40 1. Create a new Host(A) record in the forward lookup zone for the domain to which the D2D belongs with the hostname and IP address of the D2D. Click Add Host . 2. Also create a Pointer(PTR) in the reverse lookup zone for the domain for the D2D appliance by providing hostname and IP address. Click OK .[...]

  • Page 41

    41 Now that the D2D is a member of the domain its shares can be managed from any computer on the domain by configuring a customized Microsoft Management Console (MMC) with the Shared Fold ers snap -in. Once you have created shares you can manage them as follows. 1. Open a new MMC window by typing mmc at the command prompt or from the start search b[...]

  • Page 42

    42 3. Now click Add > In the dialog box choose the computer to be managed and select Shares from th e View options. 4. Finally select Finish and OK to complete the snap-in set up. Note that the Folder Path field contains an internal path on the D2D Backup System. 5. Save this customized snap-in for future use.[...]

  • Page 43

    43 6. Double click a share name in the right-h and pane and select the Permissions tab. Add a user or group of users from the domain. Specify the level of permission that the users will receive and click Apply . Leaving an AD domain The user may wish to leave an AD domain in order to: Temporarily leave then rejoin the same domain Join a different A[...]

  • Page 44

    44 VTL and NAS – Data source perf ormance bottlen ec k identification In a lot of cases backup and restore performance using the HP D2D Backup System is limited by factors outside of the appliance itself. For example the speed at which data can be transferred to and from the source disk system (the system being backed up), or the performance of t[...]

  • Page 45

    45 The activity graph below shows the start of a Virtual Tape Write and the current throughput being achieved. The activity graph below shows the end of a Virtual Tape Write and the start of a Virtual Tape Read and the throughput achieved.[...]

  • Page 46

    46 The activity graph below shows the end of a Virtual Tape Read. How to use the D2D storage and dedup l ication ratio re p orting metrics D2D appliances with software at version 1.0.0 and 2.0.0 and later provide more detailed storage reporting and deduplication ratio metrics on the Web Management Interface. These will indicate the storage and dedu[...]

  • Page 47

    47 This example is from the Storage Reporting GUI page and shows the Disk Storage Capacity Growth for b oth User Data and Physical Data for the current week for the whole appliance as more b ackups have been sent to it during the week. This chart can also display this information for a month period. Deduplication Ratio and Daily and Weekly Change r[...]

  • Page 48

    48 D2D Replication The HP StorageWorks D2D products provide deduplication-enabled, low bandwidth replication for both VTL and NAS devices. Replication enables data on a “replication source” D2D to be rep licated to a “replication target” D2D system. Replication provides a point- in - time “mirror” of the data on the source D2D at a targ[...]

  • Page 49

    49 Replication us age models There are four main usage models for replication using D2D devices. Active/Passive – A D2D system at an alternate site is dedicated solely as a target for replication from a D2D at a primary location. Active/Active – Both D2D systems are backing up local data as well as receiving replicated data from each other. Man[...]

  • Page 50

    50 Many to One conf iguration[...]

  • Page 51

    51 N-way configuration In most cases D2D VTL and D2D NAS replication is the same, the only significant configuration difference being that VTL replication allows multiple source libraries to replicate into a single target library, NA S mappings however are 1:1, one replication target share may only receive data from a single replication source shar[...]

  • Page 52

    52 Replication overview What to replicat e D2D VTL replication allows for a subset of the cartridges within a library to be mapped for replication rather than the entire library (NAS replication does not allow this). Some retention policies may not require that all backu ps are replicated, for example daily incremental backups may not need to go of[...]

  • Page 53

    53 Appliance, library a nd share replication fan in/out Each D2D model has a different level of support for the number of other D2D app liances that can be involved in replication mappings with it, and also the number of libraries that may replicate into a single library on the device as follows: Max Appliance Fan out The maximum number of target a[...]

  • Page 54

    54 Concurrent re plication jobs Each D2D model has a different maximum number of concurrently running replication jobs when it is acting as a source or target for replication. The table below shows these values. When many items are available for replication, this is the number of jobs that will be running at any one time. As soon as one item has fi[...]

  • Page 55

    55 Amount of data in each backup Data change per backup (deduplication ratio) Number of D2D systems replicating Number of concurrent replication jobs from each source Number of concurrent replication jobs to each target As a general rule of thumb, however, a minimum bandwidth of 2 Mb/s per replication job should be allowed. For example, if a replic[...]

  • Page 56

    56 Summary of possible seeding methods and likely usage models Technique Best for Concerns Comments Seed over the WAN link Active -- Passive a nd Many to 1 replication model s with: Initial Small Volu mes of Bac kup data OR Gradual migr ation of larger backup volumes/jobs t o D2D over time As an example , the first 500 GB full backup over a 5M bit [...]

  • Page 57

    57 Seeding metho ds in more detai l Seeding over a WAN link With this seeding method the final replication set-up (mappings) can be established immediately. Active/Passive Active/Active WAN seeding over the first backup is, in fact, the first wholesale replication. WAN seeding after the first backup at each location is, in fact, the first wholesale[...]

  • Page 58

    58 Many to One WAN seeding over the first backup is, in fact, the first wholesale replication from the many remote sites to the Target site. Care must be taken not to run too many replications simultaneously or the Target site m ay become overloaded. Stagger the seeding process from each remote site.[...]

  • Page 59

    59 Co -location (seed over LAN) With this seeding method it is important to define the replication set-up (mappings) in advance so that in say the Many to One example the correct mapping is established at each site the target D2D visits before the target D2D is finally shipped to the Data Center Site and the replication “re - established” for t[...]

  • Page 60

    60 Many to One Co -location seeding at S ource (remote) site s; transport target D 2D betwee n remote sites. 1. Initial backup at each remote site 2. Replication to Target D2D over GbE at each remote site. 3. Move Target D2D between remote sites and repeat replication. 4. Finally take Target D2D to Data Center site. 5. Re -establish replication.[...]

  • Page 61

    61 Floating D2D method of seeding Many to Once Seeding with Floating D2D ta rget – for larg e fan-in scenarios Co -location seeding at Source (remote) sites. Transport floating target D2D between remote sites then perform replication at the Data Center site. Repeat as necessary. 1. Initial backup at each remote site. 2. Replication to floating Ta[...]

  • Page 62

    62 2. At each remote site perform a full system backup to the source D2D and then configure a 1:1 mapping relationshi p with the flo ating D2D device” e. g . SVTL1 on Remote Site A -  FTVTL1 on floating D2D. FTVTL1 = f loating target VTL1. 3. Seeding remote site A to the flo ating D2D will take plac e over the GbE link and may take se veral ho[...]

  • Page 63

    63 Seeding using physical tape or portable disk drive and ISV copy u tilities Many- to -one seeding u sing Physical Ta pe or portable di sk drives Physical tape-based or portable disk drive seeding 1. Initial backup to D2D. 2. Copy to tape(s) or a disk using backup application software on Media Server for NAS devices; only use simple drag and drop [...]

  • Page 64

    64 2. Use the backup application software to perform a full media copy of the contents of the D2D to a physical tape or removable disk target for backup also attached to the media server. In the case of removable USB disk drives the capacity is probably limited to 2 TB, in the case of physic al LTO5 media it is limited to about 3 TB per tape, but o[...]

  • Page 65

    65 Replication an d other D2D operat ions In order to either optimize the performance of replication or minimize the impact of replication on other D2D operations it is important to consider the complete workload being placed on the D2D. By default replication will start quickly after a backup completes; this win dow of time immediately after a bac[...]

  • Page 66

    66 A bandwidth limit calculator is supplied to assist with defining suitab le limits. Source Appliance Permissions It is a good practice to use the Source Appliance Permissions f unctionality provided on the Replication - Partner Appliances tab to prevent malicious or accidental configuration of replication mappings from unknown or unauthorized sou[...]

  • Page 67

    67 Replication Mon itoring It It The aim of replication is to ensure that data is “moved” offsite as quickly as possible after a backup job completes. The “maximum time to offsite” varies depending on business requirements. The D2D Backup System provides tools to help monitor replication performance and alert sy stem administrators if requi[...]

  • Page 68

    68 Replication Throughput totals Whilst replication jobs are running the Status - Source/Target Active Jobs pages show some detailed performance information averaged over several minutes. The following information is provided: Source / Target jobs running: The number of replication jobs that this appliance is running concurrently. Transmit / Receiv[...]

  • Page 69

    69 Replication share/library details Replication share/library details show the synchronization status, throughput and d i sk usag e for each replicated device. This allows the system administrator to identify the performance of each share individ ually to see the bandwidth utilization of each share and sync status. Replication File/Cartridge detai[...]

  • Page 70

    70 This is very useful to identify: Differences in bandwidth saving and therefore deduplication ratio for an individual cartridg e or file. These can be directly correlated to backup jobs and allow the backup administrator to see the deduplication efficiency of specific data backups. Individual files or cartridges that are not being replicated . T [...]

  • Page 71

    71 Housekeeping monito ring and control Terminology Housekeeping: If data is deleted from the D2D system (e.g. a virtual cartridge is overwritten or erased), any unused chunks will be marked for removal, so space can be freed up (space reclamation). The process of removing chunks of data is not an inline operation because this would significantly i[...]

  • Page 72

    72 By setting a housekeeping blackout window appropriately from 12:00 to 00:00 we can ensure the backups and replication run at maximum speed as can be seen below. The housekeeping is scheduled to run when the device is idle. However some tuning is required to determine how long to set the housekeeping windows and to d o this we must use the D2D We[...]

  • Page 73

    73 Overall section This section shows the combined information from both the Libraries and Shares sections. The key features within this section are: Housekeeping Statistics: Status has three options: OK if housekeeping has been idle within the last 24 hours, Warning if housekeeping has been processing nonstop for the last 24 hours, Caution if hous[...]

  • Page 74

    74 c) Restructure the backup regime to remove appends – as the bigger the tapes, files are allowed to grow (through appends,) the more housekeeping they generate . d) Increase the time allowed for housekeeping to run if housekeeping blackout windows are set. e) Re -schedule backu p jo bs to try and ensure all backup jobs complete at the same tim [...]

  • Page 75

    75 Tape Offload Terminology Direct Tape Offload This is when a physical tape library d evice is connec ted directly to the rear of the D2D Generation 1 products (D2D2503, 4004, 4009, 2502A, 2504A, 4112A - which are now obsolete) using a SAS host bus adapter. The D2D device itself manages the transfer of data from the D2D to physical tape AND the tr[...]

  • Page 76

    76 Tape Offload/Cop y fro m D2D vers us Mir rored Backup from Data Sou rce A summary of the supported methods is shown below. For easiest integration For optimum performance Backup application copy to tape Separate physical tape mirrored backup The backup ap plication contr ols the copy from the D2D appliance to th e network-attached t ap e drive s[...]

  • Page 77

    77 Note: Target Offload can vary from one backup application to another in terms of import functionality. Please check with your vendor. Backup application tape offload at D2D source site 1. Copy D2D to physical tape; this uses the backup Copy job to copy data from D2D to physical tape and is easy to automate and schedule, it has a slower copy perf[...]

  • Page 78

    78 Key performance f actors in Ta pe Offload performance Note in the diagram below how the read performance from a D2D4312 (red line) incr eases with the number of read streams – just like with backup. If the D2D4312 reads with a single stream (to physical tape) the copy rate is about 370 GB/hour. However, if the copy jobs are configured to use m[...]

  • Page 79

    79 2. For “Media Copies” it is always best to try and match the D2D VTL cartridge size with the physical media cartridge size to avoid wastage. For example: if using physical LTO4 drives (800 GB tapes) then when configuring D2D Virtual Tape Libraries the D2D cartridge size should also be configured to 800 GB. 3. Schedule time for Tape Offloads:[...]

  • Page 80

    80 Appendix A Key reference inform ation[...]

  • Page 81

    81 D2D Generation 2 products, software 2.1.00 D2D Gen2 Pr oducts, Software 2.1 .00 D2D2502 i D2D2504 i D2D4106 i/fc D2D4112 fc D2D4312 fc D2D4324 fc Devices Usable Disk Ca p acity (TB) 1.5 3 9 18 36 72 Max Number Dev i ces (VTL/NAS ) 4 8 16 24 50 50 Replication Max VTL Library Rep Fan Out 1 1 1 1 1 1 Max VTL Library Rep Fan In 1 1 8 8 16 16 Max App[...]

  • Page 82

    82 D2D Generation 1 products, software 1.1.00 D2D Generation 1 P roducts, Software 1.1.00 D2D2502i D2D2503i D2D2504i D2D4004i/fc D2D4009i/fc D2D4112fc Devices Usable Disk Ca p acity (TB) 1.5 2.25 3 7.5 7.5 18 Max Number Dev i ces (VTL/NAS ) 4 6 8 16 16 24 Replication Max Library Re p Fan Out 1 1 1 1 1 1 Max Library Re p Fan In 1 1 1 4 4 4 Max Appli[...]

  • Page 83

    83 Appendix B – Full y Worked Example In this section we will work through a complete multi-site, multi-region D2D design, configuration and deployment tuning. The following steps will be undertaken: Hardware and site configuration definition Backup requirements specification Using HP StorageWorks Backup sizing tool – size the hardware requirem[...]

  • Page 84

    84 Backup requirem ents specificatio n Remote Sites A/D NAS emulations required Server 1 – Filesystem 1, 100 GB, spread across 3 mount points Server 2 – SQL data, 100GB Server 3 – Filesystem 2, 100GB, spread across 2 mount points Server 4 – - Special App Data , 100GB Rotation Scheme – Weekly Fulls, 10% Incrementals during week, Keep 4 wee[...]

  • Page 85

    85 Using the HP Stora geWorks B a ckup sizing to ol Configure replication environment Click on Backup calculators and then Design D2D/VLS replication over WAN to get started. 1. Configure the replication environment for 4 source appliances to 1 target appliance, commonly known as Many to One replication. The replication window allowed is 12 hours; [...]

  • Page 86

    86 IT IS VERY IMPORTANT that when you are creating the backup specifications in the Sizer tool you pay particular attention to the Field “Number of parallel Backup Streams ”. This field determines the backup throughput and the amount of concurrent replications that are possible BUT it may require a conscious change by the customer in backup pol[...]

  • Page 87

    87 In the case of sites A and D, when we enter all the backup jobs, we will have seven backup jobs running in parallel which will give us best throughput and backup p erformance. 3. For each Job you can view the rotation scheme and predicted deduplication ratios by displayin g the Output from within the Dedupe tab. Site A Filesystem 1 uses an Incre[...]

  • Page 88

    88 4. As you specify each job in turn click Add job and the job will be loaded to the summary table (see below).[...]

  • Page 89

    89 5. Add all backup jobs for Sites A and D Please note in line with customer request, at sites A and D the D2D emulation has been selected as NAS Emulation with CIFS shares.[...]

  • Page 90

    90 6. Repeat for Sites B and C.[...]

  • Page 91

    91 7. Input backup job entries for Site E, which requires full backups every day for 29 days and is also required to have FC attach, so click FC in the System interface area. The rotation scheme for Site E is Fulls & Fulls.[...]

  • Page 92

    92 We will retain 29 days of Fulls.[...]

  • Page 93

    93 8. Press the Solve/Submit button and the Sizer will do the rest.[...]

  • Page 94

    94 Sizer output The Sizer creates two outputs. It creates an excel spreadsheet with all the parts requ ired for the solution including Service and Support, and an y licenses required together with the List Pricing. It creates a sol ution overview (see below) which indic ates the types of de vices t o be u sed at sourc e and target, the am ount of d[...]

  • Page 95

    95 The Sizer has also establi shed that each sourc e needs a 4.6 or 4.47 Mbit/sec link for th e sources, whilst the target needs a link size of ju st over 9 Mbit/sec. The Sizer shows how muc h data is replicated worst case from the sou rces to the tar get under Total TX GB The output c ontinues and shows a summary f or each site of the backup conc [...]

  • Page 96

    96 Refining the configuration In this worked example it is crucial that we have as many jobs replicating to the target simultaneously as possible. 1. Use a feature in the Sizer to force the target device to be the next model upwards – an HP D2D4106 which has an increased replication concurrency when used as a target of 24. It should also be noted[...]

  • Page 97

    97 2. Click Solve/Submit again. 3. A new parts list is generated with HP D2D4106 as the target, along with an HP D2D4106 replication license for the target.[...]

  • Page 98

    98 Note how because the replication is now more efficient we only need just over 2 Mbit/sec WAN links on each of the sources. Configure D2D so urce device s and repli ca tion target c on figuration Sites A and D The customer has already told us he wants NAS emulation at sites A and D. Server 1 – Filesystem data 1, 100GB, spread across 3 mount poi[...]

  • Page 99

    99 On sites A and D the D2D units would be configured with four NAS shares (one for each server), the filesystem servers would be configured with subdirectories for each of the mount points. These subdirectories can be created on the D2D NAS CIFS share by using Windows Explorer (e.g. Dir1, Dir2, Dir3) and the backup jobs can be configured separatel[...]

  • Page 100

    100 The final total source and target configuration is shown below. Example NAS and VTL configurations Map out the int eraction of ba ckup, housekeeping a nd replication for sources and targ et With HP D2D Backup Systems it is important to understand that the device cannot do everything at once, it is best to think of “windows” of activity. Ide[...]

  • Page 101

    101 Overlapping backups to minimize housekeeping interference Source 1 – Bad scheduling Source 2 – Good scheduling As backup DIR 1 finishes it triggers Housekeeping, which then impactsthe performance of the backup on DIR 2 If backup jobs can be scheduled to cpmplete at the same time, the impact of Housekeepong on backup performance will be geat[...]

  • Page 102

    102 Tune the soluti on using replicati on windows an d housekee ping windows The objective of this section is to allow the solution architect to design for predictable behavior and performance. Predictable configurations may not always give the fastest time to complete but in the long run they will prevent unexpected performance degradation due to [...]

  • Page 103

    103 Worked example – backup, replication and h ousekeeping overlaps Because we have sized D2D2502 for the sources - there is a limit of 4 concurrent source replication jobs at any one time. This simulation is valid for Code Versions 2.1 and above which use container matching technology a nd improved housekeeping performance. Backup Replication Ho[...]

  • Page 104

    104 Initial configurati on with replication blackout window set There is improvement in some backup job performance e.g. Share 1 DIR2 & Share 2 SQL data, but replication jobs can only run 4 at a time (2502 concurrent source replication limit). Using V2.1 code and higher Housekeeping can now be scheduled to be run for 2 p eriods every 24 hours A[...]

  • Page 105

    105 Target initial c onfiguration Some effort is required to map all the activity at the target, but it is clear that, between 20:00 and 02:00, the target has a very heavy load because local backups, replication jobs from sites A and D and housekeeping associated with replication jobs from sites B and C are all running at the same time. Target impr[...]

  • Page 106

    106 Offload to Tape requirement In this example the customer wants to know: “What is the best practice to make monthly copies to physical tape from Site E?” One fundamental issue associated with the deduplication process used on D2D is that the data is “chunked” into nominal 4K pieces before it is stored on disk. When data is read from D2D [...]

  • Page 107

    107 Avoiding bad practices The worked example describes the best practices. Typical bad practices are:- Bad Practice Results Not using the Sizing tool Incorrect models chosen because of wrong throughput calculations. Replication Link sizing incorrect Insufficient backup streams configured to run in parallel Poor backup performance, poor replication[...]

  • Page 108

    108 Appendix C HP Data Protect or Tape Offload – Worked Examples HP Data Protector has an extensive range of Copy processes. Here we will look at how to offload both D2D Virtual Tape Libraries and D2D NAS shares to physical tape. Similar processes to thi s exist for all the major backup applications. A note on termin ology Media Copy – this is [...]

  • Page 109

    109 In this example the following storage devices are configured on HP Data Protector Cell Manager “zen”: HP D2DMSL: is a Virtual Tape emulation on the D2D Backup System with 24 virtual slots (with virtual barcodes) and 1 virtual LTO5 drive. A physical MSL Tape Library configured with 2 x LTO5 drives and 24 slots with only two pieces of LTO5 me[...]

  • Page 110

    110 HP Data Protector has a context window for controlling Object operations as can be seen below. To perform a simp l e media copy 1. Right click on the media in the D2D Library in slot 1 a nd in the right -hand navigation pane select the target for the copy to be the physical library slot 1. Full media copy e.g. 50GB on D2D virtual media copied t[...]

  • Page 111

    111 2. Select the default parameters for the copy. It is important for base media copies that both the primary copy and the secondary copy media are of the same format in terms of block size, etc, as many backup applications can n ot reformat “on the fly”. 3. The media copy is shown as successful. To perform an inter acti ve object copy, VTL 1.[...]

  • Page 112

    112 2. Click Next and, depending on what backup objects have been selec ted, HP Data Protector will check that all the necessary source devices (that wrote the backups) are available. Click Next . 3. Select the target (copy to) device and the drive to be used and click Next. H ere we have chosen LTO5 drive 2 on the physical MSL Tape Library. 4. You[...]

  • Page 113

    113 5. Select one or more media depending on the objects that are to be copied. Select Next to display the Summary screen and click Finish to start the object copy. 6. These screens show the Object copy in progress from the D2D Backup System to the physical LTO-5 media. 7. This Object copy can also be scheduled using the scheduled sec tion under ob[...]

  • Page 114

    114 To perform an inter acti ve object copy, D2D NAS share 1. Select Objects in the left-hand navigation pane and locate the D1D NAS share. 2. Click Next . Note below the Source device is now a D2D NAS share or in Data Protector terminolo gy a File Library. 3. Select an LTO5 drive in the HP MSL G3 Library to create the copy. This object was backed [...]

  • Page 115

    115 4. This shows the full path of the HP Data Protector File library and the file that represents the backup. 5. In this case the File Library was in 64K block format and needed to be re-packaged because the LTO5 block size was set to 256K as can be seen in the section underlined in red below. The copy was successful. Note: Consolidation on the le[...]

  • Page 116

    116 Appendix D Making use of impro ved D2D perf ormance in 2. 1.01 and 1.1.01 software Overview HP StoreOnce D2D software released in February 2011 includes significant performance stabilization updates that reduce the disk access overhead of the deduplication process and therefore improve overall system performance. However this performance improv[...]

  • Page 117

    117 Replication for Virtu a l Device Migr ation This method involves using two D2D Backup Sy stems and has the benefi t that it does not require additional disk space to be available on the existing D2D Backup System to work. HP ProLiant DL320s UID 1 2 Original File Share Original VTL D2D A New VTL New File Share HP ProLiant DL320s UID 1 2 Replicat[...]

  • Page 118

    118 Step 3 – Recover Data to new VTL /Share 1. Run the replication recovery wizard on the original D2D appliance, this will reverse replicate the data from the replication target device back to the new source device. 2. Wait for replication to synchronize the devices. 3. Remove the replication mapping from either the source or target D2D Web Mana[...]

  • Page 119

    119 Self Replication for Virtual Device Migration Self replication is the process of replicating data between two devices on the same D2D Backup System. This model requires that there is sufficient disk space on the D2D Backup System to hold two copies of the data being migrated but, with 2.1.00 and 1.1.00 software, a replication license is not req[...]

  • Page 120

    120 Step 1 – Self replicate data for migration 1. Create a new VTL or Share on the D2D Backup System; this will be the new location for the migrated data. It is not possible to use the same Share or Library names as the original or use the same WWN/Serial numbers for VTL devices. 2. Add a new replication target device by providing the own IP addr[...]

  • Page 121

    121 Replication dev ice self replication mi gration HP ProLiant DL320s UID 1 2 Original File Share Original VTL D2D A New VTL New File Share HP ProLiant DL320s UID 1 2 Replicated File Share Replicated VTL D2D B 1 2 3 Existing Replication Mapping Self Replication 1 Step 1 – Break Existing Replication Mapping 2 Step 2 – Replicate to new VTL/Share[...]

  • Page 122

    122 5. Remove the replication mappings on both D2D Backup Systems. 6. Remove the appliance addresses from the list of replication target appliances on both D2D Backup Systems. 7. “Connect” the backup media server to the new source device. For example: – Mount the NFS share – Discover the iSCSI VTL device and connect – Zone the FC SAN so t[...]

  • Page 123

    123 Configuring S elf Migration from th e D2D Web Managem ent Interface The HP StoreOnce Backup System user guide provides step by step instructions on how to configure replication mappings on the Web Management Interface. However, there are some differences when configuring Self Replication. This chapter provides a simple step by step guide to mig[...]

  • Page 124

    124 2. The new share has now been created and after a few seconds is online. At this point there is no user data stored in that share. 3. The next step is to begin configuring replication to migrate the data . Select A dd Target Appliance from the Replication – Partner Appliances – Target Appliances page on the Web Management Interface. 4. Ente[...]

  • Page 125

    125 5. Upon successful completion the local appliance will be added to the Target App liances list. 6. Go to the Replication – NAS Mappings page, select the share to be replicated (i.e. the original share with backup data in it) and click Start Replication Wizard .[...]

  • Page 126

    126 7. There are two main steps in the Wizard, the first is to select the target appliance from a list. This list will only contain the information about the local D2D appliance and will be highlighted alread y. Click Next . 8. Select the Target Share (this is the new share that was created earlier in the process and is the target for replication) [...]

  • Page 127

    127 9. After completing the wizard, replication will begin synchronizing the data between the two shares. Synchronization will take some time to complete because all data must be replicated to the new device. Once complete, the status will change to Synchronized which means that the same data is p resent in both shares. (Note that the size on disk [...]

  • Page 128

    128 11. Now reconfigure the backup application to use the new D2D share as a backup target device. For example the backup application will need to retarget ba ckups to the new share. This should be done prior to deleting the original share to ensure the migration has been successful and that the backup application can a ccess the new share. 12. Onc[...]

  • Page 129

    129 Index 10Gbit Ethernet, 15 A Active Directory, 15 active-to-active replication, 50 active-to-passive replication, 50 activity graph, 44 AD authentication, 39 AD domain joining, 39 leaving, 43 problems joining, 39 authentication, 38 B backup application and NAS, 32 backup application considerations, 27 backup job recommendations, 29 bandwidth lim[...]

  • Page 130

    130 O open files, 31 out of sync notifications, 68 P performance activity graph, 44 deduplication, 7 libraries per appliance, 27 maximum concurrent backup jobs, 26 maximum NAS operations, 36 metrics on GUI, 44 multi-streaming, 9 NAS, 31, 34 network, 22 replication, 7 reporting metrics on GUI, 46 performance tools, 44 Permissions domain, 43 physical[...]

  • Page 131

    For more information To read more about the HP D2D Backup System, go to www.hp.com/g o/D2D Share with colleagues © Copyright 201 1- 2012 Hewlett -Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warr anties for HP products and services are set forth in the express warranty statements [...]