Next Patent: Data storage
Next Patent: Data storage
[0001] The present invention is directed to a disk based backup storage system that can be seamlessly integrated with a tape backup system or the like and, more specifically, to a method of importing data into a virtual tape library.
[0002] Backing up computer data, restoring computer data, securing computer data and managing computer data storage (collectively referred to as data protection) requires complex and disparate technical and operational solutions. Data protection is the single most expensive storage administrative task.
[0003] One data protection strategy is to use a redundant array of independent disks (RAID) and disk mirroring technology to protect data. Unfortunately, disk mirroring only prevents data loss in the event of a hardware or power failure. Mirroring does not protect data from human error, such as the accidental deletion of portions of a document. On a disk mirrored system, once data has been deleted from the primary disk, the data is automatically deleted from the mirrored disk and is not retrievable.
[0004] To address the problem of human error and computer viral damage, backup systems have been designed that are file-based and track files for many generations. One typical form of data protection backup uses physical tapes to store data in tape libraries. Physical tape backup libraries provide the ability to restore current and historical data and to recover from a variety of forms of data loss.
[0005] Referring to
[0006] A typical physical tape library
[0007] Physical tape libraries
[0008] To automate the mounting and unmounting of tapes into tape backup drives, many organizations use a robotically-controlled tape library. Actual usage of individual tape media is generally very infrequent. Backup jobs typically run at night during a period called the “backup window”. Typically, organizations use tape rotation schemes whereby the organization writes to daily tapes, weekly tapes and monthly tapes. Many of the tapes are sent off-site after being written to, and are not accessed again until either computer data must be restored or the computer data on the backup tape has expired (usually after some number of weeks, months or even years). Additionally, adding to the size of a tape library can be a complicated matter requiring the integration of additional tape libraries into the data protection application.
[0009] An essential component of a virtual tape library is its ability to work with removable data storage media. Removable data storage media is essential for offsite archival of data, for freeing up space in the virtual tape library and for interchanging data.
[0010] The importation of data into a virtual tape library from physical data storage devices is necessary for a virtual tape library to operate efficiently. Data frequently needs to be imported from removable data storage devices during the following operations: overwriting of the storage media (usually because the storage time has expired and the storage media is scheduled to be overwritten with new backup data); appending data to a preexisting storage media; reading data from the physical storage media; making the contents of the data storage media more reliably available to the data protection application (e.g., while a physical tape is not fault tolerant, a copy of a physical tape maintained in virtual tape library can be fault tolerant); conversion from legacy physical tapes to virtual tapes in preparation for generating new physical tapes using the same (refresh) or different (conversion) tape media type; optionally marking the data storage media as “obsolete” because a new virtual tape of the same name will be created; and the physical tape or a virtual tape of the same name is being marked as bad or obsolete.
[0011] Several problems exist when traditional virtual tape library's import data from a storage media. The trigger signals and workflows for various physical tapes are different. Without a virtual tape library, the normal procedure for triggering the importing of data from physical tapes is for the user to query the data protection application to find out the labels of the tape(s) to be imported. With a traditional virtual tape library, the tapes the data protection application writes to are virtual and have different labels from the physical tapes that are tracked by preexisting data protection applications. The typical import procedure must be carried out via the virtual tape library and not the data protection application. Additionally, a physical tape may contain some virtual tapes that a user does not want to import.
[0012] A second problem with traditional virtual tape libraries is that they only allow the importation of proprietary tapes. Traditional virtual tape library systems generate physical tapes that are written in the virtual tape library's proprietary format, which can only be read and interpreted by that particular virtual tape library system. The proprietary format adds an additional step in the data importing process. The data from the tape must first be loaded into the virtual tape library system before being passed to the data protection application. More importantly, the proprietary format can not be read by the data protection application unless the virtual tape library system that originally created the tape is still being used at the time the tape is needed. This can create a problem when a tape is required for the restoration of data many years after the tape was created. Thus, virtual tape libraries that use a proprietary format for creating physical tapes can only import tapes that the virtual tape library has written.
[0013] Another problem with traditional virtual tape libraries is that the import of data into a virtual tape library can be slow when the import involves the copying of all the tape's data. This process can also be wasteful if all the data is not actually required by a user (for example when the tape is imported to be overwritten).
[0014] Clearly, what is needed is a method of importing data from a physical data storage device into a virtual tape library system that preferably does not require that the physical data storage device use proprietary coding and formatting. I would also be preferable that the method incorporate multiple copy modes such as completely copying the data on the physical device, copying a portion of the data on the physical device prior to overwriting the physical device, and operating in a pass-through mode to allow a data protection application to directly read from and/or write to the physical device. Finally it would be preferable that the method is able to import data into a virtual tape library which can replace a physical tape library or act as a cache for a physical tape library for a preexisting data protection application.
[0015] One embodiment of the present invention is directed to a method of importing a plurality of data from a physical data storage device into a virtual tape library system that is used with a data protection application. The virtual tape library system is usable with an existing data protection application designed for use with a physical tape library without modifying the existing data protection application. The method includes: triggering a signal representing a command to import the plurality of data from the physical data storage device; providing a controller that receives the signal; providing a data storage medium in communication with the controller, the data storage medium having at least one virtual library defined thereon, each virtual library being capable of having a plurality of virtual tapes defined therein, the data storage medium being capable of storing the plurality of data in response to signals received by the controller; establishing communication between the data storage medium and the physical data storage device; and copying at least a portion of the plurality of data from the physical data storage device to a virtual tape.
[0016] The foregoing summary, as well as the following detailed description of preferred embodiment of the invention, will be better understood when read in conjunction with the appended drawings. For the purpose of illustrating the invention, there is shown in the drawings an embodiment which is presently preferred. It is understood, however, that the invention is not limited to the precise arrangement and instrumentality shown. In the drawings:
[0017]
[0018]
[0019]
[0020]
[0021] The term “data storage medium,” as used in the claims and in the corresponding portions of the specification, means “all non-physical tape data storage means including disks, disk subsystems, hard drives, and future developments in disk data storage and other non-volatile forms of storage.” The term “physical data storage device,” as used in the claims and in the corresponding portions of the specification, means “a media that is removable such as a disk, tape, optical disk, removable drive or the like.” The words “a” and “one,” as used in the specification and claims is specifically defined to include one or more of the referenced item unless specifically stated otherwise.
[0022] Referring to
[0023] One or more virtual libraries are defined on the data storage medium
[0024] Referring to
[0025] The method of the present invention is initiated during step S
[0026] If the import is requested by the data protection application during step S
[0027] When a user manipulates the controls to a data protection application to input into the application particular tape(s) that are to be imported (or the data protection application can initiate this automatically), the data protection application sends signals that would normally instruct a physical tape library to move the desired tapes from the entry/exit port of the physical tape library. The system controller
[0028] If the import is triggered via the user interface during step S
[0029] The user interface is preferably styled for manipulating a virtual tape backup library in a manner similar to that used to control a physical tape library. The virtual tape library provided by the system preferably performs the data storage and retrieval functions requested by the user via the user interface. Those of ordinary skill in the art will appreciate from this disclosure that the user interface can be a keypad or the like without departing from the scope of the present invention. The user interface can be displayed on a monitor attached to one of the servers that backs up data to the system
[0030] After the import process has been triggered in either step S
[0031] For example, the following copy modes (i.e., copy optimizations) can preferably be selected: complete, overwrite, or pass-through mode. The complete copy mode instructs the virtual tape library to copy substantially all of the data from the physical data storage device
[0032] The overwrite copy mode is used to speed up the overwriting process when an expired physical data storage device
[0033] The pass-through copy mode allows reads and/or writes to be directly performed between the data protection application and the physical data storage device
[0034] The copy mode can be set by policy, by default, globally or on a case by case basis. An additional option is to have the virtual tape library place the data contained in the physical data storage device
[0035] If a copy option is not set, then the appropriate selection (e.g., complete, overwrite, or pass-through) is selected during step S
[0036] If no tape library is connected, then the system
[0037] If the system
[0038] Once the importation of data is scheduled by the data protection application or by the user, the process of importing data into a virtual tape will depend upon whether there is a physical tape library (or other suitable device for reading from the physical data storage device
[0039] Whether the correct physical data storage device
[0040] The virtual barcodes allow the system
[0041] Referring to
[0042] By using virtual barcodes, the system
[0043] The virtual tapes that are presented to, selected by, managed by, or inventoried by the data protection application are preferably classified using the above mentioned “virtual barcodes.” The virtual barcodes are implemented and presented by the virtual tape and/or the virtual tape library in exactly the same manner as a physical tape library incorporates physical barcode labels. This allows the system
[0044] If the contents of the physical data storage device
[0045] Then in step S
[0046] If an incorrect tape is mounted in the tape drive, then the system
[0047] Once the correct tape is mounted in the tape drive, the system
[0048] When data from a physical data storage device
[0049] When the system
[0050] The system also preferably allows the user to specify (on a global basis, as a default, or on a tape by tape basis) what to do with the data on the physical data storage device
[0051] If during step S
[0052] After the system has copied the desired data, the system
[0053] If during step S
[0054] Then, during step S
[0055] While
[0056] The method of the present invention uses a virtual tape library system
[0057] It is further recognized by those skilled in the art, that changes may be made to the above-described embodiments of the present invention without departing from the broad inventive concept thereof. It is understood, therefore, that this invention is not limited to the particular embodiments disclosed, but is intended to cover all modifications which are within the spirit and scope of the invention as defined by the appended claims