Nearline storage (a portmanteau of "near" and "online storage") is a term used in computer science to describe an intermediate type of data storage that represents a compromise between online storage (supporting frequent, very rapid access to data) and offline storage/archiving (used for backups or long-term storage, with infrequent access to data).
Both archiving and nearline allow a reduction of database size that results in improved speed of performance for the online system. However, accessing archived data is more complex and/or slower than is the case with nearline storage, and can also negatively affect the performance of the main database, particularly when the archive data must be reloaded into that database.
Robotic nearline storage
The nearline storage system knows on which volume (cartridge) the data resides, and usually asks a robot to retrieve it from this physical location (usually: a tape library or optical jukebox) and put it into a tape drive or optical disc drive to enable access by bringing the data it contains online. This process is not instant, but it only requires a few seconds.
Nearline tape and optical storage has the advantage of relatively longer lifespans compared to spinning hard drives, simply due to the storage media being idle and usually stored in protected dust-free enclosures when not in use. In a robotic tape loading system, the tape drive used for accessing data experiences the most wear and may need occasional replacement, but the tapes themselves can last for years to decades. If there are sealable access doors between the access mechanism and the media, it is possible for the idle media storage enclosure to survive fire, floods, lightning strikes, and other disasters.
Hard drive nearline storage
MAID (massive array of idle disks) systems archive data in an array of disks. Most disks in the MAID are usually stopped. The MAID system spins up each disk on demand when necessary to read (or in some cases to write) data on that drive. For a given amount of storage capacity, MAID systems have higher densities and lower power and cooling requirements than "hot" storage systems that keep all the disks spinning at full speed at all times.
Some hard drive and storage systems vendors and suppliers use the term in reference to low-rotational speed hard drives, that are built to be more reliable than generic desktop and laptop computer hard drives, and are intended to be operational continuously for 24 hours a day, seven days a week, possibly for several years.
Nearline hard drives may be used in personal or small business network-attached storage (NAS) systems, or as non-critical moderate-performance data storage on servers, where greater durability is required for the drive to operate continuously.
By comparison, standard hard drives are assumed to only be in operation for a few hours each day, and are not spinning when the computer is either turned off or in sleep mode. Standard hard drives may also use data caching methods that can improve single-drive performance, but would interfere with the operation of multi-drive RAID storage systems, potentially causing data loss or corruption.
Specifically the term nearline hard drive is being used to refer to high-capacity Serial ATA drives that work with Serial Attached SCSI storage devices. Presumably this usage is by analogy to the high-capacity and low-access speed tape systems.
- "Nearline storage" in "A Glossary of Archival and Records Terminology". Retrieved on 2009-01-30.
- Venkatramani, Chitra and Tzi-cker Chiueh (1993). "Survey of Near-Line Storage Technologies: Devices and Systems". Experimental Computer Systems Laboratory.
- Ritchie, Arthur (2008), "Nearline and archiving in the data warehouse: what's the difference?". Database and Network Journal, October issue. Retrieved on 2012-08-28.
- Pearson, Tony (2010), "Correct use of the term Nearline." IBM Developerworks, Inside System Storage
- Seagate Technology Paper TP-543 (2005), . Retrieved on 2012-08-28.