Data Domain (corporation)

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Data Domain Corporation
Subsidiary of EMC Corporation
Industry deduplication
Founded 2001
Founder Kai Li, Brian Biles
Headquarters Santa Clara, California

Data Domain Corporation was an Information Technology company from 2001-2009 specializing in target-based deduplication solutions for disk based backup.[1] Since its acquisition by EMC Corporation in 2009, Data Domain systems have been EMC's flagship protection storage platform for backup, archive, and disaster recovery.


Data Domain was founded by Kai Li, Ben Zhu, and Brian Biles.[2] Chief Architect Hugo Patterson joined 3 months after initial funding. The company incubated in a series of venture capital offices; pre-funding at USVP (where Ben was an EIR), then at NEA (where Kai was an EIR), and post-funding at Greylock (NEA and Greylock provided Series A funding in 2002). The first product revenue was in the beginning of 2004.

The goal of the company was to minimize the tape automation market with a disk-based substitute. It did this by inventing a very fast implementation of lossless data compression, optimized for streaming workloads, which compares incoming large data segments against all others stored in its multi-TB store. Originally categorized as "capacity optimization" by industry analysts, it later became more widely known as inline "data deduplication" Also, unlike most non-archival computer storage products, it went to extreme technical lengths to ensure data longevity (vs. system longevity). Its design goal was to be "the storage of last resort." Unlike most of its early competition, it was first packaged as a file-system appliance; this made it more predictable than a software product and simpler to manage than a virtual tape library system.

Former CEO Frank Slootman published a book about his experiences in 2011.[3]

Products and services[edit]

The first Data Domain system, the DD200 in 2004, had a 1.25 TB addressable capacity and was able to accept data at a rate of 40 MB/sec. Because its implementation put most of the system stress on CPU/RAM, rather than disk I/O, it was able to improve at the rate of Intel technology. EMC's current line of Data Domain systems have since significantly surpassed this capacity and speed:

DD2200 DD2500 DD4200 DD4500 DD7200 DD9500
Logical Capacity 40-860 TB 1.3-6.6 PB 1.8-9.4 PB,
3.7-18.9 PB*
2.8-14.2 PB,
5.7-28.5 PM*
4.2-21.4 PB,
8.5-42.8 PB*
8.6-43.2 PB,
17.2-86.4 PB*
Usable Capacity Up to 17.2 TB Up to 133 TB Up to 189 TB,
Up to 378 TB*
Up to 285 TB,
Up to 570 TB*
Up to 428 TB,
Up to 856 TB*
Up to 864 TB,
Up to 1.7 PB*
Speed 3.8 TB/hr 5.6 TB/hr 10.6 TB/hr 10.6 TB/hr 12.6 TB/hr 27.7 TB/hr
with DD Boost
4.7 TB/hr 13.4 TB/hr 25.6 TB/hr 25.6 TB/hr 28.3 TB/hr 58.7 TB/hr
* With DD Extended Retention[4]

Acquisition by EMC[edit]

In June 2009, EMC Corporation announced their intention to acquire Data Domain Corp for $2.4B, outbidding NetApp's previous offer.[5] In July, the two companies reached definitive agreement regarding the acquisition. Since then, Data Domain systems have been a product line brand within the EMC Core Technologies portfolio. According to IDC, EMC in 2014 captured 62.3% share of the market for purpose-built backup devices worldwide. The majority of this share was from Data Domain product revenue.[6]

Since acquiring Data Domain, EMC has integrated the platform with its Data Protection Suite software and begun offering expanded software enhancements. According to a 2013 IDC analysis, Data Domain reduced loss of user productivity from backup, restore, and retrieval operations by 81% and the researchers state that “consolidating backup and archive data on a Data Domain system can have a dramatic impact on overall costs and operations.”[7]


External links[edit]