Dell EMC ScaleIO

From Wikipedia, the free encyclopedia
  (Redirected from EMC ScaleIO)
Jump to navigation Jump to search
Dell EMC
Subsidiary
Traded as NYSE: EMC (1986–2016)[1]
Industry Computer storage
Founded 1979; 39 years ago (1979)
Founders
Headquarters Hopkinton, Massachusetts, United States
Area served
Worldwide
Key people
Jeff Clarke
(President, Infrastructure Solutions Group, Dell EMC)
Products See EMC products
Parent Dell Technologies
Website dellemc.com

Dell EMC ScaleIO is a software-defined storage product from Dell EMC that creates a server-based storage area network (SAN) from local application server storage using existing customer hardware or EMC servers. It converts direct-attached storage into shared block storage. ScaleIO is sold as software-only or preinstalled on commodity hardware ("ScaleIO Node").[2] EMC promotes its ScaleIO server storage-area network software as a way to converge computing resources and commodity storage into a "single-layer architecture."[3]

ScaleIO can scale from three compute/storage nodes to over 1,000 nodes that can drive up to 240 million IOPS of performance. Developers can deploy the ScaleIO software on-prem commodity infrastructure or in the cloud and then port their applications back into a production ScaleIO instance.[4] As of September, 2015, ScaleIO is also available from the company bundled on EMC commodity computing servers (officially called EMC ScaleIO Node).[5][6]

ScaleIO can be deployed as storage only or as a converged infrastructure combining storage, computational and networking resources into a single block. Capacity and performance of all available resources are aggregated and made available to every participating ScaleIO server and application. Storage tiers can be created with media types and drive types that match the ideal performance or capacity characteristics to best suit the application needs. It is available for free for testing (with community support[7]) or as a paid-for EMC-supported option.

History[edit]

ScaleIO was founded in 2011 by Boaz Palgi, Erez Webman, Lior Bahat, Eran Borovik, and Erez Ungar in Israel.[8] The software was designed for high performance and large systems.[9] The company was backed by venture capital firms including Greylock Partners and Norwest Venture Partners.[10] A product was announced in November 2012.[11]

EMC Corporation bought ScaleIO in June 2013 for about $200 million, only about six months after the company emerged from stealth mode.[12][8] EMC began promoting ScaleIO in 2014 and 2015, marketing it in competition with EMC’s own data storage arrays. Also in 2015, EMC introduced a model of its VCE (company) converged infrastructure hardware that supported ScaleIO storage.[13]

At its 2015 trade show, EMC announced that ScaleIO would be made freely available to developers for testing. By May, 2015, developers could download the ScaleIO software.[4] In September 2015, EMC announced the availability of the previously software-only ScaleIO pre-bundled on EMC commodity hardware, called EMC ScaleIO Node.[14]

In May, 2017, Dell EMC announced ScaleIO.Next, featuring inline compression, thin provisioning and flash-based snapshots. The new release features enhanced snapshot tools and features and full support for VMware Virtual Volumes (VVols), as well as volume migration for deployments that want to take advantage of lower cost media for low-priority data.[15]

In March, 2018, The Register reported that Dell would restrict future ScaleIO sales to installations of Dell's VxRACK FLEX engineered solution.[16]

Architecture[edit]

EMC ScaleIO uses storage and compute resources of commodity hardware. It combines HDDs, SSDs, and PCIe flash cards to create a virtual pool of block storage with varying performance tiers. It features on-demand performance and storage scalability, as well as enterprise-grade data protection, multi-tenant capabilities, and add-on enterprise features such as QoS, thin provisioning and snapshots. ScaleIO operates on multiple hardware platforms and supports physical and/or virtual application servers.[17]

ScaleIO works by installing software components on application hosts. Application hosts contribute internal disks and any other direct attached storage resources to the ScaleIO cluster by installing the SDS software. Hosts can then be presented volumes from the ScaleIO cluster by leveraging the SDC software. These components can run alongside other applications on any server (physical, virtual, or cloud) using any type of storage media (disk drives, flash drives, PCIe flash cards, or cloud storage).[18]

The ScaleIO architecture is built on two components: a data client and a data server. The ScaleIO Data Client (SDC) is a lightweight device driver situated in each host whose application or file system requires access to the ScaleIO virtual SAN block devices. The SDC exposes block devices representing the ScaleIO volumes that are currently mapped to that host. The SDCs maintain a small in-memory map, being able to maintain mapping of petabytes of data with just megabytes of RAM. The inter-node protocol used by SDCs is simpler than iSCSI and uses fewer network resources.[9]

The ScaleIO Data Server (SDS) is situated in each host and contributes local storage to the central ScaleIO virtual SAN. Each node is part of a loosely coupled cluster.[9]

Performance is expected to increase as servers and storage devices are added to the cluster.[19][18] Additional storage and compute resources (i.e., additional servers and drives) can be added modularly. Every server in the ScaleIO cluster is used in the processing of I/O operations, making all I/O and throughput accessible to any application within the cluster. Any needed rebuilds and rebalances are processed in the background. Workloads are evenly shared with a parallel I/O architecture.[20]

ScaleIO can be deployed in either a "two-layer" multi-server cluster in which the application and storage are installed in separate servers, or as "hyper-converged" option where the application and storage are installed on the same servers in the ScaleIO cluster, creating a low-footprint, low-cost scalable single-layer architecture. Capacity and performance of all available resources are aggregated and made available to every participating ScaleIO server and application. Storage tiers can be created with media types and drive types that match the ideal performance or capacity characteristics to best suit the application needs.[18]

Storage and compute resources can be added to or removed from the ScaleIO cluster as needed, with no downtime and minimal impact to application performance. The self-healing, auto-balancing capability of the ScaleIO cluster ensures that data is automatically rebuilt and rebalanced across resources when components are added, removed, or failed. Because every server and local storage device in the cluster is used in parallel to process I/O operations and protect data, system performance scales linearly as additional servers and storage devices are added to the configuration.[18]

ScaleIO software takes each data chunk to be written and spreads it across many nodes, mirroring it as well. This makes data rebuilds from disk loss very fast as several nodes contribute their own smaller, faster and parallel rebuild efforts to the whole. ScaleIO supports VMware, Hyper-V, Xen and KVM hypervisors. It also supports OpenStack, Windows, Red Hat, SLES, CentOS, and CoreOS (docker). Any app needing block storage can use it, including Oracle and other databases. While it is not as closely integrated with VMware as Virtual SAN, the SDC functionality has moved into the VMware kernel.[9]

Promotional reports were commissioned from the Enterprise Strategy Group in 2014 and 2015 including performance measurements.[20][18]

References[edit]

  1. ^ "Investor Relations: Frequently Asked Questions - EMC.com". EMC Investor Relations. EMC Corporation. Retrieved 6 September 2016.
  2. ^ "ScaleIO". SearchConvergedInfrastructure. TechTarget.
  3. ^ "EMC Bundles ScaleIO Software, Servers". EnterpriseTech. 2015-09-16.
  4. ^ a b "ESG Blog: EMC Embracing Freemium and Open Source with ScaleIo and ViPR". Enterprise Strategy Group, Inc. 2015-05-05.
  5. ^ "EMC Introduces ScaleIO Nodes". StorageReview.com. 2015-09-16.
  6. ^ "ScaleIO Node - what's the scoop, and what's up?". Virtual Geek. October 29, 2015. Archived from the original on March 30, 2016. Retrieved August 30, 2016.
  7. ^ "ScaleIO Community web site". EMC Corporation. Retrieved August 30, 2016.
  8. ^ a b Shmulik Shelach (July 11, 2013). "EMC Buys Israeli Start-Up ScaleIO For About $200 Million In Cash". Jewish Business News. Retrieved August 30, 2016.
  9. ^ a b c d "How to storm the virtual heights of SAN". The Register. 2015-06-03.
  10. ^ "ScaleIO". TechCrunch. 2015-11-04.
  11. ^ Chris Mellor (December 6, 2012). "ScaleIO uncloaks, offers glimpses of mighty SANitisation plans: Virtual storage will be a pocket miracle - if it works". The Register. Retrieved August 30, 2016.
  12. ^ "ScaleIO joins the pack of pooled storage startups with $12M". GigaOM. 2012-12-04.
  13. ^ "Server SAN 2012-2026" (PDF). Wikibon. 2015-07-15.
  14. ^ "EMC Introduces ScaleIO Nodes". StorageReview.com, Inc. 2015-09-15.
  15. ^ "Dell EMC ScaleIO.Next Announced". StorageReview.com, Inc. 2017-05-08.
  16. ^ "Want ScaleIO virtual SAN software? You'll need to buy Dell servers". The Register. 2018-03-01.
  17. ^ "EMC ScaleIO: Software-Defined, Scale-Out SAN" (PDF). EMC Corporation.
  18. ^ a b c d e "EMC ScaleIO: Proven Performance and Scalability". Enterprise Strategy Group, Inc. 2015-09-01.
  19. ^ "EMC Announces "No Restrictions" Download For ScaleIO And Future Enhancements". 2015-05-05.
  20. ^ a b "EMC ScaleIO: Transforming Commodity Hardware into Simple, Scalable, High-performance, Shared Storage". Enterprise Strategy Group. April 1, 2014. Retrieved August 30, 2016.