Ceph (software)

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 85.173.114.233 (talk) at 17:48, 7 November 2014 (typo). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Ceph
Original author(s)Inktank Storage (Sage Weil, Yehuda Sadeh Weinraub, Gregory Farnum, Josh Durgin, Samuel Just, Wido den Hollander, Loïc Dachary)
Developer(s)Red Hat[1]
Stable release
0.87 "Giant" / 29 October 2014 (2014-10-29)
Repository
Operating systemLinux
RHEL
Debian
Ubuntu
SUSE
TypeDistributed object store
LicenseLGPL 2.1[2]
Websiteceph.com

Ceph is a free software storage platform designed to present object, block, and file storage from a single distributed computer cluster. Ceph's main goals are to be completely distributed without a single point of failure, scalable to the exabyte level, and freely-available. The data is replicated, making it fault tolerant.[3]

Ceph software runs on commodity hardware. The system is designed to be both self-healing and self-managing and strives to reduce both administrator and budget overhead.

Object storage

Ceph implements distributed object storage. Ceph’s software libraries provide client applications with direct access to the reliable autonomic distributed object store (RADOS) object-based storage system, and also provide a foundation for some of Ceph’s features, including RADOS Block Device (RBD), RADOS Gateway, and the Ceph File System.

The librados software libraries provide access in C, C++, Java, Python and PHP. The RADOS Gateway also exposes the object store as a RESTful interface which can present as both native Amazon S3 and OpenStack Swift APIs.

Block storage

Ceph’s object storage system allows users to mount Ceph as a thinly provisioned block device. When an application writes data to Ceph using a block device, Ceph automatically stripes and replicates the data across the cluster. Ceph’s RADOS Block Device (RBD) also integrates with kernel virtual machines (KVMs).

Ceph RBD interfaces with the same Ceph object storage system that provides the librados interface and the CephFS file system, and it stores block device images as objects. Since RBD is built on top of librados, RBD inherits librados's capabilities, including read-only snapshots and revert to snapshot. By striping images across the cluster, Ceph improves read access performance for large block device images.

Integrations

The block device is supported in virtualization platforms, such as:

These integrations allow administrators to use Ceph's block device as the storage for their virtual machines in these environments.

File System

Ceph’s file system (CephFS) runs on top of the same object storage system that provides object storage and block device interfaces. The Ceph metadata server cluster provides a service that maps the directories and file names of the file system to objects stored within RADOS clusters. The metadata server cluster can expand or contract, and it can rebalance the file system dynamically to distribute data evenly among cluster hosts. This ensures high performance and prevents heavy loads on specific hosts within the cluster.

Clients mount the POSIX-compatible file system using a Linux kernel client. On March 19, 2010, Linus Torvalds merged the Ceph client into Linux kernel version 2.6.34[4] which was released on May 16, 2010. An older FUSE-based client is also available. The servers run as regular Unix daemons.

Underlying file system

  • Production[5]
    • ext4 (small scale)
    • XFS (enterprise deployments)

History

Ceph was initially created by Sage Weil (developer of the Webring concept and co-founder of DreamHost) for his doctoral dissertation,[6] which was advised by Professor Scott A. Brandt in the Jack Baskin School of Engineering at the University of California, Santa Cruz and funded by the DOE/NNSA involving Lawrence Livermore National Laboratory, Los Alamos National Laboratory, and Sandia National Laboratories.

After his graduation in fall 2007, Weil continued to work on Ceph full-time, and the core development team expanded to include Yehuda Sadeh Weinraub and Gregory Farnum. In 2012, Weil created Inktank Storage for professional services and support for Ceph.[7][8]

In April of 2014 Red Hat purchased Inktank bringing the majority of Ceph development in-house.[9]

Argonaut

On July 3, 2012, the Ceph development team released Argonaut, the first major "stable" release of Ceph. This release will receive stability fixes and performance updates only, and new features will be scheduled for future releases.[10]

Bobtail (v0.56)

On January 1, 2013, the Ceph development team released Bobtail, the second major stable release of Ceph. This release focused primarily on stability, performance, and upgradability from the previous Argonaut stable series (v0.48.x).[11]

Cuttlefish (v0.61)

On May 7, 2013, the Ceph development team released Cuttlefish, the third major stable release of Ceph. This release included a number of feature and performance enhancements as well as being the first stable release to feature the 'ceph-deploy' deployment tool in favor of the previous 'mkcephfs' method of deployment.[12]

Dumpling (v0.67)

On August 14, 2013, the Ceph development team released Dumpling, the fourth major stable release of Ceph. This release included a first pass at global namespace and region support, a REST API for monitoring and management functions, improved support for RHEL-based platforms.[13]

Emperor (v0.72)

On November 9, 2013, the Ceph development team released Emperor, the fifth major stable release of Ceph. This release brings several new features, including multi-datacenter replication for the radosgw, improved usability, and lands a lot of incremental performance and internal refactoring work to support upcoming features in Firefly.[14]

Firefly (v0.80)

On May 7, 2014, the Ceph development team released Firefly, the sixth major stable release of Ceph. This release brings several new features, including erasure coding, cache tiering, primary affinity, key/value OSD backend (experimental), standalone radosgw (experimental).[15]

Giant (v0.87)

On October 29, 2014, the Ceph development team released Giant, the seventh major stable release of Ceph.[16]

Design

Ceph employs four distinct kinds of daemons:

All of these are fully distributed, and may run on the same set of servers. Clients directly interact with all of them.[18]

Ceph does striping of individual files across multiple nodes to achieve higher throughput, similarly to how RAID0 stripes partitions across multiple hard drives. Adaptive load balancing is supported whereby frequently accessed objects are replicated over more nodes.[citation needed]

Etymology

The name "Ceph" is a common nickname given to pet octopuses and derives from cephalopods, a class of molluscs, and ultimately from Ancient Greek κεφαλή (ke-pha-LEE), meaning "head" and πόδι (PO-dhi), meaning "leg". The name (emphasized by the logo) suggests the highly parallel behavior of an octopus and was chosen to connect the file system with UCSC's mascot, a banana slug called "Sammy".[19] Banana slugs are gastropods, which are also a class of molluscs.

See also

References

  1. ^ "Red Hat acquires Inktank". 2014-04-30. Retrieved 2014-08-19.
  2. ^ "LGPL2.1 license file in the Ceph sources". 2014-10-24. Retrieved 2014-10-24.
  3. ^ Jeremy Andrews (2007-11-15). "Ceph Distributed Network File System". KernelTrap.
  4. ^ Sage Weil (2010-02-19). "Client merged for 2.6.34". ceph.newdream.net.
  5. ^ "Hard Disk and File System Recommendations". ceph.com. Retrieved 28 March 2013.
  6. ^ Sage Weil (2007-12-01). "Ceph: Reliable, Scalable, and High-Performance Distributed Storage" (PDF). University of California, Santa Cruz.
  7. ^ Bryan Bogensberger (2012-05-03). "And It All Comes Together". Inktank Blog.
  8. ^ Joseph F. Kovar (July 10, 2012). "The 10 Coolest Storage Startups Of 2012 (So Far)". CRN. Retrieved July 19, 2013.
  9. ^ Red Hat Inc (2014-04-30). "Red Hat to Acquire Inktank, Provider of Ceph". Red Hat. Retrieved 2014-08-19.
  10. ^ Sage Weil (2012-07-03). "v0.48 "Argonaut" Released". Ceph Blog.
  11. ^ Sage Weil (2013-01-01). "v0.56 Released". Ceph Blog.
  12. ^ Sage Weil (2013-05-17). "v0.61 "Cuttlefish" Released". Ceph Blog.
  13. ^ Sage Weil (2013-08-14). "v0.67 Dumpling Released". Ceph Blog.
  14. ^ Sage Weil (2013-11-09). "v0.72 Emperor Released". Ceph Blog.
  15. ^ Sage Weil (2014-05-07). "v0.80 Firefly Released". Ceph Blog.
  16. ^ Sage Weil (2014-10-29). "v0.87 Giant Released". Ceph Blog.
  17. ^ "Btrfs - Ceph Wiki". Retrieved 2010-04-27.
  18. ^ Jake Edge (2007-11-14). "The Ceph filesystem". LWN.net.
  19. ^ "How the Banana Slug became UCSC's official mascot". Retrieved September 22, 2009.

Further reading

External links