||This article appears to be written like an advertisement. (January 2014)|
|Stable release||1.4 / July 10, 2013|
|Operating system||Linux, BSD, Mac OS X, Solaris|
|License||Apache License 2.0|
Riak is an open-source, fault-tolerant key-value NoSQL database implementing the principles from Amazon's Dynamo paper. with heavy influence from Dr. Eric Brewer's CAP Theorem. Written in Erlang, Riak is known for its ability to distribute data across nodes using consistent hashing in a simple key/value scheme in namespaces called buckets.
Riak is used by thousands of companies worldwide, including over 25% of the Fortune 50. Examples include AT&T, AOL, Ask.com, Best Buy, Boeing, Bump, Braintree, Comcast, DataPipe, Gilt Group, UK National Health Services (NHS), OpenX, Rovio, Symantec, TBS, The Weather Channel, WorkDay, Voxer, Yahoo! Japan, Yandex, and many more. According to the DB-Engines ranking, Riak is one of the most popular key-value stores.
- Fault-Tolerant Availability
- Riak replicates key/value stores across a cluster of nodes with a default n_val of three. In the case of node outages due to network partition or hardware failures, data can still be written to a neighboring node beyond the initial three, and read-back due to its "masterless" peer-to-peer architecture.
- Predictable Latency
- Riak evenly distributes data across nodes with consistent hashing and can provide an excellent latency profile, even in the case of multiple node failures. Key/Values can be stored in memory, disk, or a combination depending on which pluggable backend one chooses.
- Multi-Datacenter Replication
- In multi-datacenter replication, one cluster acts as a "primary cluster". The primary cluster handles replication requests from one or more "secondary clusters" (generally located in other regions or countries). If the datacenter with the primary cluster goes down, a secondary cluster can take over as the primary cluster.
- There are two primary modes of operation: fullsync and realtime. In fullsync mode, a complete synchronization occurs between primary and secondary cluster(s), by default every 360 minutes. In realtime mode, continual, incremental synchronization occurs - replication is triggered by new updates.
Licensing and Support
Riak is available for free under the Apache 2 License. In addition, Basho Technologies offers commercial licenses with subscription support and the ability for MDC (Multi Data Center) Replication.
Riak has official drivers for Ruby, Java, Erlang, Python, PHP, and C/C++. There are also a numerous amount of community-supported drivers for other programming languages and frameworks.
Riak was developed at Basho Technologies to initially power a web Sales Force Automation application by former engineers and executives from Akamai. There was more interest in the datastore technology than the applications built on it, so the company decided to build a business around Riak itself, gaining adoption throughout the Fortune 100 and becoming a foundation to many of the world’s fastest-growing Web-based, mobile and social networking applications, as well as cloud service providers. Releases after graduation include
- 1.1, released Feb 21 2012, added Riaknostic, enhanced error logging and reporting, improved resiliency for large clusters, and a new graphical operations and monitoring interface called Riak Control.
- 1.4, released July 10, 2013, added counters, secondary indexing improvements, reduced object storage overhead, handoff progress reporting, and enhancements to MDC replication.
- 2.0, technical preview released October 29, 2013, added new data types including sets, maps, registers, and flags simplifying application development. Strong consistency by bucket, full-text integration with Apache Solr, Security, and reduced replicas for Secondary sites.
- Apache's Erlang-based CouchDB (open source)
- Apache's Java-based Accumulo (open source)
- NoSQL, i.e., Structured storage
- "Riak: An Open Source Scalable Data Store". 28 November 2010. Retrieved 13 October 2011.
- Dynamo: Amazon’s Highly Available Key-value Store, SOSP 2007
- "Basho: Bitcask". Riak Wiki.
- "GigaOM: Basho takes aim at more enterprises with upgrades".
- "The Register: NHS tears out its Oracle Spine in favour of open source".
- "Computing: Angry Birds creator Rovio deploys Basho Riak NoSQL solution".
- "Riak Customers". Riak. Retrieved 21 February 2013.
- "DB-Engines Ranking of Key-value Stores".
- "Riak Client Libraries and Community Code". Retrieved 19 March 2011.
- "Case Study: Inaka deploys Boundary to monitor its Amazon Cloud and Storm On Demand servers". Boundary. December 21, 2012.
- Official Riak Wiki
- Official website
- Code on GitHub
- Basho website
- The Little Book of Riak
- GigaOM: Basho arms would-be Amazon killers with AWS-compatible storage
- NetCircle: Using Riak at NetCircle
- Riak at AOL
- Riak at Temetra for Smart Metering
- Using Riak for Ranking Collection at SEOmoz
- Using Riak at Github on Rackspace Cloud
- Searching with RiakSearch at inagist.com
- Riak and Scala at Yammer
- Riak in Production at Voxer
- Riak Benchmarks on Joyent Cloud
- Airbrake migration from MongoDB to Riak
- A vendor-independent comparison of NoSQL databases: Cassandra, HBase, MongoDB, Riak (NetworkWorld)