Apache Kudu
Stable release | 1.4.0 / 13 June 2017[1] |
---|---|
Repository | |
Operating system | Cross-platform |
Type | Database management system |
License | Apache License 2.0 [2] |
Website | kudu |
Apache Kudu is a free and open-source column-oriented data store of the Apache Hadoop ecosystem. It is compatible with most of the data processing frameworks in the Hadoop environment. It completes Hadoop's storage layer to enable fast analytics on fast data.[3]
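The open-source project to build Apache Kudu began as an internal project at Cloudera.[4] The first public version, Kudu 0.5.0 (beta), was released on 28 September 2015; the first release not considered beta, Apache Kudu 1.0.0, was released on 19 September 2016.[5]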
Comparison with other storage engines
According to the project, Kudu shares some characteristics with HBase: like HBase, Kudu is a "real-time store that supports key-indexed record lookup and mutation."[6] Kudu differs from HBase in that its data model is more traditionally relational, whereas HBase is schemaless. Kudu's "on-disk representation is truly columnar and follows an entirely different storage design than HBase/BigTable".[6]
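The combination described here (a fixed, typed schema, column-wise storage, and key-indexed lookup and mutation) can be illustrated with a toy sketch. The Python below is not Kudu's API and does not reflect its actual on-disk design; `ColumnarTable` and its methods are hypothetical names invented for illustration, and in-memory lists stand in for storage only loosely.

```python
from bisect import bisect_left


class ColumnarTable:
    """Toy table (hypothetical, not Kudu): fixed typed schema,
    one list per column, rows kept sorted by primary key."""

    def __init__(self, schema, key):
        # schema maps column name -> Python type and is fixed up front,
        # unlike a schemaless store where each row may carry any fields.
        self.schema = dict(schema)
        self.key = key
        self.columns = {name: [] for name in self.schema}

    def _check(self, row):
        # Reject rows whose fields or types do not match the schema.
        if set(row) != set(self.schema):
            raise ValueError("row does not match schema")
        for name, typ in self.schema.items():
            if not isinstance(row[name], typ):
                raise TypeError(f"{name} must be {typ.__name__}")

    def _find(self, key_value):
        # Binary search over the sorted key column.
        keys = self.columns[self.key]
        i = bisect_left(keys, key_value)
        return i, i < len(keys) and keys[i] == key_value

    def upsert(self, row):
        # Key-indexed mutation: overwrite in place or insert in key order.
        self._check(row)
        i, found = self._find(row[self.key])
        for name, values in self.columns.items():
            if found:
                values[i] = row[name]
            else:
                values.insert(i, row[name])

    def get(self, key_value):
        # Key-indexed lookup: reassemble one row from the column lists.
        i, found = self._find(key_value)
        if not found:
            return None
        return {name: values[i] for name, values in self.columns.items()}

    def scan(self, column):
        # Analytic-style scan: reads a single column without touching others.
        return list(self.columns[column])


table = ColumnarTable({"host": str, "cpu": float}, key="host")
table.upsert({"host": "b", "cpu": 0.5})
table.upsert({"host": "a", "cpu": 0.1})
table.upsert({"host": "b", "cpu": 0.9})  # mutates the existing row for key "b"
```

In this sketch, `table.get("b")` performs a lookup by key, while `table.scan("cpu")` reads one column in isolation, which is the access pattern a columnar layout is meant to favor.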
See also
- Pig (programming tool)
- Apache Hive
- Cloudera Impala
- Apache Parquet
- Apache Drill
- Apache Spark
- Apache Thrift
- ClickHouse
References
- ^ "Apache Kudu - Releases". Retrieved 13 June 2017.
Kudu 1.4.0 was released on June 13, 2017.
- ^ "Project Status". 2017-05-21. Archived from the original on 2017-05-21. Retrieved 2017-05-21.
Is Kudu open source? Yes, Kudu is open source and licensed under the Apache Software License, version 2.0. Apache Kudu is a top level project (TLP) under the umbrella of the Apache Software Foundation.
- ^ https://kudu.apache.org/
- ^ "Why was Kudu developed internally at Cloudera before its release?". 2017-05-21. Retrieved 2017-05-21.
- ^ "Apache Kudu releases". 2017-05-21. Archived from the original on 2017-05-21. Retrieved 2017-05-21.
Kudu 1.0.0 was released on September 19, 2016. It is the first release not considered "beta". […] Kudu 0.5.0 (beta) was released on Sep 28, 2015. It was the first public version of Kudu.
- ^ a b "Why build a new storage engine? Why not just improve Apache HBase to increase its scan speed?". 2017-05-21. Archived from the original on 2017-05-21. Retrieved 2017-05-21.