This article relies too much on references to primary sources. (June 2019) (Learn how and when to remove this template message)
1.5.4 / 10 June 2019
|Type||Database management system|
|License||Apache License 2.0|
Apache CarbonData is a free and open-source column-oriented data storage format of the Apache Hadoop ecosystem. It is similar to the other columnar-storage file formats available in Hadoop namely RCFile and ORC. It is compatible with most of the data processing frameworks in the Hadoop environment. It provides efficient data compression and encoding schemes with enhanced performance to handle complex data in bulk.
CarbonData was first developed at Huawei in 2013. The project was submitted to the Apache Incubator in June 2016, and had its first official release two months later. The project won top honors in the BlackDuck 2016 Open Source Rookies of the Year's Big Data category. Since May 1, 2017, Apache CarbonData is a top-level Apache Software Foundation (ASF)-sponsored project.
- Pig (programming tool)
- Apache Hive
- Apache Impala
- Apache Drill
- Apache Kudu
- Apache Spark
- Apache Thrift
- Apache Parquet
- Presto (SQL query engine)
- Foundation, The Apache Software (May 1, 2017). "The Apache Software Foundation Announces Apache® CarbonData™ as a Top-Level Project". GlobeNewswire News Room.
- "Releases - CarbonData - Apache Software Foundation". cwiki.apache.org.