Cloudera

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Cloudera, Inc.
Public
Traded asNYSECLDR
Russell 2000 Component
IndustrySoftware Development
HeadquartersPalo Alto, California
ProductsCloudera Enterprise Data Hub, Cloudera Analytic DB, Cloudera Operational DB, Cloudera Data Science and Data Engineering, Cloudera Fast Forward Labs, Cloudera Essentials, and Cloudera Altus. Components include: Cloudera Manager, Cloudera Navigator, Cloudera Data Science Workbench, Cloudera Navigator Optimizer, Cloudera Altus, Apache Hadoop, Apache Spark, Apache Impala, Apache Kudu, Apache Sentry, Apache Spot
ServicesApache Hadoop distribution with support, professional services and training
Number of employees
1,600[1]
Websitecloudera.com

Cloudera, Inc. is a US-based software company that provides a software platform for data engineering, data warehousing, machine learning and analytics that runs in the cloud or on premises.

Cloudera started as a hybrid open-source Apache Hadoop distribution, CDH (Cloudera Distribution Including Apache Hadoop), that targeted enterprise-class deployments of that technology. Cloudera states that more than 50% of its engineering output is donated upstream to the various Apache-licensed open source projects (Apache Spark, Apache Hive, Apache Avro, Apache HBase, and so on) that combine to form the Apache Hadoop platform. Cloudera is also a sponsor of the Apache Software Foundation.[2]

History[edit]

Cloudera was founded in 2008 by three engineers from Google, Yahoo! and Facebook (Christophe Bisciglia, Amr Awadallah and Jeff Hammerbacher, respectively) joined with a former Oracle executive (Mike Olson) to form Cloudera in 2008.[3] Olson was the CEO of Sleepycat Software, the creator of the open-source embedded database engine Berkeley DB (acquired by Oracle in 2006). Awadallah was from Yahoo!, where he ran one of the first business units using Apache Hadoop for data analysis.[4] At Facebook Hammerbacher used Hadoop for building analytic applications involving massive volumes of user data.[5]

Architect Doug Cutting, also a former chairman of the Apache Software Foundation, authored the open-source Lucene and Nutch search technologies before he and Mike Cafarella wrote the initial Hadoop software in 2004. He designed and managed a Hadoop storage and analysis cluster at Yahoo! before joining Cloudera in 2009. The chief operating officer was Kirk Dunn until 2015.[6]

In March 2009, Cloudera announced the availability of Cloudera Distribution Including Apache Hadoop in conjunction with a $5 million investment led by Accel Partners.[7] In 2011, the company raised a further $40 million from Ignition Partners, Accel Partners, Greylock Partners, Meritech Capital Partners, and In-Q-Tel, a venture capital firm with open connections to the CIA.[8]

In June 2013, Tom Reilly became chief executive, although Olson remained as chairman of the board and chief strategist. Reilly was chief executive at ArcSight when it was acquired by Hewlett-Packard in 2010.[9] In March 2014 Cloudera announced a $900 million funding round, led by Intel Capital ($740 million), for which Intel received an 18% share in Cloudera and Intel dropped its own Hadoop distribution and dedicated 70 Intel engineers to work exclusively on Cloudera projects. Additional funds came from T Rowe Price; Google Ventures; an affiliate of MSD Capital, L.P., the private investment firm for Michael S. Dell; and others.[10]

In January 2012, Oracle Corporation announced a partnership with Cloudera for its Oracle Big Data Appliance.[11] In January 2013, Dell announced a partnership with Cloudera.[12] In March 2013, Intel invested $740 million in Cloudera for an 18% investment.[13] In May 2013, SAS Institute announced a partnership.[14] In June 2014, Accenture announced a service offering based on Cloudera.[15] In June 2014, Cloudera acquired Gazzang, which developed encryption and key management software.[16] In October 2014, Cloudera announced the first Payment Card Industry Data Security Standard (PCI) with MasterCard.[17] In February 2015, Deloitte announced an alliance with Cloudera.[18] In May 2015, Capgemini announced a marketing program for SAP HANA and Cloudera.[19] On July 9, 2015, Cloudera announced a partnership with Teradata.[20] In September 2015, Cloudera announced the Kudu storage manager.[21] In September 2015, Microsoft Azure announced full support of Cloudera Enterprise.[22]

In January 2016, Tata Consultancy Services announced an Internet of things framework based on Cloudera for sensor data analytics.[23] In February 2016, EMC announces evolution in advanced storage with DSSD support for Cloudera[24]

In 2016, Cloudera was ranked #5 on the Forbes Cloud 100 list.[25]

Cloudera filed for an initial public offering in March 2017,[26] and on April 28, 2017, its shares were listed on the New York Stock Exchange under the symbol CLDR.[27]

In September 2017, Cloudera acquired Fast Forward Labs (FFL), a leading machine learning and applied artificial intelligence research and development company in an effort to deepen Cloudera’s expertise in the application of machine learning to practical business problems. The new division is headed up by FFL co-founder and CEO Hilary Mason.[28]

In October 2018, Cloudera and Hortonworks announced they would be merging in an all-stock merger of equals.[29]

Products and Services[edit]

Cloudera offers software, services and support in five bundles available both on-premise and across multiple cloud providers:

  • Cloudera Enterprise Data Hub - Cloudera’s comprehensive data management platform including all of Data Science & Engineering, Operational DB, Analytic DB, and Cloudera Essentials.[30]
  • Cloudera Analytic DB - Cloudera’s technologies that enable fast, flexible, and scalable Business Intelligence (BI) and SQL analytics built on the core Cloudera Essentials platform.[31]
  • Cloudera Operational DB - Cloudera’s high-scale NoSQL technologies for real-time, data applications built on the core Cloudera Essentials platform.[32]
  • Cloudera Data Science and Engineering - Cloudera’s technologies that enable efficient, high-scale data processing, data science, and machine learning on top of the Core Essentials platform.[33]
  • Cloudera Essentials - Cloudera’s core data management platform for fast, easy, and secure large-scale data processing that includes Cloudera’s enterprise-ready management capabilities (Cloudera Manager) and open source platform distribution (CDH).[34]

Cloudera also offers a managed-service offering on the cloud:

  • Altus Data Engineering which provides a cloud-native offering of Cloudera Data Engineering.[35]

Cloudera also offers the following free software versions:

  • Cloudera Express - includes Cloudera’s CDH open-source platform and a no-charge version of its deployment, monitoring, and administration suite, Cloudera Manager.[36]
  • CDH (Cloudera’s Distribution including Apache Hadoop) - is Cloudera’s 100% open source platform distribution including Apache Hadoop, Apache Spark, Apache Impala, Apache Kudu, Apache HBase, and many more.[37]

As part of the software bundles above Cloudera offers the following additional technologies in addition to its open-source distribution:

  • Cloudera Director - a tool distributed without charge that enables easy deployment of cloud-native Cloudera clusters on-demand across multiple cloud providers.[38]
  • Cloudera Data Science Workbench - A data science tool for secure collaboration and model development add-on for Cloudera Enterprise Data Engineering and Data Science as well as Cloudera Enterprise Data Hub.[39]
  • Cloudera Navigator - critical data governance functionality for Cloudera’s platform, offering capabilities such as data discovery, audit, lineage, metadata management, encryption, encryption key management, and policy enforcement to help meet regulatory compliance requirements.[40]
  • Cloudera Navigator Optimizer - a software-as-a-service tool to assist in identifying, migrating, and tuning traditional database workloads to Cloudera’s platform as well as analyze and tune workloads running on Cloudera’s platform.[41]
  • Cloudera Manager - an administrative tool for fast, easy, and secure deployment, monitoring, alerting, and management of Cloudera’s platform.[42]

References[edit]

  1. ^ "About Cloudera". Cloudera.
  2. ^ "Apache Software Foundation Sponsorship". Retrieved 28 August 2012.
  3. ^ Vance, Ashlee (16 March 2009). "Bottling the Magic Behind Google and Facebook". The New York Times. Retrieved 20 January 2014.
  4. ^ "This Former Yahoo-er's Startup Is So Hot, Even the CIA Invested In It". Archived from the original on February 9, 2012. Retrieved 28 August 2012.
  5. ^ "This Tech Bubble Is Different". Retrieved 28 August 2012.
  6. ^ "Bloomberg Business Week, Executive Profile Kirk Dunn". Retrieved 30 September 2012.
  7. ^ Wauters, Robin (16 March 2009). "Cloudera Raises $5 Million Series A Round For Hadoop Commercialization". TechCrunch. Retrieved 22 April 2010.
  8. ^ "Hadoop-based startup Cloudera raises $40M from Ignition Partners, Accel, Greylock". Retrieved 28 August 2012.
  9. ^ Timothy Prickett Morgan (20 June 2013). "Cloudera taps new CEO for inevitable IPO push or acquisition: Former CEO becomes chairman and chief strategist". The Register. Retrieved 20 January 2014.
  10. ^ Noel Randewich (31 March 2014). "Intel invested $740 million to buy 18 percent of Cloudera". Reuters.
  11. ^ "Oracle Selects Cloudera to Provide Apache Hadoop Distribution and Tools for Oracle Big Data Appliance".
  12. ^ Dell us. "Dell Apache Hadoop Solutions - Dell". Dell.
  13. ^ "Intel invested $740 million to buy 18 percent of Cloudera".
  14. ^ "How the SAS and Cloudera Platforms Work Together". Cloudera Engineering Blog.
  15. ^ "Accenture Forms Alliance with Cloudera to Empower Enterprises with Data as a Platform Offering".
  16. ^ "cloudera strengthens hadoop security with acquisition of gazzang". Cloudera.
  17. ^ "cloudera enterprise certified for full pci compliance". Cloudera.
  18. ^ "cloudera deloitte announce strategic alliance". Cloudera.
  19. ^ "Insights-Driven Operations with SAP HANA and Cloudera Enterprise". Capgemini Capgemini Worldwide.
  20. ^ "Cloudera and Teradata Announce Integrated, Enterprise-Ready Appliance for Hadoop". Cloudera.
  21. ^ "Kudu: New Apache Hadoop Storage for Fast Analytics on Fast Data". Cloudera Engineering Blog.
  22. ^ "Full support of Cloudera Enterprise on Azure". Microsoft.
  23. ^ "TCS Sensor Data Analytics IoT Framework with Cloudera". Cloudera.
  24. ^ "EMC DSSD and Cloudera Evolve Hadoop: Innovating to deliver high-performance enterprise analytics on Hadoop" (PDF). EMC Corporation. 2016. Retrieved April 25, 2016.
  25. ^ "Forbes Cloud 100". Forbes. Retrieved 28 October 2016.
  26. ^ "Cloudera SEC S-1 filing". Retrieved April 5, 2017.
  27. ^ Balakrishnan, Anita (April 28, 2017). "Cloudera shares close more than 20% higher on Day 1". CNBC. Retrieved April 29, 2017.
  28. ^ "Cloudera acquires AI research firm Fast Forward Labs". TechCrunch. Retrieved 3 January 2018.
  29. ^ "Cloudera and Hortonworks Announce Merger to Create World's Leading Next Generation Data Platform and Deliver Industry's First Enterprise Data Cloud". BusinessWire. Retrieved 3 October 2018.
  30. ^ "Cloudera Enterprise Data Hub". Retrieved 2 January 2018.
  31. ^ "Cloudera Analytic DB". Retrieved 2 January 2018.
  32. ^ "Cloudera Operational DB". Retrieved 2 January 2018.
  33. ^ "Cloudera Data Science and Engineering". Retrieved 2 January 2018.
  34. ^ "Cloudera Essentials". Retrieved 2 January 2018.
  35. ^ "Altus Data Engineering". Retrieved 2 January 2018.
  36. ^ "Cloudera Express". Retrieved 2 January 2018.
  37. ^ "Cloudera's Distribution including Apache Hadoop". Retrieved 2 January 2018.
  38. ^ "Cloudera Director". Retrieved 2 January 2018.
  39. ^ "Cloudera Data Science Workbench". Retrieved 2 January 2018.
  40. ^ "Cloudera Navigator". Retrieved 2 January 2018.
  41. ^ "Cloudera Navigator Optimizer". Retrieved 2 January 2018.
  42. ^ "Cloudera Manager". Retrieved 2 January 2018.

External links[edit]