Cloudera

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Cloudera
Privately held
Industry Software Development
Headquarters Palo Alto, California
Number of locations

Palo Alto, Chicago, San Francisco, New York, North Carolina, Virginia, Austin Texas, New Hampshire, Atlanta Georgia, Kirkland WA, McLean VA

EMEA - London, Paris, Budapest.

Asia / APJ - Shanghai China, Beijing China, Tokyo Japan, Singapore, Seoul Korea, Melbourne Australia, India Chennai
Products Cloudera Manager, Cloudera Navigator, Gazzang, Cloudera Navigator Optimizer, Impala
Services Apache Hadoop distribution with support, professional services and training
Number of employees
1,400[1]
Website www.cloudera.com

Cloudera Inc. is an American-based software company that provides Apache Hadoop-based software, support and services, and training to business customers.

Cloudera's open-source Apache Hadoop distribution, CDH (Cloudera Distribution Including Apache Hadoop), targets enterprise-class deployments of that technology. Cloudera says that more than 50% of its engineering output is donated upstream to the various Apache-licensed open source projects (Apache Hive, Apache Avro, Apache HBase, and so on) that combine to form the Hadoop platform. Cloudera is also a sponsor of the Apache Software Foundation.[2]

History[edit]

Three engineers from Google, Yahoo and Facebook (Christophe Bisciglia, Amr Awadallah and Jeff Hammerbacher, respectively) joined with a former Oracle executive (Mike Olson) to form Cloudera in 2008.[3] Olson was the CEO of Sleepycat Software, the creator of the open-source embedded database engine Berkeley DB (acquired by Oracle in 2006). Awadallah was from Yahoo, where he ran one of the first business units using Hadoop for data analysis.[4] At Facebook Hammerbacher used Hadoop for building analytic applications involving massive volumes of user data.[5]

Architect Doug Cutting, also a former chairman of the Apache Software Foundation, authored the open-source Lucene and Nutch search technologies before he wrote the initial Hadoop software in 2004. He designed and managed a Hadoop storage and analysis cluster at Yahoo! before joining Cloudera in 2009. Chief operating officer was Kirk Dunn.[6]

In March 2009, Cloudera announced the availability of Cloudera Distribution Including Apache Hadoop in conjunction with a $5 million investment led by Accel Partners.[7] In 2011, the company raised a further $40 million from Ignition Partners, Accel Partners, Greylock Partners, Meritech Capital Partners, and In-Q-Tel, a venture capital firm with open connections to the CIA.[8]

In June 2013 Tom Reilly became chief executive, although Olson remained as chairman of the board and chief strategist. Reilly was chief executive at ArcSight when it was acquired by Hewlett-Packard in 2010.[9] In March 2014 Cloudera announced a $900 million funding round, led by Intel Capital ($740 million), for which Intel received 18% share in Cloudera and Intel dropped its own Hadoop distribution and dedicated 70 Intel engineers to work exclusively on cloudera projects. With additional funds coming from T Rowe Price, Google Ventures and an affiliate of MSD Capital, L.P., the private investment firm for Michael S. Dell. and others.[10]

In January 2012 Oracle Corporation announced a partnership with Cloudera for its Oracle Big Data Appliance.[11] In January 2013 Dell announced a partnership with Cloudera.[12] In March 2013 Intel invested $740 million in Cloudera for an 18% investment.[13] In May 2013 SAS Institute announced a partnership.[14] In June 2014 Accenture announced a service offering based on Cloudera.[15] In June 2014 Cloudera acquired Gazzang, which developed encryption and key management software.[16] In October 2014 Cloudera announces the first Payment Card Industry Data Security Standard (PCI) Hadoop distribution with MasterCard.[17] In February 2015 Deloitte announced an alliance with Cloudera.[18] In May 2015 Capgemini announced a marketing program for SAP HANA and Cloudera.[19] In July 9, 2015 Cloudera a partnership with Teradata.[20] In September 2015 Cloudera announced the Kudu file format.[21] In September 2015 Microsoft Azure announced full support of Cloudera Enterprise.[22]

In January 2016 Tata Consultancy Services announced an Internet of things framework based on Cloudera for sensor data analytics.[23] In February 2016 EMC announces evolution in advanced storage with DSSD support for Cloudera[24]

In 2016, Cloudera was ranked #5 on the Forbes Cloud 100 list.[25]

Products and services[edit]

Cloudera offers software, services and support in three bundles:

  • Cloudera Enterprise includes CDH and an annual subscription license (per node) to Cloudera Manager and technical support. It comes in three editions: Basic, Flex, and Data Hub.
  • Cloudera Express includes CDH and a version of Cloudera Manager lacking enterprise features such as rolling upgrades and backup/disaster recovery, LDAP and SNMP integration.
  • CDH may be downloaded from Cloudera's website at no charge, but with no technical support nor Cloudera Manager.
  • Cloudera Navigator - provides data governance for Hadoop, offering capabilities such as data discovery, continuous optimization, audit, lineage, metadata management, and policy enforcement. As part of Cloudera Enterprise, Cloudera Navigator helps meet regulatory compliance requirements.[26]
  • Cloudera Navigator Optimizer (beta) - A software as a service tool to provide insights into workloads and recommend optimization strategies.[27]

CDH contains the main, core elements of Hadoop that provide reliable, scalable distributed data processing of large data sets (chiefly MapReduce and HDFS), as well as other enterprise-oriented components that provide security, high availability, and integration with hardware and other software.[28]

In October 2012, Cloudera announced the Cloudera Impala project, an open-source distributed query engine for Apache Hadoop.[29]

Awards[edit]

  • In April 2010, Chief Scientist Jeff Hammerbacher was named a "Best Young Tech Entrepreneur" by Bloomberg BusinessWeek.[30]
  • In June 2012, received Morgan Stanley's "CTO Award for Innovation".[31]
  • In August 2012, CRN named Cloudera among the "The 25 Coolest Emerging Vendors For 2012".[32]

References[edit]

  1. ^ "About Cloudera". Cloudera. 
  2. ^ "Apache Software Foundation Sponsorship". Retrieved 28 August 2012. 
  3. ^ Vance, Ashlee (16 March 2009). "Bottling the Magic Behind Google and Facebook". The New York Times. Retrieved 20 January 2014. 
  4. ^ "This Former Yahoo-er's Startup Is So Hot, Even the CIA Invested In It". Archived from the original on February 9, 2012. Retrieved 28 August 2012. 
  5. ^ "This Tech Bubble Is Different". Retrieved 28 August 2012. 
  6. ^ "Bloomberg Business Week, Executive Profile Kirk Dunn". Retrieved 30 September 2012. 
  7. ^ Wauters, Robin (16 March 2009). "Cloudera Raises $5 Million Series A Round For Hadoop Commercialization". TechCrunch. Retrieved 22 April 2010. 
  8. ^ "Hadoop-based startup Cloudera raises $40M from Ignition Partners, Accel, Greylock". Retrieved 28 August 2012. 
  9. ^ Timothy Prickett Morgan (20 June 2013). "Cloudera taps new CEO for inevitable IPO push or acquisition: Former CEO becomes chairman and chief strategist". The Register. Retrieved 20 January 2014. 
  10. ^ Noel Randewich (31 March 2014). "Intel invested $740 million to buy 18 percent of Cloudera". Reuters. 
  11. ^ "Oracle Selects Cloudera to Provide Apache Hadoop Distribution and Tools for Oracle Big Data Appliance". 
  12. ^ Dell us. "Dell Apache Hadoop Solutions - Dell". Dell. 
  13. ^ "Intel invested $740 million to buy 18 percent of Cloudera". 
  14. ^ "How the SAS and Cloudera Platforms Work Together". Cloudera Engineering Blog. 
  15. ^ "Accenture Forms Alliance with Cloudera to Empower Enterprises with Data as a Platform Offering". 
  16. ^ "cloudera strengthens hadoop security with acquisition of gazzang". Cloudera. 
  17. ^ "cloudera enterprise certified for full pci compliance". Cloudera. 
  18. ^ "cloudera deloitte announce strategic alliance". Cloudera. 
  19. ^ "Insights-Driven Operations with SAP HANA and Cloudera Enterprise". Capgemini Capgemini Worldwide. 
  20. ^ "Cloudera and Teradata Announce Integrated, Enterprise-Ready Appliance for Hadoop". Cloudera. 
  21. ^ "Kudu: New Apache Hadoop Storage for Fast Analytics on Fast Data". Cloudera Engineering Blog. 
  22. ^ "Full support of Cloudera Enterprise on Azure". Microsoft. 
  23. ^ "TCS Sensor Data Analytics IoT Framework with Cloudera". Cloudera. 
  24. ^ "EMC DSSD and Cloudera Evolve Hadoop: Innovating to deliver high-performance enterprise analytics on Hadoop" (PDF). EMC Corporation. 2016. Retrieved April 25, 2016. 
  25. ^ "Forbes Cloud 100". Forbes. Retrieved 28 October 2016. 
  26. ^ "Cloudera Navigator". Cloudera. 
  27. ^ "Cloudera Optimizer". 
  28. ^ Henschen, Doug (6 June 2012). "Cloudera Releases Next-Generation Hadoop Platform". InformationWeek. Retrieved 22 April 2010. 
  29. ^ Brust, Andrew (25 October 2012). "Cloudera's Impala brings Hadoop to SQL and BI". ZDNet. Retrieved 20 January 2014. 
  30. ^ "Best Young Technology Entrepreneurs 2010". Archived from the original on July 7, 2012. Retrieved 28 August 2012. 
  31. ^ "Cloudera Honored by Morgan Stanley With Prestigious 'CTO Award for Innovation'". Retrieved 28 August 2012. [dead link]
  32. ^ "The 25 Coolest Emerging Vendors For 2012". Retrieved 28 August 2012.