Vertica

From Wikipedia, the free encyclopedia
  (Redirected from HP Vertica)
Jump to: navigation, search
Vertica
Industry Enterprise Software & Database Management & Data Warehousing
Founded 2005
Founder Andrew Palmer and Michael Stonebraker
Headquarters Cambridge, MA
Key people
  • Colin Mahony (SVP and General Manager)
  • Joy King (VP of Product Marketing & Product Management)
Misha Davidson (Director of Engineering)
Products Vertica Analytics Platform Enterprise Edition, Vertica SQL on Hadoop, Vertica Analytics Platform Community Edition
Parent Micro Focus
Website www.vertica.com

Vertica Systems was an analytic database management software company.[1][2] Vertica was founded in 2005 by database researcher Michael Stonebraker, and Andrew Palmer. Former CEOs include Ralph Breslauer and Christopher P. Lynch.

Vertica was acquired by Hewlett Packard on March 22, 2011.[3][4] The acquisition expanded the HP Software software portfolio for enterprise companies and the public sector group.[5] On September 1, 2017, it was merged with Micro Focus

Products[edit]

The column-oriented Vertica Analytics Platform was designed to manage large, fast-growing volumes of data and provide very fast query performance when used for data warehouses and other query-intensive applications. The product claims to drastically improve query performance over traditional relational database systems, provide high-availability, and exabyte scalability on commodity enterprise servers. Vertica is infrastructure independent, supporting deployments on multiple cloud platforms (AWS, Google, Azure) , on premise and natively on Hadoop nodes.

Its design features include:

  • Column-oriented storage organization, which increases performance of sequential record access at the expense of common transactional operations such as single record retrieval, updates, and deletes.[6]
  • Massively Parallel Processing (MPP) architecture to distribute queries on independend nodes and scale performance linearly.
  • Standard SQL interface with many analytics capabilities built-in, such as time series gap filling/interpolation, event-based windowing and sessionization, pattern matching, event series joins, statistical computation (e.g., regression analysis), and geospatial analysis.
  • In database Machine Learning including categorization, over fitting and prediction to enhance processing speed by eliminating the need for down-sampling and data movement. Vertica offers a variety of in database algorithms including linear regression, logistic regression, K-means, Naive Bayes, Random Forest & Support Vector machines. Vertica 9 also allows deployment of ML models to multiple clusters, a significant advantage for software developers who embed Vertica analytics.
  • Compression, which reduces storage costs and I/O bandwidth. High compression is possible because columns of homogeneous datatype are stored together and because updates to the main store are batched.[7]
  • Shared-nothing architecture, which reduces system contention for shared resources and allows gradual degradation of performance in the face of hardware failure.
  • Easy to use and maintain through automated workload management, data replication, server recovery, query optimization, and storage optimization.
  • Native integration with open source big data technologies like Apache Kafka and Apache Spark.
  • Support for standard programming interfaces ODBC, JDBC, ADO.NET, and OLEDB.
  • High-performance and parallel data transfer to statistical tools such as built-in machine learning algorithms based on R, and the ability to store machine learning models, and use them for in-database scoring.[8][9]

Vertica's specialized approach aims to significantly increase query performance in data warehouses, while reducing the total cost of ownership by reducing the hardware footprint. One example of a use case detailed in a research paper shows a performance improvement of hundreds of times with Vertica in a specific application due to the use of the vertical DBMS approach.[10]

In late 2011, the Vertica Analytics Platform Community Edition was made available for free with certain limitations, such as a maximum of one terabyte of raw data, three-node (servers) cluster, and community-based support.[11]

Optimizations[edit]

The Vertica Analytics Platform runs on cluster of Linux-based commodity servers. It is also available on the Amazon Elastic Compute Cloud , Microsoft Azure and the Google Cloud Platform, ensuring no infrastructure or platform lock in. The product integrates with Hadoop[12] to leverage HDFS via External Tables with ORC and Parquet Readers and can be installed on Hadoop nodes in a co-located manner as Vertica for SQL on Hadoop (a separate offering, priced by per node). These combined capabilities allow users to analyze their data in the right place, including across multiple data lakes.

A range of BI, data visualization, and ETL tools are certified to work with and integrate with the Vertica Analytics Platform. Vertica also offers a certified and secure interface with the popular Kafka message bus, allowing streaming data ingestion. This capability combined with Vertica's high performance analytics supports use cases like Internet of Things, Edge Analytics and near real time Fraud Prevention. The Vertica website lists many of these.

Several of Vertica’s features were originally prototyped within the C-Store column-oriented database, an academic open source research project at MIT and other universities. The system's architecture is described in a 2012 VLDB paper.[13]

Versions and documentation[edit]

  • Vertica Analytics Platform 8.1.x[14]
  • Vertica Analytics Platform 8.0.x[15]
  • Vertica Analytics Platform 7.2.x[16]
  • Vertica Analytics Platform 7.1.x[17]
  • Vertica Analytics Platform 7.0.x[18]
  • Vertica Analytics Platform 6.1.x[19]
  • Vertica 6.0.x Enterprise Edition[20]
  • Vertica 5.1 Enterprise Edition[21]
  • Vertica Enterprise Edition 5.0[22]
  • Vertica Enterprise Edition 4.1[23]

Company events[edit]

In January 2008, Sybase filed a patent-infringement lawsuit against Vertica.[24] In January 2010, Vertica prevailed in a preliminary hearing,[25] and in June, 2010, Sybase and Vertica resolved the suit, with the court dismissing all infringement claims.[26] Under the leadership of Colin Mahony, Vertica has sponsored various technological events in the database industry.[27]

In August 2013, Vertica held its first Big Data conference[28] event in Boston, MA USA. This event was held again in 2014, 2015, 2016, and 2017.

In 2016, Vertica published its first O'Reilly book, The Big Data Transformation - Understanding Why Change is Actually Good for Your Business.

See also[edit]

References[edit]

  1. ^ Network World staff: "New database company raises funds, nabs ex-Oracle bigwigs”, [1] LinuxWorld, February 14, 2007
  2. ^ Brodkin, J: "10 enterprise software companies to watch", [2] Network World, April 11, 2007
  3. ^ HP News Release: “HP to Acquire Vertica: Customers Can Analyze Massive Amounts of Big Data at Speed and Scale” Feb. 2011
  4. ^ HP News Release: “HP Completes Acquisition of Vertica Systems, Inc.” March 22, 2011.
  5. ^ ComputerWorld.com: “Update: HP to buy Vertica for analytics.” Kanaracus. Feb. 2011.
  6. ^ Monash, C: "Are row-oriented RDBMS obsolete?" [3] DBMS2, January 22, 2007
  7. ^ Monash, C: "Mike Stonebraker on database compression – comments”,[4]DBMS2, March 24, 2007
  8. ^ Gagliordi, Natalie. "HP adds scale to open-source R in latest big data platform". ZDNet. Retrieved 17 February 2015. 
  9. ^ Prasad, Shreya; Fard, Arash; Gupta, Vishrut; Martinez, Jorge; LeFevre, Jeff; Xu, Vincent; Hsu, Meichun; Roy, Indrajit (2015). "Enabling predictive analytics in Vertica: Fast data transfer, distributed model creation and in-database prediction". ACM SIGMOD International Conference on Management of Data (SIGMOD). 
  10. ^ One Size Fits All? Part 2: Benchmarking Results (sect. 3.1)
  11. ^ "Vertica Announces Community Edition Version of Vertica Analytic Database". Archived from the original on July 4, 2015. Retrieved August 17, 2016. 
  12. ^ "Vertica-Hadoop integration". DBMS2. October 12, 2010. 
  13. ^ "The Vertica Analytic Database: C-Store 7 Years Later" (PDF). VLDB. August 28, 2012. 
  14. ^ Documentation https://my.vertica.com/docs/8.1.x/HTML/index.htm
  15. ^ Documentation https://my.vertica.com/docs/8.0.x/HTML/index.htm
  16. ^ Documentation https://my.vertica.com/docs/7.2.x/HTML/index.htm
  17. ^ Documentation https://my.vertica.com/docs/7.1.x/HTML/index.htm
  18. ^ Documentation https://my.vertica.com/docs/7.0.x/HTML/index.htm
  19. ^ Documentation https://my.vertica.com/docs/6.1.x/HTML/index.htm
  20. ^ Documentation http://www.vertica.com/documentation/hp-vertica-documentation-6-0-x/
  21. ^ Documentation http://www.vertica.com/documentation/hp-vertica-5-1-x-enterprise-edition-product-documentation/
  22. ^ Documentation http://www.vertica.com/documentation/hp-vertica-enterprise-edition-5-0-product-documentation/
  23. ^ Documentation http://www.vertica.com/documentation/hp-vertica-documentation-5-1/
  24. ^ Sybase, Inc. v. Vertica Systems, Inc. (Texas Eastern District Court January 30, 2008). Text
  25. ^ Monash, C: "Vertica slaughters Sybase in patent litigation”,[5]DBMS2, January 14, 2010
  26. ^ Vertica Press Release, "Vertica Resolves Sybase Patent Lawsuits" http://www.vertica.com/news/press/vertica-resolves-sybase-patent-lawsuits/
  27. ^ http://www.vertica.com/news/events/
  28. ^ HP Vertica Big Data Conference 2013 http://www.vertica.com/hp-vertica-big-data-conference-2013/

External link[edit]