Apache Samza

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 186.87.8.121 (talk) at 19:31, 28 May 2020 (→‎External links: This isn't Microsoft Software, this is Apache Software, it also has never been Microsoft Software as it was given to Apache by LinkedIn before Microsoft bought LinkedIn). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Apache Samza
Original author(s)LinkedIn
Developer(s)Apache Software Foundation
Stable release
1.2.0 / 11 June 2019; 4 years ago (2019-06-11)
RepositorySamza Repository
Written inScala, Java
Operating systemCross-platform
TypeDistributed stream processing
LicenseApache License 2.0
Websitesamza.apache.org

Apache Samza is an open-source, near-realtime, asynchronous computational framework for stream processing developed by the Apache Software Foundation in Scala and Java. It has been developed in conjunction with Apache Kafka. Both were originally developed by LinkedIn.[1]

Overview

Samza allows users to build stateful applications that process data in real-time from multiple sources including Apache Kafka.

Samza provides fault tolerance, isolation and stateful processing. Unlike batch systems such as Apache Hadoop or Apache Spark, it provides continuous computation and output, which result in sub-second[2] response times.

There are many players in the field of real-time stream processing and Samza is one of the mature products.[3][4][5] It was added to Apache in 2013.[6]

Samza is used by multiple companies.[7] The biggest installation is in LinkedIn.

See also

References

  1. ^ "How LinkedIn Uses Apache Samza". InfoQ. Retrieved 2016-09-28.
  2. ^ "Samza: Stateful Scalable Stream Processing at LinkedIn" (PDF).
  3. ^ "Spark Streaming vs Flink vs Storm vs Kafka Streams vs Samza : Choose Your Stream Processing Framework". www.linkedin.com. Retrieved 2019-07-23.
  4. ^ "Comparing Apache Spark, Storm, Flink and Samza stream processing engines - Part 1". Scott Logic. Retrieved 2019-07-23.
  5. ^ "Hadoop, Storm, Samza, Spark, and Flink: Big Data Frameworks Compared". DigitalOcean. Retrieved 2019-07-23.
  6. ^ "Apache Samza". blogs.apache.org. Retrieved 2019-07-23.
  7. ^ "Samza - Powered By". samza.apache.org. Retrieved 2019-07-23.

External links