Scribe (log server)

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Scribe
Original author(s) Bobby Johnson
Developer(s) Facebook
Initial release October 24, 2008 (2008-10-24)
Written in C++, Python
License Apache License 2.0
Website Scribe homepage (Github)

Scribe is a server for aggregating log data streamed in real-time from a large number of servers. It is designed to be scalable, extensible without client-side modification, and robust to failure of the network or any specific machine.

Scribe was developed at Facebook using Apache Thrift and released in 2008 as open source.[1][2]

Scribe servers are arranged in a directed graph, with each server knowing only about the next server in the graph. This network topology allows for adding extra layers of fan-in as a system grows, and batching messages before sending them between datacenters, without having any code that explicitly needs to understand datacenter topology, only a simple configuration.[3]

Scribe was designed to consider reliability but to not require heavyweight protocols and expansive disk usage. Scribe spools data to disk on any node to handle intermittent connectivity node failure, but doesn't sync a log file for every message. This creates a possibility of a small amount of data loss in the event of a crash or catastrophic hardware failure. However, this degree of reliability is often suitable for most Facebook use cases.[3]

See also[edit]

Notes and references[edit]

External links[edit]