Jump to content

eBPF

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Yonkeltron (talk | contribs) at 03:12, 27 June 2023 (Mostly adding links.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

eBPF
Original author(s)Alexei Starovoitov,
Daniel Borkmann[1][2]
Developer(s)Open source community, Meta, Google, Isovalent, Microsoft, Netflix[1]
Initial release2014; 10 years ago (2014)[3]
RepositoryLinux: git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/
Windows: github.com/Microsoft/ebpf-for-windows/
Written inC
Operating systemLinux, Windows[4]
TypeRuntime system, Sandboxing
LicenseLinux: GPL
Windows: MIT License
Websiteebpf.io

eBPF (sometimes referred to by the acronym BPF, standing for Berkeley Packet Filter)[2][5] is a technology that can run sandboxed programs in a privileged context such as the operating system kernel.[6] It is used to safely and efficiently extend the capabilities of the kernel at runtime without requiring to change kernel source code or load kernel modules.[7] Safety is provided through an in-kernel verifier which performs static code analysis and rejects programs which crash, hang or otherwise interfere with the kernel negatively.[8][9] Examples of programs that are automatically rejected are programs without strong exit guarantees (i.e. for/while loops without exit conditions) and programs dereferencing pointers without safety-checks.[10] Loaded programs which passed the verifier are either interpreted or in-kernel JIT compiled for native execution performance. The execution model is event-driven and with few exceptions run-to-completion,[2] meaning, programs can be attached to various hook points in the operating system kernel and are run upon triggering of an event. eBPF use cases include (but are not limited to) networking such as XDP, tracing and security subsystems.[6] Given eBPF's efficiency and flexibility opened up new possibilities to solve production issues, Brendan Gregg famously coined eBPF as "superpowers for Linux".[11] Linus Torvalds expressed that "BPF has actually been really useful, and the real power of it is how it allows people to do specialized code that isn't enabled until asked for".[12] Due to its success in Linux, the eBPF runtime has been ported to other operating systems such as Windows.[4]

History

Evolution from classic BPF

eBPF was built on top of the Berkeley Packet Filter (cBPF). At the lowest level, it introduced the use of ten 64-bit registers (instead of two 32-bit long registers for cBPF), different jump semantics, a call instruction and corresponding register passing convention, new instructions, and a different encoding for these instructions.[13] A number of additional features were subsequently added. The evolution of eBPF took many years and a large community of contributors, and is still ongoing. The table below summarizes some of the most significant milestones of this evolution:

Most significant milestones in the evolution of eBPF
Date Event
April 2011 The first in-kernel Linux JIT compiler for the classic Berkeley Packet Filter got merged.[14]
January 2012 The first non-networking use case of the classic Berkeley Packet Filter, seccomp-bpf,[15] appeared; it allows filtering of system calls using a configurable policy implemented through BPF instructions.
March 2014 David S. Miller, primary maintainer of the Linux networking stack, accepted the rework of the old in-kernel BPF interpreter. It was replaced by an eBPF interpreter and the Linux kernel internally translates classic BPF (cBPF) into eBPF instructions.[16]
March 2015 The ability to attach eBPF to kprobes as first tracing use case was merged.[18] In the same month, initial infrastructure work got accepted to attach eBPF to the networking traffic control (tc) layer allowing to attach eBPF to the core ingress and later also egress paths of the network stack, later heavily used by projects such as Cilium.[19][20][21]
August 2015 The eBPF compiler backend got merged into LLVM 3.7.0 release.[22]
September 2015 Brendan Gregg announced a collection of new eBPF-based tracing tools as the bcc project, providing a front-end for eBPF to make it easier to write programs.[23]
July 2016 eBPF got the ability to be attached into network driver's core receive path. This layer is known today as eXpress DataPath (XDP) and was added as a response to DPDK to create a fast data path which works in combination with the Linux kernel rather than bypassing it.[24][25][26]
August 2016 Cilium was initially announced during LinuxCon as a project providing fast IPv6 container networking with eBPF and XDP. Today, Cilium has been adopted by major cloud provider's Kubernetes offerings and is one of the most widely used CNIs.[27][21][28]
November 2016 Netronome added offload of eBPF programs for XDP and tc BPF layer to their NIC.[29]
May 2017 Meta's layer 4 load-balancer, Katran, went live. Every packet towards facebook.com since then has been processed by eBPF & XDP.[30]
November 2017 eBPF becomes its own kernel subsystem to ease the continuously growing kernel patch management. The first pull request by eBPF maintainers was submitted.[31]
September 2017 Bpftool was added to the Linux kernel as a user space utility to introspect the eBPF subsystem.[32]
January 2018 A new socket family called AF_XDP was published, allowing for high performance packet processing with zero-copy semantics at the XDP layer.[33] Today, DPDK has an official AF_XDP poll-mode driver support.[34]
February 2018 The bpfilter prototype has been published, allowing translation of a subset of iptables rulesets into eBPF via a newly developed user mode driver. The work has caused controversies due to the ongoing nftables development effort and has not been merged into mainline.[35][36]
October 2018 The new bpftrace tool has been announced by Brendan Gregg as DTrace 2.0 for Linux.[37]
November 2018 eBPF introspection has been added for kTLS in order to support the ability for in-kernel TLS policy enforcement.[38]
November 2018 BTF (BPF Type Format) has been added to the Linux kernel as an efficient meta data format which is approximately 100x smaller in size than DWARF.[39]
December 2019 The first 880-page long book on BPF, written by Brendan Gregg, was released.[40]
March 2020 Google upstreamed BPF LSM support into the Linux kernel, enabling programmable LSMs through eBPF.[41]
September 2020 The eBPF compiler backend for GCC was merged.[42]

Adoption

eBPF has been adopted by a number of large-scale production users, for example:

The bee is the official logo for eBPF. At the first eBPF Summit there was a vote taken and the bee mascot was named "eBee".[77][78] The logo has originally been created by Vadim Shchekoldin.[78] Earlier unofficial eBPF mascots have existed in the past,[79] but haven't seen widespread adoption.

Naming

There has been controversy around the naming of eBPF. The alias eBPF is often interchangeably used with BPF, for example by the Linux kernel community. eBPF and BPF is referred to as a technology name like LLVM.[2] eBPF evolved from the Berkeley Packet Filter as an extended version, but its use case outgrew networking, and today eBPF as a pseudo-acronym is preferred.[2]

eBPF Foundation

The eBPF Foundation was created in August 2021 with the goal to expand the contributions being made to extend the powerful capabilities of eBPF and grow beyond Linux.[1] Founding members include Meta, Google, Isovalent, Microsoft and Netflix. The purpose is to raise, budget and spend funds in support of various open source, open data and/or open standards projects relating to eBPF technologies[80] to further drive the growth and adoption of the eBPF ecosystem. Since inception, Red Hat, Huawei, Crowdstrike, Tigera, DaoCloud, Datoms, FutureWei also joined.[81]

Steering committee

With the creation of the eBPF Foundation,[1] an eBPF steering committee (BSC) was established in order to take care of the technical direction and vision of eBPF.[82] Tasks include the collaboration among projects, defining the minimal requirements of eBPF runtimes, overseeing community events, maintaining eBPF technical project lifecycle procedures, and communicating on behalf of the eBPF community.

Active members include:[82]

There are currently no emeritus members.

eBPF standardization

Although eBPF is supported, to various degrees, on multiple platforms, there is no standard specification (as of January 2023) to formally define its components. However, there is currently some work in progress to define and publish a standard for the instruction set, under the auspices of the eBPF Foundation.[83]

Security concerns

Due to the ease of programmability, eBPF has been used as a tool for implementing microarchitectural timing side-channel attacks such as Spectre against vulnerable microprocessors.[84] While unprivileged eBPF implemented mitigations against transient execution attacks,[85] unprivileged use has ultimately been disabled by the kernel community by default to protect from use against future hardware vulnerabilities.[86]

Conferences

The eBPF community organises a number of technical workshops and conferences to discuss ongoing research, development efforts, and use cases around eBPF. They can broadly be categorised into user-focused conferences and more developer-focused conferences.

User-focused conferences:

Developer-focused conferences:

See also

References

  1. ^ a b c d "Meta, Google, Isovalent, Microsoft and Netflix Launch eBPF Foundation as Part of the Linux Foundation". Linux Foundation. 12 August 2021. Retrieved 1 July 2022.
  2. ^ a b c d e "BPF Internals". USENIX LISA 2021 conference. 1 June 2021. Retrieved 1 July 2022.
  3. ^ "eBPF and Kubernetes: Little Helper Minions for Scaling Microservices". CNCF KubeCon + CloudNativeCon Europe 2020. 19 August 2020. Retrieved 1 July 2022.
  4. ^ a b "Making eBPF work on Windows". Microsoft Open Source Blog. 10 May 2021. Retrieved 1 July 2022.
  5. ^ Brendan Gregg (December 2019). BPF Performance Tools. ISBN 978-0136554820.
  6. ^ a b "eBPF Documentation: What is eBPF?". eBPF.io. Retrieved 1 July 2022.
  7. ^ "eBPF - Rethinking the Linux Kernel". QCon 2020. Retrieved 1 July 2022.
  8. ^ "Safe Programs The Foundation of BPF". eBPF Summit 2021. 8 November 2020. Retrieved 1 July 2022.
  9. ^ "BPF and Spectre: Mitigating transient execution attacks". POPL 2022 conference. 22 January 2022. Retrieved 1 July 2022.
  10. ^ Hedam, Niclas (2021). "eBPF - From a Programmer's Perspective" (PDF). doi:10.13140/RG.2.2.33688.11529/3. {{cite journal}}: Cite journal requires |journal= (help)
  11. ^ "Linux BPF Superpowers". Brendan Gregg's Blog. 5 March 2016. Retrieved 1 July 2022.
  12. ^ "Linus Torvalds talks about coming back to work on Linux". zdnet Interview with Linus Torvalds. 23 October 2018. Retrieved 1 July 2022.
  13. ^ "Classic BPF vs eBPF". LWN. March 2014. Retrieved 6 January 2023.
  14. ^ "net: filter: Just In Time compiler". lore.kernel.org. April 2011. Retrieved 1 July 2022.
  15. ^ "Yet another new approach to seccomp". LWN. 1 January 2012. Retrieved 1 July 2022.
  16. ^ "BPF updates". lore.kernel.org. March 2014. Retrieved 1 July 2022.
  17. ^ "Happy birthday BPF!". lore.kernel.org. September 2014. Retrieved 1 July 2022.
  18. ^ "tracing: attach eBPF programs to kprobes". lore.kernel.org. March 2015. Retrieved 1 July 2022.
  19. ^ "eBPF support for cls_bpf". lore.kernel.org. March 2015. Retrieved 1 July 2022.
  20. ^ "net, sched: add clsact qdisc". lore.kernel.org. January 2016. Retrieved 1 July 2022.
  21. ^ a b "eBPF-based Networking, Observability, Security". cilium.io. January 2016. Retrieved 1 July 2022.
  22. ^ "LLVM 3.7 Release Notes". releases.llvm.org. August 2015. Retrieved 1 July 2022.
  23. ^ "bcc: Taming Linux 4.3+ Tracing Superpowers". brendangregg.com. September 2015. Retrieved 1 July 2022.
  24. ^ "Add driver bpf hook for early packet drop and forwarding". lore.kernel.org. July 2016. Retrieved 1 July 2022.
  25. ^ "eCHO episode 9: XDP and Load Balancing". youtube.com. June 2021. Retrieved 1 July 2022.
  26. ^ "The eXpress Data Path: Fast Programmable Packet Processing in the Operating System Kernel". acm.org. December 2018. Retrieved 1 July 2022.
  27. ^ "Cilium - Fast IPv6 Container Networking with BPF and XDP". slideshare.net. August 2016. Retrieved 1 July 2022.
  28. ^ a b "New GKE Dataplane V2 increases security and visibility for containers". cloud.google.com. May 2021. Retrieved 16 August 2022.
  29. ^ "nfp ring reconfiguration and XDP support". lore.kernel.org. November 2016. Retrieved 1 July 2022.
  30. ^ a b "XDP 1.5 Years In Production. Evolution and Lessons Learned". lpc.events. November 2018. Retrieved 16 August 2022.
  31. ^ "pull-request: bpf 2017-11-23". lore.kernel.org. November 2017. Retrieved 1 July 2022.
  32. ^ "tools: add bpftool". lore.kernel.org. September 2017. Retrieved 1 July 2022.
  33. ^ "Introducing AF_XDP support". lore.kernel.org. January 2018. Retrieved 1 July 2022.
  34. ^ "AF_XDP Poll Mode Driver". doc.dpdk.org. August 2022. Retrieved 16 August 2022.
  35. ^ "BPF comes to firewalls". lwn.net. February 2018. Retrieved 1 July 2022.
  36. ^ "Why is the kernel community replacing iptables with BPF?". cilium.io. April 2018. Retrieved 1 July 2022.
  37. ^ "bpftrace (DTrace 2.0) for Linux 2018". brendangregg.com. October 2018. Retrieved 16 August 2022.
  38. ^ "Combining kTLS and BPF for Introspection and Policy Enforcement" (PDF). vger.kernel.org. November 2018. Retrieved 1 July 2022.
  39. ^ "BTF deduplication and Linux kernel BTF". nakryiko.com. November 2018. Retrieved 1 July 2022.
  40. ^ "BPF Performance Tools (book)". brendangregg.com. December 2019. Retrieved 16 August 2022.
  41. ^ "MAC and Audit policy using eBPF (KRSI)". lore.kernel.org. March 2020. Retrieved 16 August 2022.
  42. ^ "BPF in GCC". lwn.net. September 2020. Retrieved 16 August 2022.
  43. ^ "Open-sourcing Katran, a scalable network load balancer". fb.com. May 2018. Retrieved 16 August 2022.
  44. ^ "BPF at Facebook". youtube.com. December 2019. Retrieved 16 August 2022.
  45. ^ "From XDP to socket". lpc.events. September 2021. Retrieved 16 August 2022.
  46. ^ "eCHO episode 29: BPF LSM with KP Singh". youtube.com. November 2021. Retrieved 16 August 2022.
  47. ^ "BPF security auditing at Google - Brendan Jackman/KP Singh". youtube.com. November 2021. Retrieved 16 August 2022.
  48. ^ "Replacing HTB with EDT and BPF". netdevconf.info. July 2020. Retrieved 16 August 2022.
  49. ^ "Cloudflare architecture and how BPF eats the world". blog.cloudflare.com. May 2019. Retrieved 16 August 2022.
  50. ^ "It's crowded in here!". blog.cloudflare.com. October 2019. Retrieved 16 August 2022.
  51. ^ "Production ready eBPF, or how we fixed the BSD socket API". blog.cloudflare.com. February 2022. Retrieved 16 August 2022.
  52. ^ "Live-patching security vulnerabilities inside the Linux kernel with eBPF Linux Security Module". blog.cloudflare.com. June 2022. Retrieved 16 August 2022.
  53. ^ "Unimog - Cloudflare's edge load balancer". blog.cloudflare.com. September 2020. Retrieved 16 August 2022.
  54. ^ "How Netflix uses eBPF flow logs at scale for network insight". netflixtechblog.com. June 2021. Retrieved 16 August 2022.
  55. ^ "Extending Vector with eBPF to inspect host and container performance". netflixtechblog.com. February 2019. Retrieved 16 August 2022.
  56. ^ "Dropbox traffic infrastructure: Edge network". dropbox.tech. October 2018. Retrieved 16 August 2022.
  57. ^ "eBPF Traffic Monitoring". source.android.com. August 2022. Retrieved 16 August 2022.
  58. ^ "Extending the Kernel with eBPF". source.android.com. August 2022. Retrieved 16 August 2022.
  59. ^ "NAT46 translation with BPF". lore.kernel.org. April 2022. Retrieved 16 August 2022.
  60. ^ "How Does Alibaba Cloud Build High-Performance Cloud-Native Pod Networks in Production Environments?". alibabacloud.com. September 2020. Retrieved 16 August 2022.
  61. ^ "Datadog on eBPF". datadogon.datadoghq.com. February 2021. Retrieved 16 August 2022.
  62. ^ "Runtime Security Monitoring with eBPF" (PDF). sstic.org. February 2021. Retrieved 16 August 2022.
  63. ^ "Our eBPF Journey at Datadog - Laurent Bernaille & Tabitha Sable, Datadog". youtube.com. November 2020. Retrieved 16 August 2022.
  64. ^ "User Story - How Trip.com uses Cilium". cilium.io. February 2020. Retrieved 16 August 2022.
  65. ^ "Trip.com: Stepping into Cloud Native Networking Era with Cilium+BGP". arthurchiao.art. November 2020. Retrieved 16 August 2022.
  66. ^ "Making eBPF work on Windows". cloudblogs.microsoft.com. May 2021. Retrieved 16 August 2022.
  67. ^ "Getting Linux based eBPF programs to run with eBPF for Windows". cloudblogs.microsoft.com. February 2022. Retrieved 16 August 2022.
  68. ^ "Progress on making eBPF work on Windows". cloudblogs.microsoft.com. November 2019. Retrieved 16 August 2022.
  69. ^ "Cilium Standalone Layer 4 Load Balancer XDP". cilium.io. July 2022. Retrieved 16 August 2022.
  70. ^ "Building a Secure and Maintainable PaaS - Bradley Whitfield, Capital One". youtube.com. November 2020. Retrieved 16 August 2022.
  71. ^ "Think eBPF for Kernel Security Monitoring - Falco at Apple- Eric Sage & Melissa Kilby, Apple". youtube.com. October 2021. Retrieved 16 August 2022.
  72. ^ "eBPF & Cilium at Sky – Sebastian Duff, Anthony Comtois, Jospeh Samuel, Sky". youtube.com. August 2021. Retrieved 16 August 2022.
  73. ^ "Running and orchestrating multiple XDP and TC programs – Brian Merrell, Walmart". youtube.com. August 2021. Retrieved 16 August 2022.
  74. ^ "High Performance Load Balancing @Walmart – Kanthi Pavuluri & Karan Dalal, Walmart". youtube.com. August 2021. Retrieved 16 August 2022.
  75. ^ "DIGLIM eBPF: secure boot at application level with minimal changes to distros - Roberto Sassu". youtube.com. August 2022. Retrieved 16 August 2022.
  76. ^ "IKEA Private Cloud, eBPF Based Networking, Load Balancing, and Observability with... Karsten Nielsen". youtube.com. May 2022. Retrieved 16 August 2022.
  77. ^ "eBPF Summit Day Two". cilium.io. October 2020. Retrieved 1 July 2022.
  78. ^ a b "What is the bee named?". ebpf.io. Retrieved 1 July 2022.
  79. ^ "eBPF: One Small Step". Brendan Gregg's Blog. May 2015. Retrieved 1 July 2022.
  80. ^ "eBPF Foundation Charter". ebpf.foundation. June 2021. Retrieved 16 August 2022.
  81. ^ "eBPF Foundation Governance". ebpf.foundation. August 2022. Retrieved 16 August 2022.
  82. ^ a b "eBPF Steering Committee (BSC), eBPF Foundation". Linux Foundation. Retrieved 1 July 2022.
  83. ^ "eBPF Standardization". lpc.events. September 2022. Retrieved 6 January 2023.
  84. ^ "Reading privileged memory with a side-channel". googleprojectzero.blogspot.com. Retrieved 16 August 2022.
  85. ^ "BPF and Spectre: Mitigating transient execution attacks". popl22.sigplan.org. Retrieved 16 August 2022.
  86. ^ "bpf: Disallow unprivileged bpf by default". kernel.org. Retrieved 16 August 2022.
  87. ^ "eBPF Summit 2020". eBPF.io. August 2020. Retrieved 1 July 2022.
  88. ^ "eBPF Summit 2021". eBPF.io. August 2021. Retrieved 1 July 2022.
  89. ^ "eBPF Summit 2022". eBPF.io. August 2022. Retrieved 1 August 2022.
  90. ^ "Cloud Native eBPF Day North America 2021". linuxfoundation.org. October 2021. Retrieved 1 January 2022.
  91. ^ "Cloud Native eBPF Day Europe 2022". linuxfoundation.org. May 2022. Retrieved 1 July 2022.
  92. ^ "LSF/MM/BPF, bpfconf 2019". kernel.org. April 2019. Retrieved 1 July 2022.
  93. ^ "LSF/MM/BPF, bpfconf 2022". kernel.org. May 2022. Retrieved 1 July 2022.
  94. ^ "LPC 2017, Tracing & BPF micro-conference". lpc.events. September 2017. Retrieved 1 July 2022.
  95. ^ "LPC 2018, BPF micro-conference". kernel.org. November 2018. Retrieved 1 July 2022.
  96. ^ "LPC 2019, BPF micro-conference". lpc.events. September 2019. Retrieved 1 July 2022.
  97. ^ "LPC 2020, Networking and BPF Summit". lpc.events. August 2020. Retrieved 1 July 2022.
  98. ^ "LPC 2021, BPF & Networking Summit". lpc.events. September 2021. Retrieved 1 July 2022.
  99. ^ "LPC 2022, eBPF & Networking Summit". lpc.events. September 2022. Retrieved 16 August 2022.

Further reading