Green thread
In computer programming, a green thread is a thread that is scheduled by a runtime library or virtual machine (VM) instead of natively by the underlying operating system (OS). Green threads emulate multithreaded environments without relying on any native OS abilities, and they are managed in user space instead of kernel space, enabling them to work in environments that do not have native thread support.[1]
Etymology
[edit]Green threads refers to the name of the original thread library for Java programming language (that was released in version 1.1 and then Green threads were abandoned in version 1.3 to native threads). It was designed by The Green Team at Sun Microsystems.[2]
History
[edit]Green threads were briefly available in Java between 1997 and 2000.
Green threads share a single operating system thread through co-operative concurrency and can therefore not achieve parallelism performance gains like operating system threads. The main benefit of coroutines and green threads is ease of implementation.
Performance
[edit]This section needs to be updated.(February 2014) |
On a multi-core processor, native thread implementations can automatically assign work to multiple processors, whereas green thread implementations normally cannot.[1][3] Green threads can be started much faster on some VMs. On uniprocessor computers, however, the most efficient model has not yet been clearly determined.
Benchmarks on computers running the Linux kernel version 2.2 (released in 1999) have shown that:[4]
- Green threads significantly outperform Linux native threads on thread activation and synchronization.
- Linux native threads have slightly better performance on input/output (I/O) and context switching operations.
When a green thread executes a blocking system call, not only is that thread blocked, but all of the threads within the process are blocked.[5] To avoid that problem, green threads must use non-blocking I/O or asynchronous I/O operations, although the increased complexity on the user side can be reduced if the virtual machine implementing the green threads spawns specific I/O processes (hidden to the user) for each I/O operation.[citation needed]
There are also mechanisms which allow use of native threads and reduce the overhead of thread activation and synchronization:
- Thread pools reduce the cost of spawning a new thread by reusing a limited number of threads.[6]
- Languages which use virtual machines and native threads can use escape analysis to avoid synchronizing blocks of code when unneeded.[7]
Green threads in the Java Virtual Machine
[edit]In Java 1.1, green threads were the only threading model used by the Java virtual machine (JVM),[8] at least on Solaris. As green threads have some limitations compared to native threads, subsequent Java versions dropped them in favor of native threads.[9][10]
An exception to this is the Squawk virtual machine, which is a mixture between an operating system for low-power devices and a Java virtual machine. It uses green threads to minimize the use of native code, and to support migrating its isolates.
Kilim[11][12] and Quasar[13][14] are open-source projects which implement green threads on later versions of the JVM by modifying the Java bytecode produced by the Java compiler (Quasar also supports Kotlin and Clojure).
Green threads in other languages
[edit]There are some other programming languages that implement equivalents of green threads instead of native threads. Examples:
- Chicken Scheme uses lightweight user-level threads based on first-class continuations[15]
- Common Lisp[16]
- CPython natively supports asyncio since Version 3.4, alternative implementations exist like greenlet, eventlet and gevent, PyPy[17]
- Crystal offers fibers[18]
- D offers fibers, used for asynchronous I/O[19]
- Dyalog APL terms them threads[20]
- Erlang[21]
- Go implements so called goroutines[22]
- Haskell[22]
- Julia uses green threads for its Tasks.
- Limbo[23]
- Lua uses coroutines for concurrency. Lua 5.2 also offers true C coroutine semantics through the functions lua_yieldk, lua_callk, and lua_pcallk. The CoCo extension allows true C coroutine semantics for Lua 5.1.
- Nim provides asynchronous I/O and coroutines
- OCaml, since version 5.0, supports green threads through the Domainslib.Task module
- occam, which prefers the term process instead of thread due to its origins in communicating sequential processes
- Perl supports green threads through coroutines
- PHP supports green threads through fibers and coroutines
- Ruby before version 1.9[24]
- Racket (native threads are also available through Places[25])
- Rust supports system threads natively[26] and supports asynchronous I/O through third-party libraries like Tokio
- SML/NJ's implementation of Concurrent ML
- Smalltalk (most dialects: Squeak, VisualWorks, GNU Smalltalk, etc.)
- Stackless Python supports either preemptive multitasking or cooperative multitasking through microthreads (termed tasklets).[27]
- Tcl has coroutines and an event loop[28]
The Erlang virtual machine has what might be called green processes – they are like operating system processes (they do not share state like threads do) but are implemented within the Erlang Run Time System (erts). These are sometimes termed green threads, but have significant differences[clarification needed] from standard green threads.[citation needed]
In the case of GHC Haskell, a context switch occurs at the first allocation after a configurable timeout. GHC threads are also potentially run on one or more OS threads during their lifetime (there is a many-to-many relationship between GHC threads and OS threads), allowing for parallelism on symmetric multiprocessing machines, while not creating more costly OS threads than needed to run on the available number of cores.[citation needed]
Most Smalltalk virtual machines do not count evaluation steps; however, the VM can still preempt the executing thread on external signals (such as expiring timers, or I/O becoming available). Usually round-robin scheduling is used so that a high-priority process that wakes up regularly will effectively implement time-sharing preemption:
[
[(Delay forMilliseconds: 50) wait] repeat
] forkAt: Processor highIOPriority
Other implementations, e.g., QKS Smalltalk, are always time-sharing. Unlike most green thread implementations, QKS also supports preventing priority inversion.
Differences to virtual threads in the Java Virtual Machine
[edit]Virtual threads were introduced as a preview feature in Java 19[29] and stabilized in Java 21.[30] Important differences between virtual threads and green threads are:
- Virtual threads coexist with existing (non-virtual) platform threads and thread pools.
- Virtual threads protect their abstraction:
- Unlike with green threads, sleeping on a virtual thread does not block the underlying carrier thread.
- Working with thread-local variables is deemphasized, and scoped values are suggested as a more lightweight replacement.[31]
- Virtual threads can be cheaply suspended and resumed, making use of JVM support for the special
jdk.internal.vm.Continuation
class. - Virtual threads handle blocking calls by transparently unmounting from the carrier thread where possible, otherwise compensating by increasing the number of platform threads.
See also
[edit]- Async/await
- Light-weight process
- Coroutine
- Java virtual machine
- Global interpreter lock
- Fiber (computer science)
- GNU Portable Threads
- Protothreads
References
[edit]- ^ a b Sintes, Tony (April 13, 2001). "Four for the ages". JavaWorld. Archived from the original on 2020-07-15. Retrieved 2020-07-14.
Green threads, the threads provided by the JVM, run at the user level, meaning that the JVM creates and schedules the threads itself. Therefore, the operating system kernel doesn't create or schedule them. Instead, the underlying OS sees the JVM only as one thread. Green threads prove inefficient for a number of reasons. Foremost, green threads cannot take advantage of a multiprocessor system(...) Thus, the JVM threads are bound to run within that single JVM thread that runs inside a single processor.
{{cite web}}
: CS1 maint: bot: original URL status unknown (link) - ^ "Java Technology: The Early Years". java.sun.com. 2014-12-22. Archived from the original on 2008-05-30.
- ^ "What is the difference between "green" threads and "native" threads?". jguru.com. 2000-09-06. Retrieved 2009-06-01.
On multi-CPU machines, native threads can run more than one thread simultaneously by assigning different threads to different CPUs. Green threads run on only one CPU.
- ^ "Comparative performance evaluation of Java threads for embedded applications: Linux Thread vs. Green Thread". CiteSeerX 10.1.1.8.9238.
- ^ Stallings, William (2008). Operating Systems, Internal and Design Principles. New Jersey: Prentice Hall. p. 171. ISBN 9780136006329.
- ^ Sieger, Nick (2011-07-22). "Concurrency in JRuby". Engine Yard. Archived from the original on 2014-01-30. Retrieved 2013-01-26.
For systems with large volumes of email, this naive approach may not work well. Native threads carry a bigger initialization cost and memory overhead than green threads, so JRuby normally cannot support more than about 10,000 threads. To work around this, we can use a thread pool.
- ^ Goetz, Brian (2005-10-18). "Java theory and practice: Synchronization optimizations in Mustang". IBM. Retrieved 2013-01-26.
- ^ "Java Threads in the Solaris Environment – Earlier Releases". Oracle Corporation. Retrieved 2013-01-26.
As a result, several problems arose: Java applications could not interoperate with existing MT applications in the Solaris environment, Java threads could not run in parallel on multiprocessors, An MT Java application could not harness true OS concurrency for faster applications on either uniprocessors or multiprocessors. To substantially increase application performance, the green threads library was replaced with native Solaris threads for Java on the Solaris 2.6 platform; this is carried forward on the Solaris 7 and Solaris 8 platforms.
- ^ "Threads: Green or Native". SCO Group. Retrieved 2013-01-26.
The performance benefit from using native threads on an MP machine can be dramatic. For example, using an artificial benchmark where Java threads are doing processing independent of each other, there can be a three-fold overall speed improvement on a 4-CPU MP machine.
- ^ "Threads: Green or Native". codestyle.org. Archived from the original on 2013-01-16. Retrieved 2013-01-26.
There is a significant processing overhead for the JVM to keep track of thread states and swap between them, so green thread mode has been deprecated and removed from more recent Java implementations.
- ^ "kilim". GitHub. Retrieved 2016-06-09.
- ^ "Kilim". www.malhar.net. Retrieved 2016-06-09.
- ^ "Quasar Code on GitHub". GitHub.
- ^ "Parallel Universe". Archived from the original on 22 December 2015. Retrieved 6 December 2015.
- ^ "Chicken Scheme". Retrieved 5 November 2017.
- ^ "thezerobit/green-threads". GitHub. Retrieved 2016-04-08.
- ^ "Application-level Stackless features – PyPy 4.0.0 documentation". Retrieved 6 December 2015.
- ^ "Concurrency: GitBook". crystal-lang.org. Retrieved 2018-04-03.
- ^ "Fibers - Dlang Tour". tour.dlang.org. Retrieved 2022-05-02.
- ^ "Threads: Overview". Dyalog APL 17.0 Help. Retrieved 2018-12-14.
A thread is a strand of execution in the APL workspace.
- ^ @joeerl (23 June 2018). "Erlang processes are emulated in the Erlang VM, like Green threads - we like them since this simplifies many proble…" (Tweet) – via Twitter.
- ^ a b "Go and Dogma". research!rsc. Retrieved 2017-01-14.
for example both Go and Haskell need some kind of "green threads", so there are more shared runtime challenges than you might expect.
- ^ "The Limbo Programming Language". www.vitanuova.com. Retrieved 2019-04-01.
- ^ "Multithreading in the MRI Ruby Interpreter | BugFactory". Retrieved 2024-06-18.
- ^ "Racket Places". Retrieved 2011-10-13.
Places enable the development of parallel programs that take advantage of machines with multiple processors, cores, or hardware threads. A place is a parallel task that is effectively a separate instance of the Racket virtual machine.
- ^ "Using Threads to Run Code Simultaneously - The Rust Programming Language". doc.rust-lang.org. Retrieved 2021-09-24.
- ^ "Stackless.com: About Stackless". Archived from the original on 2012-02-27. Retrieved 2008-08-27.
A round robin scheduler is built in. It can be used to schedule tasklets either cooperatively or preemptively.
- ^ "Tcl event loop". Retrieved 6 December 2015.
- ^ "JEP 425: Virtual Threads (Preview)". Retrieved 2024-01-25.
- ^ "JEP 444: Virtual Threads". Retrieved 2024-01-25.
- ^ "JEP 464: Scoped Values (Second Preview)". Retrieved 2024-01-25.
External links
[edit]- "Four for the ages", JavaWorld article about Green threads
- Green threads on Java threads FAQ