Proof assistant

From Wikipedia, the free encyclopedia
An interactive proof session in CoqIDE, showing the proof script on the left and the proof state on the right

In computer science and mathematical logic, a proof assistant or interactive theorem prover is a software tool to assist with the development of formal proofs by human-machine collaboration. This involves some sort of interactive proof editor, or other interface, with which a human can guide the search for proofs, the details of which are stored in, and some steps provided by, a computer.

A recent effort within this field is making these tools use artificial intelligence to automate the formalization of ordinary mathematics.[1]

System comparison[edit]

Name Latest version Developer(s) Implementation language Features
Higher-order logic Dependent types Small kernel Proof automation Proof by reflection Code generation
ACL2 8.3 Matt Kaufmann and J Strother Moore Common Lisp No Untyped No Yes Yes[2] Already executable
Agda 2.6.3 Ulf Norell, Nils Anders Danielsson, and Andreas Abel (Chalmers and Gothenburg) Haskell Yes Yes Yes No Partial Already executable
Albatross 0.4 Helmut Brandl OCaml Yes No Yes Yes Un­known Not yet Implemented
Coq 8.17.0 INRIA OCaml Yes Yes Yes Yes Yes Yes
F* repository Microsoft Research and INRIA F* Yes Yes No Yes Yes[3] Yes
HOL Light repository John Harrison OCaml Yes No Yes Yes No No
HOL4 Kananaskis-13 (or repo) Michael Norrish, Konrad Slind, and others Standard ML Yes No Yes Yes No Yes
Idris 2 0.6.0. Edwin Brady Idris Yes Yes Yes Un­known Partial Yes
Isabelle Isabelle2021 (February 2021) Larry Paulson (Cambridge), Tobias Nipkow (München) and Makarius Wenzel Standard ML, Scala Yes No Yes Yes Yes Yes
Lean v3.4.2 (official release)[4] v3.51.1 (community release)[5] v4.1.0[6] Leonardo de Moura (Microsoft Research) C++, Lean Yes Yes Yes Yes Yes Yes
LEGO (not affiliated with Lego) 1.3.1 Randy Pollack (Edinburgh) Standard ML Yes Yes Yes No No No
Metamath v0.198[7] Norman Megill ANSI C
Mizar 8.1.05 Białystok University Free Pascal Partial Yes No No No No
NuPRL 5 Cornell University Common Lisp Yes Yes Yes Yes Un­known Yes
PVS 6.0 SRI International Common Lisp Yes Yes No Yes No Un­known
Twelf 1.7.1 Frank Pfenning and Carsten Schürmann Standard ML Yes Yes Un­known No No Un­known
  • ACL2 – a programming language, a first-order logical theory, and a theorem prover (with both interactive and automatic modes) in the Boyer–Moore tradition.
  • Coq – Allows the expression of mathematical assertions, mechanically checks proofs of these assertions, helps to find formal proofs, and extracts a certified program from the constructive proof of its formal specification.
  • HOL theorem provers – A family of tools ultimately derived from the LCF theorem prover. In these systems the logical core is a library of their programming language. Theorems represent new elements of the language and can only be introduced via "strategies" which guarantee logical correctness. Strategy composition gives users the ability to produce significant proofs with relatively few interactions with the system. Members of the family include:
  • IMPS, An Interactive Mathematical Proof System[8]
  • Isabelle is an interactive theorem prover, successor of HOL. The main code-base is BSD-licensed, but the Isabelle distribution bundles many add-on tools with different licenses.
  • Jape – Java based.
  • Lean
  • LEGO
  • Matita – A light system based on the Calculus of Inductive Constructions.
  • MINLOG – A proof assistant based on first-order minimal logic.
  • Mizar – A proof assistant based on first-order logic, in a natural deduction style, and Tarski–Grothendieck set theory.
  • PhoX – A proof assistant based on higher-order logic which is eXtensible.
  • Prototype Verification System (PVS) – a proof language and system based on higher-order logic.
  • TPS and ETPS – Interactive theorem provers also based on simply-typed lambda calculus, but based on an independent formulation of the logical theory and independent implementation.

User interfaces[edit]

A popular front-end for proof assistants is the Emacs-based Proof General, developed at the University of Edinburgh.

Coq includes CoqIDE, which is based on OCaml/Gtk. Isabelle includes Isabelle/jEdit, which is based on jEdit and the Isabelle/Scala infrastructure for document-oriented proof processing. More recently, a Visual Studio Code extension for Isabelle has also been developed by Makarius Wenzel.[9]

Formalization extent[edit]

Freek Wiedijk has been keeping a ranking of formal verification systems (proof assistants included) by the amout of formalized theorems out of a list of 100 famous theorems. As of September 2023, only five systems have formalized proofs of more than 70% of the theorems, namely Isabelle, HOL Light, Coq, Lean and Metamath.[10]

See also[edit]


  1. ^ Ornes, Stephen (August 27, 2020). "Quanta Magazine - How Close Are Computers to Automating Mathematical Reasoning?".
  2. ^ Hunt, Warren; Matt Kaufmann; Robert Bellarmine Krug; J Moore; Eric W. Smith (2005). "Meta Reasoning in ACL2" (PDF). Theorem Proving in Higher Order Logics. Lecture Notes in Computer Science. Vol. 3603. pp. 163–178. doi:10.1007/11541868_11. ISBN 978-3-540-28372-0.
  3. ^ Search for "proofs by reflection": arXiv:1803.06547
  4. ^ "Lean Theorem Prover Releases page". GitHub.
  5. ^ "Lean Community Releases Page". GitHub.
  6. ^ "Lean 4 Releases Page". GitHub.
  7. ^
  8. ^ Farmer, William M.; Guttman, Joshua D.; Thayer, F. Javier (1993). "IMPS: An interactive mathematical proof system". Journal of Automated Reasoning. 11 (2): 213–248. doi:10.1007/BF00881906. S2CID 3084322. Retrieved 22 January 2020.
  9. ^ Wenzel, Makarius. "Isabelle". Retrieved 2 November 2019.
  10. ^ Wiedijk, Freek (15 September 2023). "Formalizing 100 Theorems".


External links[edit]