Chess engine

From Wikipedia, the free encyclopedia
Jump to: navigation, search

In computer chess, a chess engine is a computer program that analyses chess positions and makes decisions on the best chess moves.[1]

The chess engine decides what moves to make, but typically does not interact directly with the user. Most chess engines do not have their own graphical user interface (GUI) but are rather console applications that communicate with a GUI such as XBoard, WinBoard or glChess via a standard protocol. This allows the user to play against multiple engines without learning a new user interface for each, and allows different engines to play against each other.

Interface protocol[edit]

The command-line interface of GNU Chess became the initial de facto standard, called the Chess Engine Communication Protocol and was first supported by XBoard. When XBoard was ported to the Windows operating system as WinBoard, this protocol was popularly renamed to 'WinBoard Protocol'. The WinBoard Protocol was itself upgraded and the two versions of the protocols are referred to as: 'WinBoard Protocol 1' (original version) and 'WinBoard Protocol 2' (newer version). There is another protocol, the Universal Chess Interface. Some engines support both major protocols, and each protocol has its supporters. The Winboard Protocol is more popular but many chess engine developers feel that the Universal Chess Interface is easier to implement. Some interfaces support both protocols, while others, such as WinBoard, support only one and depend on subsidiary interpreters to translate.

Increasing strength[edit]

Main article: Computer chess

Chess engines increase in playing strength each year. This is partly due to the increase in processing power that enables calculations to be made to ever greater depths in a given time. In addition, programming techniques have improved, enabling the engines to be more selective in the lines that they analyse and to acquire a better positional understanding.

Some chess engines use endgame tablebases to increase their playing strength during the endgame. An endgame tablebase is a database of all possible endgame positions with small groups of material. Each position is conclusively determined as a win, loss, or draw for the player whose turn it is to move, and the number of moves to the end with best play by both sides. Endgame tablebases in all cases identify the absolute best move in all positions included (identifying the move that wins fastest against perfect defense, or the move that loses slowest against optimal opposition). Such tablebases are available for all positions containing three to six pieces (counting the kings) and for some seven-piece combinations. When the maneuvering in an ending to achieve an irreversible improvement takes more moves than the horizon of calculation of a chess engine, an engine is not guaranteed to find the best move without the use of an endgame tablebase, and in many cases can fall foul of the fifty-move rule as a result.

Many engines use permanent brain as a method to increase their strength.

Comparisons[edit]

Tournaments[edit]

The results of computer tournaments give one view of the relative strengths of chess engines. However, tournaments do not play a statistically significant number of games for accurate strength determination. In fact, the number of games that need to be played between fairly evenly matched engines, in order to achieve significance, runs into the thousands and is, therefore, impractical within the framework of a tournament.[2] Most tournaments also allow any types of hardware, so only engine/hardware combinations are being compared.

Historically, commercial programs have been the strongest engines. To some extent, this is a self-fulfilling prophecy; if an amateur engine wins a tournament or otherwise performs well (for example, Zappa in 2005), then it is quickly commercialized. Titles gained in these tournaments garner much prestige for the winning programs, and are thus used for marketing purposes.

Ratings[edit]

Chess engine rating lists aim to provide statistically significant measures of relative engine strength. These lists play multiple games between engines on standard hardware platforms, so that processor differences are factored out. Some also standardize the opening books, in an attempt to measure the strength differences of the engines only. These lists not only provide a ranking, but also margins of error on the given ratings. Also rating lists typically play games continuously, publishing many updates per year, compared to tournaments which only take place annually.

There are a number of factors that vary among the chess engine rating lists:

  • Time control. Longer time controls, such as 40 moves in 120 minutes, are better suited for determining tournament play strength, but also make testing more time-consuming.
  • Hardware used. Faster hardware with more memory leads to stronger play.
  • 64-bit (vs. 32-bit) hardware and operating systems favor bitboard-based programs
  • Multiprocessor vs. single processor hardware.
  • Ponder settings (speculative analysis while the opponent is thinking) aka Permanent Brain.
  • Transposition table sizes.
  • Opening book settings.

These differences affect the results, and make direct comparisons between rating lists difficult. All listed engines are 64-bit.

Rating list Time control
(moves/minutes)
Year
started
Last updated Engine/platform
entries
Games
played
Top three engines Rating
CCRL[3] 40/40[4]
Ponder OFF
2005 June 28, 2014 1427 522,540 Stockfish 5 x64 4CPU
Houdini 4 x64 4CPU
Komodo 7a x64 4CPU
3305
3275
3246
CEGT[5] 40/20[6]
Ponder OFF
2006 June 29, 2014 1198 751,877 Stockfish 5 x64 4CPU
Komodo 7 x64 4CPU
Houdini 4 x64 4CPU
3176
3164
3153
IPON[7] 5m+3s
~16min/game
Ponder ON
2006 June 6, 2014 134 322,480 Stockfish 5 Syzygy
Houdini 4
Komodo 7a
3145
3135
3107
SWCR[8] 40/5
Ponder ON
2009 January 31, 2014 21 17,500 Houdini 4 x64 1CPU
Stockfish 4 130512-beta x64 1CPU
Komodo TCEC x64 1CPU
3098
3066
3046
SSDF[9] 40/120 --> 20/60
Ponder ON
1984 May 12, 2014 339 134,044 Komodo 5.1 MP x64 2GB Q6600 2.4 GHz
Stockfish 3 MP x64 2GB Q6600 2.4 GHz
Deep Rybka 4 x64 2GB Q6600 2.4 GHz
3259
3211
3209
  • Note that the listings in the above table only count the best entry for a given engine.

These ratings, although calculated by using the Elo system (or similar rating methods), have no direct relation to FIDE Elo ratings or to other chess federation ratings of human players. Except for some man versus machine games which the SSDF had organized many years ago (which were far from today's level), there is no calibration between any of these rating lists and player pools. Hence, the results which matter are the ranks and the differences between the ratings, not the absolute level of the numbers. Also, each list calibrates their Elo via a different method. Therefore no Elo comparisons can be made between the lists. Nevertheless, in view of recent man versus machine matches, it is generally undisputed that top computer chess engines should be rated at least in the range of top human performances, and probably significantly higher.

Missing from many rating lists are IPPOLIT and its derivatives (e.g. Fire). Although very strong and open source, there are allegations from commercial software interests that they were derived from disassembled binary of Rybka.[10] Due to the controversy, all these engines have been blacklisted from many tournaments and rating lists. Rybka in turn was accused of being based on Fruit,[11] and in June 2011, the ICGA formally claimed Rybka was derived from Fruit and Crafty and banned Rybka from the International Computer Games Association World Computer Chess Championship, and revoked its previous victories (2007, 2008, 2009, and 2010)[12] The ICGA was criticized for this decision by Dr. Søren Riis, a longstanding supporter of the Rybka program.[13] Rybka is still included on several society ranking lists.

Test suites[edit]

Engines can be tested by measuring their performance on specific positions. Typical is the use of test suites, where for each given position there is one best move to find. These positions can be geared towards positional, tactical or endgame play. The Nolot test suite, for instance, focuses on deep sacrifices.[14] Then there are the BT2450 and BT2630 test suites by Hubert Bednorz and Fred Toennissen. These suites measure the tactical capability of the engine[15] and have been used at least by REBEL.[16] There is also a general test suite called Brilliancy by Dana Turnmire. The suite has been compiled mostly from How to Reassess Your Chess Workbook.[17]

Strategic Test Suite (STS) by Swaminathan and Dann Corbit, tests chess engine's strategical strength.[18]

Categorizations[edit]

Dedicated hardware[edit]

These chess playing systems include custom hardware or run on supercomputers. All are historical; chess supercomputers have not competed in computer tournaments since Hydra played in 2006.

Commercial dedicated computers[edit]

In the 1980s and early 1990s, there was a competitive market for strong dedicated chess computers. Many form-factors were sold, from handheld peg-board computers to wooden auto-sensory boards with state-of-the-art processors. This market changed in the mid-90s when the economical embedded processors in dedicated chess computers could no longer compete with the fast processors in personal computers. Nowadays, most dedicated units sold are of beginner and intermediate strength.

  • Chess Challenger, a line of chess computers sold by Fidelity Electronics from 1977 to 1992.[19] These models won the first four World Microcomputer Chess Championships.
  • ChessMachine, an ARM-based dedicated computer, which could run two engines:
  • Excalibur Electronics sells a line of beginner strength units. Excalibur was started in 1992 by the son of the founder of Fidelity Electronics.
  • Mephisto, a line of chess computers sold by Hegener & Glaser. The units programmed by Richard Lang won six consecutive World Microcomputer Chess Championships. They bought out Fidelity in 1989.
  • Novag sells a line of tactically strong computers, including the Constellation, Sapphire, and Star Diamond brands.
  • Phoenix Chess Systems makes limited edition units based around StrongARM and XScale processors running modern engines and emulating classic engines.
  • Saitek sells mid-range units of intermediate strength. They bought out Hegener & Glaser and its Mephisto brand in 1994.

Historical[edit]

These chess programs run on obsolete hardware.

Other[edit]

There are hundreds of free and/or open source chess engines which conform to one of the above communication protocols. The top 50 strongest, freely available engines, according to the CCRL 40/40 rating list, are listed here.[21]

See also[edit]

Notes and references[edit]

  1. ^ blog.chess.com Creating a chess engine from scratch (Part 1: Basics), Link date 28 June 2012
  2. ^ mizarchessengine.com
  3. ^ "CCRL 40/40 - Complete list". June 28, 2014. Retrieved July 1, 2014. 
  4. ^ Also available: 40 moves in 4 minutes
  5. ^ "CEGT 40/20". Chess Engines Grand Tournament. june 29, 2014. Retrieved July 1, 2014. 
  6. ^ Also available: 40 moves in 4 minutes, 40 moves in 120 minutes
  7. ^ "IPON". IPON. June 6, 2014. Retrieved July 1, 2014. 
  8. ^ "SWCR". SWCR. January 31, 2014. Retrieved January 31, 2014. 
  9. ^ "The SSDF Rating List". Swedish Chess Computer Association. May 12, 2014. Retrieved July 1, 2014. 
  10. ^ Chess engine controversy at chessvibes.com, retrieved 28/May/2010
  11. ^ Evaluation
  12. ^ Rybka disqualified and banned from World Computer Chess Championships | ChessVibes
  13. ^ Riis, Dr. Søren (January 2, 2012). "A Gross Miscarriage of Justice in Computer Chess (part one)". Chessbase News. Retrieved 19 February 2012. 
  14. ^ Nolot test suite
  15. ^ BT2450 test suite
  16. ^ Rebel
  17. ^ Brilliancy suite TalkChess forum
  18. ^ [1] Strategic Test Suite
  19. ^ Fidelity Chess Challenger 1 - World's First Chess Computer
  20. ^ Microchess
  21. ^ Elo ratings taken from "CCRL 40/40, full list". Computer Chess Rating List. January 29, 2012. Retrieved February 9, 2012. 

External links[edit]