Q*

Q* (pronounced "Q-star") is an alleged unreleased project by OpenAI dedicated to the application of artificial intelligence in logical and mathematical reasoning. In November 2023, certain employees of OpenAI reportedly raised concerns with the company's board, suggesting that Q* might signify the imminent emergence of artificial general intelligence.^[1] The reported work involves performing math on the level of grade-school students.^[1]^[2]^[3]

OpenAI spokesperson Lindsey Held Bolton contested this perspective in a statement conveyed to The Verge, stating, "Mira told employees what the media reports were about but she did not comment on the accuracy of the information." Additionally, a source familiar with the situation informed The Verge that the board never received a letter regarding such a groundbreaking development, and the progress of the company's research did not factor into Altman's abrupt termination.^[4]

Reaction from others in the field of AI were also dismissive when it came to claims of artificial general intelligence (AGI). François Chollet, an AI Researcher at Google with work on how to achieve greater generality in artificial intelligence,^[5] noted "Every single month from here on there will be rumors of AGI having been achieved internally. Just rumors, never any actual paper, product release, or anything of the sort. The first panic over imminent AGI was circa 2013 about Atari Q-learning by DeepMind. The second one was circa 2016 over Deep RL (partially triggered by AlphaGo)."^[6]^[7] Yann LeCun, Chief AI Scientist at Meta, described the rumors as a "deluge of complete nonsense about Q*."^[8]

References

^ ^a ^b Anna Tong; Jeffrey Dastin; Krystal Hu (November 22, 2023). "Exclusive: OpenAI researchers warned board of AI breakthrough ahead of CEO ouster, sources say". Reuters. Some at OpenAI believe Q* (pronounced Q-Star) could be a breakthrough in the startup's search for what's known as artificial general intelligence (AGI), one of the people told Reuters. OpenAI defines AGI as autonomous systems that surpass humans in most economically valuable tasks.
^ PRM800K: 800,000 step-level correctness labels on LLM solutions to MATH problems
^ Let's Verify Step by Step
^ The Verge: A recent OpenAI breakthrough on the path to AGI has caused a stir
^ "To Really Judge an AI's Smarts, Give it One of These IQ Tests". IEEE Spectrum. February 2, 2021. Retrieved August 2, 2021.
^ @fchollet (November 23, 2023). "Every single month from here on there will be rumors of AGI having been achieved internally. Just rumors, never any actual paper, product release, or anything of the sort" (Tweet) – via Twitter. – François Chollet, the creator of the Keras deep-learning library and AI Researcher at Google
^ @fchollet (November 23, 2023). "The first panic over imminent AGI was circa 2013 about Atari Q-learning by DeepMind. The second one was circa 2016 over Deep RL (partially triggered by AlphaGo). So many folks in late 2016 were convinced that Deep RL would lead to AGI in under in 5 years..." (Tweet) – via Twitter. – François Chollet, the creator of the Keras deep-learning library and AI Researcher at Google
^ @ylecun (November 24, 2023). "Please ignore the deluge of complete nonsense about Q*" (Tweet) – via Twitter. One of the main challenges to improve LLM reliability is to replace Auto-Regressive token prediction with planning.
Pretty much every top lab (FAIR, DeepMind, OpenAI etc) is working on that and some have already published ideas and results.
It is likely that Q* is OpenAI attempts at planning. They pretty much hired Noam Brown (of Libratus/poker and Cicero/Diplomacy fame) to work on that.
[Note: I've been advocating for deep learning architecture capable of planning since 2016].

Q*

See also

References

Further reading