Mike Phillips (speech recognition)
This biographical article is written like a résumé. (January 2016)
Phillips was a student in electrical engineering at Carnegie Mellon University. He was also a researcher for Carnegie Mellon and then a research scientist at the Spoken Language Systems group at the Massachusetts Institute of Technology (MIT), where he helped to develop VOYAGER, an “urban navigation and exploration system” that could recognize and interpret basic spoken queries. VOYAGER was one of the first research systems to combine speech recognition and natural language processing to have a conversation with a user.
In 1994, Phillips co-founded and became CTO of Boston-based SpeechWorks, which became one of the leading US-based vendors of speech recognition technology at the time, alongside Nuance Communications and IBM. The startup developed interactive voice response systems, including call-center interfaces for clients including Amtrak and FedEx. SpeechWorks’ technology worked for call-center interfaces because the customer could verbally answer questions posed by the human-sounding speech recognition program, rather than navigating through a menu. The technology also had time-saving “barge-in” capabilities, meaning that a customer could interrupt the system before it finished offering the full list of options. The system could also “learn.” It kept a record of names or phrases customers had used in the past so that it could learn to understand names and phrases that slightly differed from its original vocabulary.
SpeechWorks’ value more than tripled after its initial public offering, and it was acquired by ScanSoft in 2003. While Phillips was CTO at ScanSoft, he worked on technologies across the company’s products, including the leading dictation software Dragon NaturallySpeaking. ScanSoft then acquired Nuance Communications in 2005, and adopted the latter’s name.
Phillips returned to MIT as a visiting scientist and co-founded Vlingo in 2006 with former SpeechWorks colleague John Nguyen. An intelligent software assistant, Vlingo is a speech-to-text application integrated with user-facing apps for iPhone, Android, BlackBerry, and other smartphones. Vlingo software allowed users to text and navigate smartphones via voice recognition. The first cell phone speech recognition software that successfully interpreted user input and learned over time, the software would later be adapted into the popular personal assistant software Siri.
In 2008, Nuance Communications attempted to sue Vlingo on the grounds of patent infringement. Phillips was offered the choice to either sell Vlingo to Nuance or be sued. After six lengthy lawsuits, Phillips won, but the $3 million in legal fees drained his company’s research and development funds. Vlingo was sold to Nuance in December 2011.
In 2013, Phillips co-founded a startup, Sense Labs. Headquartered in Cambridge, Massachusetts, the Sense home energy monitor is an in-development device. Once attached to a home’s electric panel, it “listens” to a home’s electricity usage and identifies the wattage various appliances draw. The first wave of Sense energy monitors began shipping in early December 2015.
Phillips has served on various boards and holds more than 20 patents.
- 2004: Top Leader in Speech from Speech Technology Magazine 
- 2005: Winner of the Speech Technology Magazine Lifetime Achievement Award 
- 1983 Feature-based speaker-independent recognition of isolated english letters
- 1986 The C-MU phonetic classification system
- 1989 The MIT Summit Speech Recognition System: A Progress Report
- 1989 The Voyager Speech Understanding System: A Progress Report
- 1989 The Collection And Preliminary Analysis Of A Spontaneous Speech Database
- 1989 Preliminary Evaluation Of The Voyager Spoken Language System
- 1990 The VOYAGER speech understanding system: preliminary development and evaluation
- 1990 Preliminary ATIS Development at MIT
- 1990 Recent Progress on the VOYAGER System
- 1990 Recent Progress on the SUMMIT System
- 1990 From speech recognition to spoken language understanding: the development of the MIT SUMMIT and VOYAGER systems
- 1990 Phonetic classification and recognition using the multi-layer perceptron
- 1991 Modelling Context Dependency in Acoustic-Phonetic and Lexical Representations
- 1991 Development and Preliminary Evaluation of the MIT ATIS System
- 1991 Integrating Syntax and Semantics into Spoken Language Understanding
- 1991 Integration of speech recognition and natural language processing in the MIT VOYAGER system
- 1991 Back Talking To Your Database: Interactive Spoken Language Interfaces
- 1992 The MIT ATIS System: February 1992 Progress Report
- 1992 Collection and Analyses of WSJ-CSR Data at MIT
- 1993 A Bilingual VOYAGER System
- 1994 The Hub and Spoke Paradigm for CSR Evaluation
- 1994 PEGASUS: A Spoken Language Interface for On-Line Air Travel Planning
- 1995 Multilingual spoken-language understanding in the MIT Voyager system
- 2004 Multimodal conversational systems for automobiles
- 2006 Applications of spoken language technology and systems
- "CMU Robust Speech Recognition Home Page". www.cs.cmu.edu. Retrieved 2016-01-21.
- "Speech Industry Expert Mike Phillips Joins Tell-Eureka Advisory Board; MIT Scientist and a Founder of Speechworks (Now Part of Nuance) to Help Tell-Eureka Bring Next Generation Speech Applications to a Broader Market | Business Wire". www.businesswire.com. Retrieved 2016-01-21.
- Zue, Victor. "From Speech Recognition to Spoken Language Understanding: The Development of the MIT SUMMIT and VOYAGER Systems" (PDF).
- Zue, Victor. "THE VOYAGER SPEECH UNDERSTANDING SYSTEM: A PROGRESS REPORT" (PDF).
- Fitzgerald, Michael (2008-01-27). "The Coming Wave of Gadgets That Listen and Obey". The New York Times. ISSN 0362-4331. Retrieved 2016-01-21.
- Fluss, Donna (June 2002). "Ripe for the picking. (Speech Recognition)".
- "Talk to the Phone | MIT Technology Review". MIT Technology Review. Retrieved 2016-01-21.
- Kirsner, Scott (2012-05-25). "Former SpeechWorks chief executive out raising money for Xtone, startup that wants to speech-enable mobile apps". Boston.com. Retrieved 2016-01-21.
- "Thrifty speaks to its customers: car rental agency deploys speech recognition to improve customer experience while reducing costs". Customer Interface. October 2002.
- Akass, Clive (July 1, 2005). "Voice on a sound footing. Speech input has become viable on PCs and will soon be available on mobiles. But it has a long way to go before you can throw away your keyboard, writes Clive Akass".
- Banks, Courtney. "A Safer Way to Text on the Road". Wall Street Journal. ISSN 0099-9660. Retrieved 2016-01-21.
- "Vlingo's Adaptive Speech Recognition Promises an End to Typing on your Phone Keyboard | Xconomy". Xconomy. Retrieved 2016-01-21.
- Farrell, Michael. "Does Siri soar on Dragon's wings?" (PDF).
- "Nuance Plays Hardball in Voice Recognition". BloombergView. Retrieved 2016-01-21.
- "The Patent, Used as a Sword - NYTimes.com". mobile.nytimes.com. Retrieved 2016-01-21.
- UTC, Samantha Murphy Kelly2011-12-20 21:39:55. "Nuance Acquires Voice-Recognition Competitor Vlingo". Mashable. Retrieved 2016-01-21.
- Duhigg, Charles; Lohr, Steve (2012-10-07). "In Technology Wars, Using the Patent as a Sword". The New York Times. ISSN 0362-4331. Retrieved 2016-01-21.
- "Cambridge's Sense Labs starts production of new device to track what's happening at home". www.betaboston.com. Retrieved 2016-01-21.
- Cohan, Peter. "5 Reasons to Scrap Our Patent System: #1. Apple's Siri". Forbes. Retrieved 2016-01-21.
- "2004 Speech Solutions Winners". www.speechtechmag.com. Retrieved 2016-01-21.
- "2005 Speech Solutions Winners". www.speechtechmag.com. Retrieved 2016-01-21.