Xuedong Huang

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Xuedong Huang
Dr Xuedong Huang.jpg
Born (1962-10-20) October 20, 1962 (age 52)
Hunan, China
Citizenship American (since 1995)
Fields Speech Recognition
Natural Language Processing
Software Development
Institutions Microsoft
Carnegie Mellon University
Alma mater Hunan University
Tsinghua University
Edinburgh University
Doctoral advisor Mervyn Jack
Doctoral students Mei-Yuh Hwang
Roni Rosenfeld
Notable awards 2011 Asian American Engineer of the Year
IEEE 1993 Paper Award
Allen Newell Research Excellence Medal

Xuedong David Huang (also known as XD, Simplified Chinese: 黄学东, b. October 20, 1962) is a Chinese-American computer scientist and the key person behind Microsoft's spoken language and search technologies. He is currently a Distinguished Engineer and Architect in Microsoft's online services division for Search and Advertising.


Huang grew up in Hunan, China and became a US citizen in 1995.



In 1978, Huang entered Hunan University without finishing his high school. He graduated with a B.S. degree in computer science from Hunan University in 1982, and went on to earn a MS in computer science from Tsinghua University. He received his PhD in Electrical Engineering from University of Edinburgh.

Academic research[edit]

He joined the Carnegie Mellon University faculty in 1989 and worked with Raj Reddy and Kai-Fu Lee on speech recognition. At CMU, Huang directed Sphinx-II speech system research that had the best overall performance in every category of DARPA's 1992 benchmarking. He received the 1992 Allen Newell research excellence medal for his leadership in speech recognition.

Huang has co-authored two books: Hidden Markov Models for Speech Recognition, (1987) and Spoken Language Processing, Prentice Hall](2000). He became an IEEE Fellow in 2000. Huang received the National Education Commission of China's 1987 Science and Technology Progress Award, IEEE 1993 Speech Processing Best Paper Award. SpeechTek has named him a top 10 leader of the speech industry.


Huang is known as Mr Speech at Microsoft for founding its speech recognition and Microsoft's MiPad [1] initiatives. MiPad was several years ahead of Apple's iPad as Bill Gates demonstrated it in CES 2001.

Huang was general manager of Microsoft's Communications Innovation Center. He helped to create Microsoft Response Point that received 2009's Technology of the Year Awards as the best VOIP phone system from the InfoWorld Magazine.[2]

Huang has spent his career helping to advance speech recognition technologies in a variety of capacities. He was the key leader who brought Microsoft's Speech Application Programming Interface (SAPI) and speech recognition/TTS technologies to the public. From 2000 to 2004, Huang served as general manager of Microsoft's Speech Platforms Group, where he led both the business and engineering teams that shipped Microsoft Speech Server and other voice technologies used in Microsoft Windows, Microsoft Office, Windows Mobile and Microsoft Exchange Server. He recently coauthored a historical speech recognition review with Raj Reddy and James K. Baker for CACM. [3]

Hunan University and University of Washington[edit]

Huang is the Honorary Dean and Professor of School of Software Engineering at Hunan University. He also serves on the advisory board of China's National Super Computer Center in Changsha. He is an affiliate Professor at University of Washington.

TV and books[edit]

  • Robert MacNeil, William Cran, Robert McCrum (2005). Do You Speak American? page 191-197, Harcourt Trade
  • PBS TV: Do You Speak American? 2005
  • Xuedong Huang, Alex Acero, Hsiao-Wuen Hon (2001). Spoken Language Processing: a guide to theory, algorithm, and system development, page 1-980. Prentice Hall
  • Xuedong D Huang, Yasuo Ariki, Mervyn A Jack (1990). Hidden Markov Models for Speech Recognition, Edinburgh University Press


External links[edit]