Ashish Vaswani
Ashish Vaswani | |
|---|---|
| Born | 1986 (age 38–39) |
| Alma mater | |
| Known for | Transformer (deep learning architecture) |
| Scientific career | |
| Fields | |
| Institutions |
|
| Thesis | Smaller, Faster, and Accurate Models for Statistical Machine Translation (2014) |
| Doctoral advisor |
|
Ashish Vaswani (born 1986)[1] is an Indian computer scientist. He worked as a research scientist at Google Brain and Information Sciences Institute.
Vaswani is a co-author of the 2017 paper "Attention Is All You Need," which introduced the Transformer neural network architecture.[2] The Transformer model has been used in the development of subsequent NLP models BERT, ChatGPT, and their successors.
Career
[edit]Vaswani completed his engineering in Computer Science from BIT Mesra in 2002. In 2004, he enrolled at the University of Southern California for graduate studies.[3] He earned his PhD in Computer Science at the University of Southern California supervised by David Chiang.[4] He has worked as a researcher at Google,[5] where he was part of the Google Brain team. He was a co-founder of Adept AI Labs. He has since left the company.[6][7]
Vaswani is currently co-founder and CEO of Essential AI.
Notable works
[edit]Vaswani's most notable paper, "Attention Is All You Need", was published in 2017.[8] The paper introduced the Transformer model, which uses self-attention mechanisms instead of recurrence for sequence-to-sequence tasks. The model has been instrumental in the development of several subsequent state-of-the-art models in NLP, including BERT,[9] GPT-2, and GPT-3.
References
[edit]- ^ Nichil, Geoffrey (16 November 2024). "Who is Ashish Vaswani?". Synaptiks. Archived from the original on 15 December 2024.
- ^ Ashish Vaswani; Noam Shazeer; Niki Parmar; Jakob Uszkoreit; Llion Jones; Aidan N. Gomez; Łukasz Kaiser; Illia Polosukhin (12 June 2017). "Attention is All you Need" (PDF). Advances in Neural Information Processing Systems 30. Advances in Neural Information Processing Systems. arXiv:1706.03762. Wikidata Q30249683.
- ^ Team, OfficeChai (February 4, 2023). "The Indian Researchers Whose Work Led To The Creation Of ChatGPT". OfficeChai.
- ^ "Ashish Vaswani's webpage at ISI". www.isi.edu.
- ^ "Transformer: A Novel Neural Network Architecture for Language Understanding". ai.googleblog.com. August 31, 2017.
- ^ Rajesh, Ananya Mariam; Hu, Krystal; Rajesh, Ananya Mariam; Hu, Krystal (March 16, 2023). "AI startup Adept raises $350 mln in fresh funding". Reuters – via www.reuters.com.
- ^ Tong, Anna; Hu, Krystal; Tong, Anna; Hu, Krystal (2023-05-04). "Top ex-Google AI researchers raise funding from Thrive Capital". Reuters. Retrieved 2023-07-11.
- ^ Dawson, Caitlin (March 9, 2023). "USC Alumni Paved Path for ChatGPT". USC Viterbi | School of Engineering.
- ^ Devlin, Jacob; Chang, Ming-Wei; Lee, Kenton; Toutanova, Kristina (May 24, 2019). "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding". arXiv:1810.04805 [cs.CL].