|Born||April 25, 1956|
|Alma mater||University of Pittsburgh (Ph.D., 1993)|
|Known for||Natural language processing|
|Awards||ACM Multimedia Grand Challenge|
Natural Language Processing & Understanding
|Thesis||Exploring Automatic Thesaurus Generation (1993)|
Gregory Grefenstette is a French American computer scientist, researcher, professor, and author in the fields of artificial intelligence and natural language processing, who currently serves as chief scientific officer at Biggerpan, a company developing a predictive contextual engine for the mobile web. Grefenstette is also a senior associate researcher at the Florida Institute for Human and Machine Cognition (IHMC).
Grefenstette was born in Pittsburgh, PA in 1956. He attended the Massachusetts Institute of Technology as an undergraduate student in mathematical science and completed his bachelor’s degree at Stanford University in 1978. Grefenstette graduated with a master's degree from Paris-Sud University in 1983 and received his PhD in computer science from University of Pittsburgh in 1993. From 1984 to 1989, he has also worked as an assistant professor at University of Tours, in France.
Grefenstette’s research primarily focuses on natural language processing. Following his PhD work on “Exploring Automatic Thesaurus Generation”, he mostly addressed large-scale natural language processing problems and co-edited with Adam Kilgarriff a special issue of "Computational Linguistics" about how to use the Internet as a corpus for machine learning. Previous to his position at Biggerpan, Grefenstette was an Advanced Researcher at INRIA, the French national research institute in Computer Science, working on personal semantics. Prior to that, he was chief science officer of Exalead, a search engine company, managing the OSEO QUAERO CMSE program on innovative multimedia indexing technologies. Grefenstette is also a former chief scientist at the Xerox Research Centre Europe (1993-2001), at Clairvoyance Corporation (2001-2004), and with the French CEA (2004-2008).
Grefenstette was named as an inventor in 20 US patents, a majority of which were filed during his years at Xerox. With his research team, he has received awards for his work on semantic maps and won a three-year grant by the Lagardere Foundation in 2007. He has also authored four books and has been part of several journal publications. Referenced in many natural language processing research papers, Grefenstette is especially renowned for his work on cross-language information retrieval and distributional semantics.
- Grefenstette, Gregory; Wilber, Laura (2011). Search-based Applications: At the Confluence of Search and Database Technologies. Morgan & Claypool Publishers. ISBN 9781608455072.
- Renals, Steve; Grefenstette, Gregory (2003-09-09). Text- and Speech-Triggered Information Access: 8th ELSNET Summer School, Chios Island, Greece, July 15-30, 2000, Revised Lectures. Springer. ISBN 9783540451150.
- Grefenstette, Gregory (1998). Cross-language information retrieval. Kluwer Academic Publishers. ISBN 9781461527107.
- Grefenstette, Gregory (2012-12-06). Explorations in Automatic Thesaurus Discovery. Springer Science & Business Media. ISBN 9781461527107.
- International Handbook of Modern Lexis and Lexicography | Patrick Hanks | Springer.
- Grefenstette, Gregory and Lawrence Muchemi. 2016. Determining the Characteristic Vocabulary for a Specialized Dictionary using Word2vec and a Directed Crawler, 10th Language Resources and Evaluation Conference
- Grefenstette, Gregory. 2016. Extracting Weighted Language Lexicons from Wikipedia.
- Grefenstette, Gregory; Rafes, Karima (2015). "Transforming Wikipedia into an Ontology-based Information Retrieval Search Engine for Local Experts using a Third-Party Taxonomy". arXiv:1511.01259 [cs.IR].
- "Adam Kilgarriff's Legacy to Computational Linguistics and Beyond, Proceedings of the 17th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2016)" (PDF).
- "Extracting Hierarchical Topic Models from the Web for Improving Digital Archive Access" (PDF).
- Grefenstette, Gregory. "Personal Information Systems and Personal Semantics".
- Grefenstette, Gregory (2015). Language Production, Cognition, and the Lexicon. Springer, Cham. pp. 203–219. doi:10.1007/978-3-319-08043-7_12.
- Grefenstette, Gregory (2015). "INRIASAC: Simple Hypernym Extraction Methods". arXiv:1502.01271 [cs.CL].
- Law-To, Julien; Grefenstette, Gregory; Landais, Rémi (2013). "Semantic Dispatching of Multimedia News with MEWS". Proceedings of the 21st ACM International Conference on Multimedia. MM '13. New York, NY, USA: ACM: 427–428. doi:10.1145/2502081.2502253. ISBN 9781450324045.
- 2010 ACM Multimedia Grand Challenge Bronze award
- 2009 ACM Multimedia Grand Challenge, most practical system award
- 2007 three-year grant by Lagardere Foundation for work on semantic maps
- 1978 ITT International Fellow to Belgium
- "Gregory Grefenstette - IHMC | Institute for Human & Machine Cognition". IHMC | Institute for Human & Machine Cognition. Retrieved 2017-07-26.
- "Грегори Грефенштетт: Компьютерная лингвистика в начале XXI века - ПОЛИТ.РУ". polit.ru. Retrieved 2017-07-26.
- "Gregory Grefenstette — Dr. Haxel". www.haxel.com. Retrieved 2017-07-26.
- "Gregory Grefenstette Inventions, Patents and Patent Applications - Justia Patents Search". patents.justia.com. Retrieved 2017-07-26.
- "dblp: Gregory Grefenstette". dblp.uni-trier.de. Retrieved 2017-07-26.
- Král, Pavel; Martín-Vide, Carlos (2016-09-20). Statistical Language and Speech Processing: 4th International Conference, SLSP 2016, Pilsen, Czech Republic, October 11-12, 2016, Proceedings. Springer. ISBN 9783319459257.