Marti Hearst is a professor in the School of Information at the University of California, Berkeley. She did early work in corpus-based computational linguistics, including some of the first work on automating sentiment analysis and on word sense disambiguation. She invented the technique that became known as "Hearst patterns", which applies lexico-syntactic patterns to recognize hyponymy (ISA) relations with high accuracy in large text collections, and she applied it early on to WordNet; the technique is widely used in commercial text mining applications, including ontology learning. Hearst also did early work on automatically segmenting text along topical discourse boundaries, inventing a now well-known approach called TextTiling.
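The idea behind lexico-syntactic patterns can be illustrated with a minimal Python sketch. It is not Hearst's original implementation: it assumes lowercase single words stand in for noun phrases (a real system would use a part-of-speech tagger or chunker), it covers only two patterns ("X such as A, B, and C" and "A, B, and other X"), and the names `SUCH_AS`, `AND_OTHER`, and `extract_hyponyms` are hypothetical.

```python
import re

# Assumption: a single lowercase word is a crude stand-in for a noun phrase.
W = r"[a-z]+"
# List separators, longest alternatives first so ", and " is not read as ", ".
SEP = r"(?:, and |, or | and | or |, )"

# "X such as A, B, and C"  ->  A, B, C are hyponyms of X
SUCH_AS = re.compile(rf"\b(?P<hyper>{W}) such as (?P<hypos>{W}(?:{SEP}{W})*)")
# "A, B, and other X"      ->  A, B are hyponyms of X
AND_OTHER = re.compile(
    rf"\b(?P<hypos>{W}(?:, {W})*),? (?:and|or) other (?P<hyper>{W})"
)


def extract_hyponyms(text):
    """Return (hyponym, hypernym) pairs matched by the two patterns."""
    pairs = []
    lowered = text.lower()
    for pattern in (SUCH_AS, AND_OTHER):
        for match in pattern.finditer(lowered):
            for hypo in re.split(SEP, match.group("hypos")):
                pairs.append((hypo, match.group("hyper")))
    return pairs


print(extract_hyponyms("Works by authors such as Herrick, Goldsmith, and Shakespeare."))
print(extract_hyponyms("The site has temples, treasuries, and other buildings."))
```

Each pair reads "hyponym ISA hypernym"; accumulating such pairs over a large corpus is what makes the approach useful for building or extending a lexical hierarchy.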
Hearst's research focuses on user interfaces for search engine technology and big data analytics. She did early work on information visualization for search user interfaces, inventing the TileBars query term visualization. Her Flamenco research project investigated and developed the now widely used faceted navigation approach for searching and browsing web sites and information collections. She wrote the first academic book on the topic, Search User Interfaces (Cambridge University Press, 2009).
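Faceted navigation can be sketched in a few lines of Python. This is an illustrative toy, not the Flamenco implementation: the item records, the facet names `medium` and `century`, and the functions `filter_items` and `facet_counts` are all invented for the example. The core idea is that each item carries metadata along several independent facets, the user narrows the collection by picking facet values, and the interface shows how many items remain under each value.

```python
from collections import Counter

# Hypothetical collection: each item is described along independent facets.
ITEMS = [
    {"title": "Item A", "medium": "painting", "century": "19th"},
    {"title": "Item B", "medium": "painting", "century": "20th"},
    {"title": "Item C", "medium": "sculpture", "century": "19th"},
    {"title": "Item D", "medium": "painting", "century": "19th"},
]


def filter_items(items, selections):
    """Keep items whose metadata matches every selected facet value."""
    return [
        item
        for item in items
        if all(item.get(facet) == value for facet, value in selections.items())
    ]


def facet_counts(items, facet):
    """Count how many items fall under each value of one facet."""
    return Counter(item[facet] for item in items if facet in item)


# The user picks medium=painting; the interface then previews the century facet.
paintings = filter_items(ITEMS, {"medium": "painting"})
print([item["title"] for item in paintings])
print(dict(facet_counts(paintings, "century")))
```

Showing the counts next to each facet value lets a user see where a further refinement would lead before clicking, which avoids empty result sets.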
Hearst is an Edge Foundation contributing author and a member of the Usage Panel of the American Heritage Dictionary of the English Language.
- Hearst, M. (1992). "Direction-Based Text Interpretation as an Information Access". In Text-Based Intelligent Systems. Lawrence Erlbaum.
- Hearst, M. (1991). "Noun Homograph Disambiguation using Local Context in Large Text Corpora" (PDF). Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora. Oxford. Retrieved February 15, 2013.
- Indurkhya, N., Damerau, F. (2010). Handbook of Natural Language Processing. Chapman & Hall/CRC. p. 594.
- Hearst, M. (1992). "Automatic Acquisition of Hyponyms from Large Text Corpora" (PDF). Proceedings of the Fourteenth International Conference on Computational Linguistics. Nantes, France. Retrieved February 15, 2013.
- Fellbaum, C. (1998). WordNet: An Electronic Lexical Database. MIT Press.
- Hearst, M. (June 1994). "Multi-Paragraph Segmentation of Expository Text" (PDF). Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics. Las Cruces, NM. Retrieved February 15, 2013.
- "ACM Hypertext 2011 Keynotes". 22nd ACM Conference on Hypertext and Hypermedia. Association for Computing Machinery. 2011-06-06. Retrieved 2013-05-08.
- Tate, Ryan (2013-01-15). "Facebook Announces New Search Engine". Wired.com. Retrieved 2013-05-08.
- Hearst, Marti A. (2011-11-01). "'Natural' Search User Interfaces". Communications of the ACM, Vol. 54, No. 11. Association for Computing Machinery. pp. 60–67. Retrieved 2013-05-08.
- Isaac, Mike (2012-12-14). "Twitter Takes Big Data to School". AllThingsD. Retrieved 2013-05-08.
- Keen, Andrew (2012-05-12). "Keen On… Big Data: Why UC Berkeley Might Have An Edge Over Stanford [TCTV]". TechCrunch.com. Retrieved 2013-05-08.
- Yee, Christopher (2012-11-13). "Five Questions with Marti Hearst, 'Big Data' pioneer". The Daily Californian. University of California, Berkeley. Retrieved 2013-05-08.
- Hearst, M. (1995). "TileBars: Visualization of Term Distribution Information in Full Text Information Access" (PDF). Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems (CHI). Denver, CO. Retrieved February 15, 2013.
- Hearst, M. (September 2000). "Next Generation Web Search: Setting Our Sites" (PDF). In Gravano, Luis (ed.). IEEE Data Engineering Bulletin. Special issue on Next Generation Web Search. Retrieved February 15, 2013.
- Yee, K-P., Swearingen, K., Li, K., and Hearst, M. (2003). "Faceted Metadata Image Search and Browsing" (PDF). Proceedings of ACM CHI 2003. Retrieved February 15, 2013.
- "ACM Names Fellows for Computing Advances that Are Transforming Science and Society". Association for Computing Machinery. Archived 2014-07-22 at the Wayback Machine. Retrieved 2013-12-10.