|Other names||Martha Alice Hearst|
|Alma mater||University of California, Berkeley|
|Known for||Hearst patterns|
|Thesis||Context and structure in automated full-text information access (1994)|
|Doctoral advisor||Robert Wilensky|
|Other academic advisors||Michael Stonebraker|
|Doctoral students||Cecilia R. Aragon|
Marti Hearst is a professor in the School of Information at the University of California, Berkeley. She did early work in corpus-based computational linguistics, including some of the first work in automating sentiment analysis, and word sense disambiguation. She invented an algorithm that became known as "Hearst patterns" which applies lexico-syntactic patterns to recognize hyponymy (ISA) relations with high accuracy in large text collections, including an early application of it to WordNet; this algorithm is widely used in commercial text mining applications including ontology learning. Hearst also developed early work in automatic segmentation of text into topical discourse boundaries, inventing a now well-known approach called TextTiling.
Hearst's research is on user interfaces for search engine technology and big data analytics. She did early work in user interfaces and information visualization for search user interfaces, inventing the TileBars query term visualization. Her Flamenco research project investigated and developed the now widely used faceted navigation approach for searching and browsing web sites and information collections. She wrote the first academic book on the topic of Search User Interfaces (Cambridge University Press, 2009).
Hearst is an Edge Foundation contributing author and a member of the Usage panel of the American Heritage Dictionary of the English Language.
Hearst received her B.A., M.S., and Ph.D. in Computer Science, all from U.C. Berkeley. In 2013 she became a fellow of the Association for Computing Machinery. She became a member of the CHI Academy in 2017, and has previously served as President of the Association for Computational Linguistics and on the Advisory Council of NSF's CISE Directorate. Additionally, she has been a member of the Web Board for CACM, the Usage Panel for the American Heritage Dictionary, the Edge.org panel of experts, the Research Staff at Xerox PARC, and the boards of ACM Transactions on the Web, Computational Linguistics, ACM Transactions on Information Systems, and IEEE Intelligent Systems.
Hearst has received an NSF CAREER award, an IBM Faculty Award, and an Okawa Foundation Fellowship. Her work on user interfaces has had a profound impact on the industry, earning Hearst two Google Research Awards and four Excellence in Teaching Awards. She has also led projects worth over $3.5M in research grants.
Hearst’s publications date back to 1990, when ‘A Hybrid Approach to Restricted Text Interpretation’ was published in Stanford University’s AAAI Spring Symposium on Text Based Intelligent Systems in March of that year.
- Hearst, M. (1992). Direction-Based Text Interpretation as an Information Access (in Text-Based Intelligent Systems). Lawrence Erlbaum.
- Hearst, M. (1991). "Noun Homograph Disambiguation using Local Context in Large Text Corpora" (PDF). Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora. Oxford. Retrieved February 15, 2013.
- Indurkhya, N., Damerau, F. (2010). Handbook of Natural Language Processing. Chapman & Hall/CRC. p. 594.CS1 maint: multiple names: authors list (link)
- "Automatic Acquisition of Hyponyms from Large Text Corpora" (PDF). Proceedings of the Fourteenth International Conference on Computational Linguistics. Nantes, France. 1992. Retrieved February 15, 2013.
- Fellbaum, C. (1998). WordNet: An Electronic Lexical Database. MIT Press.
- "Multi-Paragraph Segmentation of Expository Text" (PDF). Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics. 32nd Annual Meeting of the Association for Computational Linguistics. Las Cruces, NM. June 1994. Retrieved February 15, 2013.
- "ACM Hypertext 2011 Keynotes". 22nd ACM Conference on Hypertext and Hypermedia. Association for Computing Machinery. 2011-06-06. Archived from the original on 2016-06-04. Retrieved 2013-05-08.
- Tate, Ryan (2013-01-15). "Facebook Announces New Search Engine". Wired. Wired.com. Retrieved 2013-05-08.
- Hearst, Marti A. (2011-11-01). "'Natural' Search User Interfaces". Communications of the ACM, Vol. 54, No. 11. Association for Computing Machinery. pp. 60–67. Retrieved 2013-05-08.
- Isaac, Mike (2012-12-14). "Twitter Takes Big Data to School". AllThingsD. Retrieved 2013-05-08.
- Keen, Andrew (2012-05-12). "Keen On… Big Data: Why UC Berkeley Might Have An Edge Over Stanford [TCTV]". TechCrunch.com. Retrieved 2013-05-08.
- Yee, Christopher (2012-11-13). "Five Questions with Marti Hearst, 'Big Data' pioneer". The Daily Californian. University of California, Berkeley. Retrieved 2013-05-08.
- Hearst, M. (1995). "TileBars: Visualization of Term Distribution Information in Full Text Information Access" (PDF). Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems (CHI). ACM SIGCHI Conference on Human Factors in Computing Systems. Denver, CO. Retrieved February 15, 2013.
- Hearst, M. (September 2000). "Next Generation Web Search: Setting Our Sites" (PDF). In Gravano, Luis (ed.). In IEEE Data Engineering Bulletin. Special issue on Next Generation Web Search. Retrieved February 15, 2013.
- Yee, K-P., Swearingen, K., Li, K., and Hearst, M. (2003). "Faceted Metadata Image Search and Browsing" (PDF). in Proceedings of ACM CHI 2003. Retrieved February 15, 2013.CS1 maint: multiple names: authors list (link)
- Hearst, Martha Alice (1994). Context and structure in automated full-text information access (Ph.D. thesis). University of California, Berkeley. OCLC 33496523. ProQuest 304100421.
- ACM Names Fellows for Computing Advances that Are Transforming Science and Society Archived 2014-07-22 at the Wayback Machine, Association for Computing Machinery, accessed 2013-12-10.