Matei Zaharia

This is an old revision of this page, as edited by 2600:387:6:80d::9e (talk) at 21:44, 27 November 2016. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Matei Zaharia
Nationality: Romania
Citizenship: Canada
Alma mater: UC Berkeley (Ph.D.), University of Waterloo (B.Math.)
Known for: Apache Spark, Apache Mesos
Scientific career
Fields: Computer Science
Institutions: Stanford University
Thesis: An Architecture for Fast and General Data Processing on Large Clusters (2013)
Doctoral advisors: Ion Stoica, Scott Shenker
Website: people.csail.mit.edu/matei

Matei Zaharia is a Romanian-Canadian computer scientist specializing in big data, distributed systems, and cloud computing. He is a co-founder and Chief Technologist of Databricks and an assistant professor of computer science at Stanford University.[1] He created the Apache Spark project and co-created the Apache Mesos project during his Ph.D. at UC Berkeley. He also designed the core scheduling algorithms used in Apache Hadoop, including its most widely used fair scheduler.[2]

Biography

Matei Zaharia was born in Romania. His family later moved to Canada, where he attended Jarvis Collegiate Institute in Toronto for high school and studied computer science at the University of Waterloo. He received the Governor General’s Academic Silver Medal for highest academic standing upon graduation from the University of Waterloo. He went on to study at UC Berkeley, earning a Ph.D. in computer science in 2013 under the supervision of Ion Stoica and Scott Shenker.[3]

He participated in programming contests, winning two silver medals at the International Olympiad in Informatics (IOI) while in high school. He was on the University of Waterloo team that competed in the ACM ICPC programming competition in 2004 and 2005. The team placed 15th in 2004 and won a gold medal in 2005 (3rd place worldwide).[4] Both times his team earned the title of North American champions.[citation needed]

During his Ph.D. studies, he created the Apache Spark project and co-created the Apache Mesos project. He also designed and implemented the core scheduling algorithms used in Apache Hadoop.[5]

He received two Best Paper awards, at NSDI 2012 and SIGCOMM 2012, an Honorable Mention for the Community Award at NSDI 2012, and a Best Demo Award at SIGMOD 2012. Jointly with Reynold Xin, Parviz Deyhim, Xiangrui Meng, and Ali Ghodsi, he holds the 2014 world record in the Daytona GraySort benchmark, set using Apache Spark.[6] In 2015 he received the ACM Doctoral Dissertation Award.[7]

References

  1. "How Companies are Using Spark, and Where the Edge in Big Data Will Be". Strata Conference. Retrieved 26 August 2014.
  2. "Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling" (PDF).
  3. Zaharia, Matei. "An Architecture for Fast and General Data Processing on Large Clusters" (PDF). http://www.eecs.berkeley.edu. Retrieved 29 June 2015.
  4. "Programming Contest Resources".
  5. "Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling" (PDF).
  6. "Sort Benchmark".
  7. "ACM Doctoral Dissertation Award 2015".

External links