Kaggle

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Kaggle
Industry Predictive modelling
Founded April 2010
Founders Anthony Goldbloom
Headquarters San Francisco, United States
Key people Anthony Goldbloom (CEO)
Max Levchin (Chairman)
Jeff Moser (CTO)
Website www.kaggle.com

Kaggle is a platform for predictive modelling and analytics competitions on which companies and researchers post their data and statisticians and data miners from all over the world compete to produce the best models. This crowdsourcing approach relies on the fact that there are countless strategies that can be applied to any predictive modelling task and it is impossible to know at the outset which technique or analyst will be most effective. Kaggle aims at making data science a sport.

How Kaggle competitions work[edit]

  1. The competition host prepares the data and a description of the problem. Kaggle offers a consulting service which can help the host do this, as well as frame the competition, anonymize the data, and integrate the winning model into their operations.
  2. Participants experiment with different techniques and compete against each other to produce the best models. For most competitions, submissions are scored immediately (based on their predictive accuracy relative to a hidden solution file) and summarized on a live leaderboard.
  3. After the deadline passes, the competition host pays the prize money in exchange for "a worldwide, perpetual, irrevocable and royalty free license [...] to use the winning Entry", i.e. the algorithm, software and related intellectual property developed, which is "non-exclusive unless otherwise specified".[1]

Alongside its public competitions, it also offers private competitions limited to Kaggle's top participants, and Kaggle-in-Class[2] for university groups.

Impact of Kaggle competitions[edit]

Kaggle has approximately 200,000 data scientists worldwide, from fields such as computer science, statistics, economics and mathematics.[3] It has partnered with organisations such as NASA, Wikipedia, Deloitte and Allstate for its competitions. Kaggle is best known as the platform hosting the $3 million Heritage Health Prize.[4] Another competition looked at improving gesture recognition for Microsoft Kinect.[5]

Competitions have resulted in many successful projects including furthering the state of the art in HIV research,[6] chess ratings[7] and traffic forecasting.[8] Several academic papers have been published on the basis of findings made in Kaggle competitions.[citation needed] A key to this is the effect of the live leaderboard, which encourages participants to continue innovating beyond existing best practice.[9] The winning methods are frequently written up on the Kaggle blog, No Free Hunch.

Financials[edit]

In November 2011, Kaggle announced a Series A funding round of $11 million from a number of high profile Silicon Valley investors. Index Ventures and Khosla Ventures led the round, while Max Levchin, the co-founder of PayPal, also took part and became Chairman of the Board. Another well-known investor is Hal Varian, Chief Economist at Google, who described Kaggle as "a way to organize the brainpower of the world’s most talented data scientists and make it accessible to organizations of every size".[10] Founded in Melbourne, Australia, Kaggle moved to San Francisco in 2011 and experienced a phase of rapid expansion following its fundraising.

See also[edit]

References[edit]

  1. ^ Kaggle. "Terms and Conditions - Kaggle". 
  2. ^ Kaggle. "Kaggle in Class". 
  3. ^ Kaggle. "About Us". 
  4. ^ "Healthy Competition". The Economist. April 10, 2011. Retrieved 3 May 2011. 
  5. ^ Byrne, Ciara (December 12, 2011). "Kaggle launches competition to help Microsoft Kinect learn new gestures". VentureBeat. Retrieved 13 December 2011. 
  6. ^ Carpenter, Jennifer (February 2011). "May the Best Analyst Win". Science Magazine. Retrieved 1 April 2011. 
  7. ^ Sonas, Jeff (20 February 2011). "The Deloitte/FIDE Chess Rating Challenge". Chessbase. Retrieved 3 May 2011. 
  8. ^ Foo, Fran (April 6, 2011). "Smartphones to predict NSW travel times?". The Australian. Retrieved 3 May 2011. 
  9. ^ Athanasopoulos, George; Hyndman, Rob (2011). "The Value of Feedback in Forecasting Competitions". International Journal of Forecasting 27. pp. 845–849. 
  10. ^ Rao, Leena (November 2, 2011). "Index And Khosla Lead $11M Round In Kaggle, A Platform For Data Modeling Competitions". TechCrunch. Retrieved 13 December 2011. 

Further reading[edit]

External links[edit]