Chris Van Pelt
|Headquarters||San Francisco, California, United States|
CrowdFlower is a data enrichment, data mining and crowdsourcing company based in the Mission District of San Francisco, California. The company's software as a service platform allows users to access an online workforce of millions of people to clean, label and enrich data. CrowdFlower is typically used by data scientists at academic institutions, start-ups and large enterprises.
|This section requires expansion. (May 2014)|
CrowdFlower was founded in 2007 by Lukas Biewald and Chris Van Pelt, as "Dolores Labs." CrowdFlower received $1,200,000 in seed funding in March 2009 from K9 Ventures, Quest Venture Partners, Gary Kremen, FF Angel, and Uber CEO, Travis Kalanick, among others. In January 2010, CrowdFlower raised a $5,000,000 Series A that included Bessemer Venture Partners, Trinity Ventures, and Founders Fund. In March 2011, CrowdFlower raised a $9,300,000 Series B followed by a $12,500,000 Series C in September 2014, this time led by Canvas Venture Fund.
CrowdFlower cleans up messy and incomplete data using an online workforce of millions of people. Typical users of CrowdFlower are data scientists who utilize the software to create training data to build models and train machine learning algorithms.
The platform allows users to distribute work to contributors in the U.S. and 153 other countries while maintaining quality and controlling costs. On a continuous basis, these contributors discover work on online job boards and decide what they're going to work on based on how interesting it is, how much work is available and how much the job compensates them. These jobs can include analyzing the sentiment of tweets on a brand or hashtag, scoring relevance for search queries and results of an e-commerce website or moderating user generated content.
Once data is uploaded to the platform, the system automatically allocates the work to contributors and tests them against known answers hidden within the task (what CrowdFlower refers to as a "job" ). The way in which contributors perform on these hidden test questions calibrates how much the system trusts them on an individual level. As long as contributors remain trusted they're allowed to continue working on a given job. If they become untrusted, they're removed from the job and all of their work is disregarded. Multiple contributor judgments are collected and an aggregate answer with an associated confidence score (agreement of the contributors weighted by the trust of each contributor) is provided as a result - effectively returning the "most trusted judgment," for a given unit of data.
- Researchers at the Harvard Tuberculosis Lab used the it to identify drug-resistant TB cells.
- After the 2010 Haiti earthquake, the company helped to route text messages to the proper aid workers, to get them translated, and to ensure that the people sending the texts had a chance of getting what they needed.
- Similar relief efforts were handled after the 2010 Pakistan floods.
- In 2009, the company worked with Samasource to provide work for refugees in Kenya who completed microtasks; iPhone users donated their time by checking for accuracy through Give Work, an app.
- Adrienne Burke (2011-10-26). "Crowdsourcing Scientific Progress: How Crowdflower's Hordes Help Harvard Researchers Study TB". Forbes. Retrieved 2012-01-31.
- "Crowdsourcing the Haiti Relief | The CrowdFlower Blog". Blog.crowdflower.com. 2010-01-29. Retrieved 2012-01-31.
- "How To Cope with Very Large Volumes of Crowdsourced Reports? Add More Crowd!". The Ushahidi Blog. Retrieved 2012-01-31.
- Oshiro, Dana (2009-10-13). "Samasource / CrowdFlower iPhone App Helps Refugees Fight Poverty". Readwriteweb.com. Retrieved 2012-01-31.