Freebase

From Wikipedia, the free encyclopedia
Jump to: navigation, search
For the chemistry term, see free base. For the music supergroup, see Freebass.
Freebase
Freebase-logo.png
Web address www.freebase.com
Commercial? No
Type of site Online database
Registration Optional
Available in English
Content license Creative Commons Attribution License
Owner Metaweb Technologies (Google)
Alexa rank negative increase 18,151 (May 2014)[1]
Current status Active

Freebase is a large collaborative knowledge base consisting of metadata composed mainly by its community members. It is an online collection of structured data harvested from many sources, including individual 'wiki' contributions.[2] Freebase aims to create a global resource which allows people (and machines) to access common information more effectively. It was developed by the American software company Metaweb and has been running publicly since March 2007. Metaweb was acquired by Google in a private sale announced July 16, 2010.[3] Google's Knowledge Graph is powered in part by Freebase.[4]

Freebase data is freely available for commercial and non-commercial use under a Creative Commons Attribution License, and an open API, RDF endpoint, and database dump are provided for programmers.

Overview[edit]

On March 3, 2007, Metaweb publicly announced Freebase, described by the company as "an open shared database of the world's knowledge," and "a massive, collaboratively edited database of cross-linked data." Often understood as database model using Wikipedia-turned-database or entity-relationship model, Freebase provides an interface that allows non-programmers to fill in structured, or 'meta-data', of general information, and to categorize or connect data items in meaningful, or 'semantic' ways.

Described by Tim O'Reilly upon their launch, "Freebase is the bridge between the bottom up vision of Web 2.0 collective intelligence and the more structured world of the semantic web."[5]

Freebase contains data harvested from sources such as Wikipedia, NNDB, FMD and MusicBrainz, as well as individually contributed data from its users. The structured data is licensed under the Creative Commons Attribution License, [5] and a JSON-based HTTP API is provided to programmers for developing applications on any platform to utilize the Freebase data. The source code for the Metaweb application itself is proprietary.

Freebase runs on a database infrastructure created in-house by Metaweb that uses a graph model. This means that instead of using tables and keys to define data structures, Freebase defines its data structure as a set of nodes and a set of links that establish relationships between the nodes. Because its data structure is non-hierarchical, Freebase can model much more complex relationships between individual elements than a conventional database[citation needed], and is open for users to enter new objects and relationships into the underlying graph. Queries to the database are made in Metaweb Query Language (MQL) and served by a triplestore called graphd.[6]

Development[edit]

Danny Hillis first described his idea for creating a knowledge web he called Aristotle in a paper in 2000.[7] But he said he did not try to build the system until he had recruited two technical experts as co-founders. Robert Cook, in parallel computing and database design, is Metaweb’s executive vice president for product development. John Giannandrea, formerly chief technologist at Tellme Networks and chief technologist of the Web browser group at Netscape/AOL, is the company’s chief technology officer.[8]

Originally accessible by invitation only, Freebase opened full anonymous read access to the public in its alpha stage of development, and now requires registration only for data contributions.

On October 29, 2008, at the International Semantic Web Conference 2008, Freebase released its RDF service for generating RDF representations of Freebase topics, allowing Freebase to be used as Linked Data.[9]

Organization and policy[edit]

Freebase's subjects are called topics and the data stored about them depend on their type, how they are classified. For example, an entry for Arnold Schwarzenegger, the former Governor of California, would be entered as a topic that would include a variety of types describing him as an actor, bodybuilder, and politician.[10] As of January 2014, Freebase has approximately 44 million topics and 2.4 billion facts.[11]

Freebase's ontologies (structured categories), known in Freebase as "types" — are themselves user-editable.[5] Each type has a number of defined predicates, called "properties".

[U]nlike the W3C approach to the semantic web, which starts with controlled ontologies, Metaweb adopts a folksonomy approach, in which people can add new categories (much like tags), in a messy sprawl of potentially overlapping assertions.[5]

In this manner, Freebase differs from the wiki model in many ways. Users can create their own types, but these types aren't adopted in the 'public commons' until promoted by a Metaweb employee. As well, users cannot modify each other's types. The reason Freebase can't open up permissions of schemas is because external apps rely on them; thus changing a type's schema, for instance by deleting a property or changing a simple property, might break queries for API users and even within Freebase itself - in saved views, for example.

Metaweb promotes some users to expert status, similar to Wikipedia's administrator policy, who are given some admin permissions.[12]

The underlying data storage supports multilingual data, but as of 2011 every user’s display language was forcibly set to English. This will change at some point.[citation needed]

As of 2011, the only access is via Metaweb Query Language (MQL).[13]

Business and community[edit]

The Freebase system is built and patented[14][15] by Metaweb, a for-profit company,[16] that has been acquired by Google.[17]

In terms of Freebase's relationship with the open data community:

...we have no formal relationship with other open data projects. Though the definition of open data is pretty loose, we try to follow general open data principles by not restricting access to Freebase information to registered users, charging users to access our information, imposing restrictive licenses over the use of Freebase information, or using proprietary or closed technology as a barrier to accessing Freebase information.[18]

Freebase is planning formal mappings of some of their types to established ontologies like FOAF, though this is not a priority.[19]

In the future, the company hopes to also generate profit by organizing proprietary data.[20]

Criticism[edit]

Bulk import tools
They are used internally at Metaweb, but the reconciliation process for imported data has so far proved too complicated for public release, and public bulk tools are very limited.[21]
Multilingual implementation
Freebase has translations (or translation support) of many of its topics, but its types are currently implemented (or at least described) in natural language English, leading to challenges in developing a universal schema.

See also[edit]

References[edit]

  1. ^ "Freebase - Alexa". Alexa Internet, Inc. Retrieved 15 May 2014. 
  2. ^ Markoff, John (2007-03-09). "Start-Up Aims for Database to Automate Web Searching". The New York Times. Retrieved 2007-03-09. 
  3. ^ Deeper understanding with Metaweb, Google Blog, 16 July 2010
  4. ^ Introducing the Knowledge Graph: things, not strings, Google Blog, 16 May 2012
  5. ^ a b c d "Freebase Will Prove Addictive". O'Reilly Radar. 2007-03-08. Retrieved 2007-03-09. 
  6. ^ "a-brief-tour-of-graphd". Retrieved 2009-07-08. 
  7. ^ Hillis, W. Daniel (2000). "Aristotle (The Knowledge Web)". Retrieved 20 Jan 2013. 
  8. ^ Markoff, John (2007-03-09). "Start-Up Aims for Database to Automate Web Searching". nytimes. Retrieved 2009-02-07. 
  9. ^ "Introducing the Freebase RDF service". Retrieved 2009-02-19. 
  10. ^ "Arnold Schwarzenegger". Freebase. Retrieved 2014-02-14. 
  11. ^ "Explore Freebase Data". Freebase. Retrieved 2013-02-14. 
  12. ^ "new-experts-programme". Retrieved 2009-02-07. 
  13. ^ Posted by crism on December 4th, 2007 (2007-12-04). "The Freebase Blog » Blog Archive » Internationalization in Freebase". Blog.freebase.com. Retrieved 2012-02-08. 
  14. ^ "Patent US7502770". Google Patents. Retrieved 2014-02-14. 
  15. ^ US 7502770, Hillis, Daniel, "Patent US7502770", published 2009-03-10 
  16. ^ "investors". Retrieved 2009-01-02. 
  17. ^ "Google Makes Major Semantic Web Play, Acquires Freebase Operators Metaweb". ReadWrite. 2010-07-16. Retrieved 2013-12-06. 
  18. ^ "faq". Retrieved 2009-01-02. 
  19. ^ "introducing_the_rdf_service/". Retrieved 2009-02-07. 
  20. ^ "Sharing what matters". The Economist. 2007-06-07. Retrieved 2007-06-15. 
  21. ^ "blog". Retrieved 2009-01-02. 

External links[edit]