Jump to content

Freebase (database): Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
m Fixing "Pages with citations using unsupported parameters" error
m Replaced png logo with svg version
Line 2: Line 2:
{{Infobox website
{{Infobox website
| name = Freebase
| name = Freebase
| logo = [[Image:Freebase-logo.png]]
| logo = [[File:Freebase Logo optimised.svg|thumb]]
|logocaption =
|logocaption =
|screenshot =
|screenshot =

Revision as of 22:45, 13 November 2014

Template:Two other uses

Freebase
Type of site
Online database
Available inEnglish
OwnerMetaweb Technologies (Google)
URLwww.freebase.com
CommercialNo
RegistrationOptional

Freebase is a large collaborative knowledge base consisting of metadata composed mainly by its community members. It is an online collection of structured data harvested from many sources, including individual, user-submitted wiki contributions.[2] Freebase aims to create a global resource which allows people (and machines) to access common information more effectively. It was developed by the American software company Metaweb and has been running publicly since March 2007. Metaweb was acquired by Google in a private sale announced July 16, 2010.[3] Google's Knowledge Graph is powered in part by Freebase.[4]

Freebase data is freely available for commercial and non-commercial use under a Creative Commons Attribution License, and an open API, RDF endpoint, and database dump are provided for programmers.

Overview

On March 3, 2007, Metaweb publicly announced Freebase, described by the company as "an open shared database of the world's knowledge", and "a massive, collaboratively edited database of cross-linked data." Often understood as database model using Wikipedia-turned-database or entity-relationship model, Freebase provides an interface that allows non-programmers to fill in structured data, or metadata, of general information, and to categorize or connect data items in meaningful, semantic ways.

Described by Tim O'Reilly upon their launch, "Freebase is the bridge between the bottom up vision of Web 2.0 collective intelligence and the more structured world of the semantic web."[5]

Freebase contains data harvested from sources such as Wikipedia, NNDB, FMD and MusicBrainz, as well as individually contributed data from its users. The structured data is licensed under the Creative Commons Attribution License, [5] and a JSON-based HTTP API is provided to programmers for developing applications on any platform to utilize the Freebase data. The source code for the Metaweb application itself is proprietary.

Freebase runs on a database infrastructure created in-house by Metaweb that uses a graph model. This means that instead of using tables and keys to define data structures, Freebase defines its data structure as a set of nodes and a set of links that establish relationships between the nodes. Because its data structure is non-hierarchical, Freebase can model much more complex relationships between individual elements than a conventional database[citation needed], and is open for users to enter new objects and relationships into the underlying graph. Queries to the database are made in Metaweb Query Language (MQL) and served by a triplestore called graphd.[6]

Development

Danny Hillis first described his idea for creating a knowledge web he called Aristotle in a paper in 2000.[7] But he said he did not try to build the system until he had recruited two technical experts as co-founders. Robert Cook, in parallel computing and database design, is Metaweb's executive vice president for product development. John Giannandrea, formerly chief technologist at Tellme Networks and chief technologist of the Web browser group at Netscape/AOL, is the company's chief technology officer.[8]

Originally accessible by invitation only, Freebase opened full anonymous read access to the public in its alpha stage of development, and now requires registration only for data contributions.

On October 29, 2008, at the International Semantic Web Conference 2008, Freebase released its RDF service for generating RDF representations of Freebase topics, allowing Freebase to be used as linked data.[9]

Organization and policy

Freebase's subjects are called topics and the data stored about them depend on their type, how they are classified. For example, an entry for Arnold Schwarzenegger, the former Governor of California, would be entered as a topic that would include a variety of types describing him as an actor, bodybuilder, and politician.[10] As of January 2014, Freebase has approximately 44 million topics and 2.4 billion facts.[11]

Freebase's ontologies (structured categories), known in Freebase as "types", are themselves user-editable.[5] Each type has a number of defined predicates, called "properties".

[U]nlike the W3C approach to the semantic web, which starts with controlled ontologies, Metaweb adopts a folksonomy approach, in which people can add new categories (much like tags), in a messy sprawl of potentially overlapping assertions.[5]

In this manner, Freebase differs from the wiki model in many ways. Users can create their own types, but these types aren't adopted in the "public commons" until promoted by a Metaweb employee. As well, users cannot modify each other's types. The reason Freebase can't open up permissions of schemas is because external apps rely on them; thus changing a type's schema, for instance by deleting a property or changing a simple property, might break queries for API users and even within Freebase itself, in saved views, for example.

Metaweb promotes some users to "expert status", similar to Wikipedia's administrator policy, who are given some admin permissions.[12]

The underlying data storage supports multilingual data, but as of 2011 every user's display language was set to English. This will change at some point.[citation needed]

As of 2011, the only access is via Metaweb Query Language (MQL).[13]

Business and community

The Freebase system is built and patented[14][15] by Metaweb, a for-profit company,[16] that has been acquired by Google.[17]

In terms of Freebase's relationship with the open data community:

...we have no formal relationship with other open data projects. Though the definition of open data is pretty loose, we try to follow general open data principles by not restricting access to Freebase information to registered users, charging users to access our information, imposing restrictive licenses over the use of Freebase information, or using proprietary or closed technology as a barrier to accessing Freebase information.[18]

Freebase is planning formal mappings of some of their types to established ontologies like FOAF, though this is not a priority.[9]

In the future, the company hopes to also generate profit by organizing proprietary data.[19]

Criticism

Bulk import tools
They are used internally at Metaweb, but the reconciliation process for imported data has so far proved too complicated for public release, and public bulk tools are very limited.[20]
Multilingual implementation
Freebase has translations (or translation support) of many of its topics, but its types are currently implemented (or at least described) in natural language English, leading to challenges in developing a universal schema.

See also

References

  1. ^ "Freebase - Alexa". Alexa.com. Alexa Internet, Inc. Retrieved 15 May 2014.
  2. ^ Markoff, John (2007-03-09). "Start-up Aims for Database to Automate Web Searching". The New York Times. Retrieved 2007-03-09.
  3. ^ Menzel, Jack (July 16, 2010). "Deeper Understanding with Metaweb". Google Official Blog. Retrieved September 6, 2014.
  4. ^ "Introducing the Knowledge Graph: Things, Not Strings]". Google Official Blog. May 16, 2012. Retrieved September 6, 2014. {{cite web}}: |first= missing |last= (help); Missing pipe in: |first= (help)
  5. ^ a b c d O'Reilly, Tim (March 8, 2007). "Freebase Will Prove Addictive". O'Reilly Radar. O'Reilly Media. Archived from the original on October 14, 2008. Retrieved September 6, 2014.
  6. ^ Meyer, Scott (April 8, 2008). "A Brief Tour of Graphd". Blog.Freebase.com. Archived from the original on May 30, 2012. Retrieved September 6, 2014.
  7. ^ Hillis, W. Daniel (2000). "Aristotle (The Knowledge Web)". Retrieved January 20, 2013.
  8. ^ Markoff, John (2007-03-09). "Start-Up Aims for Database to Automate Web Searching". NYTimes.com. The New York Times Company. Retrieved 2009-02-07.
  9. ^ a b Taylor, Jamie (October 30, 2008). "Introducing the Freebase RDF service". Archived from the original on May 16, 2012. Retrieved September 6, 2014.
  10. ^ "Arnold Schwarzenegger". Freebase. Retrieved 2014-02-14.
  11. ^ "Explore Freebase Data". Freebase.com. Retrieved 2013-02-14.
  12. ^ Robert, Kirrily "Skud" (August 29, 2008). "Freebase's New Experts Programme". Blog.Freebase.com. Archived from the original on February 17, 2012. Retrieved September 6, 2014.
  13. ^ Maden, Christopher R. "crism" (December 4, 2007). "Internationalization in Freebase". Blog.Freebase.com. Archived from the original on March 9, 2012. Retrieved September 6, 2014.
  14. ^ "Patent US7502770". Google Patents. Retrieved 2014-02-14.
  15. ^ US 7502770, Hillis, Daniel & Ferren, Bran, "Patent US7502770", published 2009-03-10 
  16. ^ "Investors". Metaweb.com. Retrieved 2009-01-02.
  17. ^ "Google Makes Major Semantic Web Play, Acquires Freebase Operators Metaweb". ReadWrite. 2010-07-16. Retrieved 2013-12-06.
  18. ^ "FAQ". Freebase.com. Retrieved 2009-01-02.
  19. ^ "Sharing What Matters – Software: A computing maverick hopes to upgrade the web, transforming it from a document collection into a data commons". Monitor. The Economist Technology Quarterly. June 7, 2007. Retrieved September 6, 2014.
  20. ^ Culbertson, Bryan (December 14, 2007). "List Importer". Blog.Freebase.com. Archived from the original on April 17, 2012. Retrieved September 6, 2014.