Jump to content

BigQuery

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by MichaelHausenblas (talk | contribs) at 05:34, 23 June 2011 (External links). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

BigQuery
Type of site
Infrastructure as a service
Available inEnglish
OwnerGoogle
URLcode.google.com/apis/bigquery
RegistrationRequired

BigQuery for Developers is a RESTful web service that enables interactive analysis of massively large datasets working in conjunction with Google Storage. It is an Infrastructure as a Service (IaaS) that may be used complementary with MapReduce.

Design

BigQuery (BQ) is reportedly based on Dremel[1], a scalable, interactive ad-hoc query system for analysis of read-only nested data. To use the data in BigQuery, it first must be uploaded to Google Storage and in a second step imported using the BigQuery HTTP API. BigQuery requires all requests to be authenticated, supporting a number of Google-proprietary mechanisms as well as OAuth.

Features

  • Managing data - create and delete tables based on a JSON-encoded schema, import data from Google Storage.
  • Query - the queries are expressed in a SQL dialect and the results are returned in JSON with a maximum reply length of approximately 4 MB (currently, there is no support for paging through results that exceed this limit). BigQuery does not currently support joins.
  • Integration - BigQuery can be used from Google Apps Script and Google Spreadsheets.
  • Access Control - is done via Google Storage.

Notes

  1. ^ Sergey Melnik, Andrey Gubarev, Jing Jing Long, Geoffrey Romer, Shiva Shivakumar, Matt Tolton, Theo Vassilakis (2010). "Dremel: Interactive Analysis of Web-Scale Datasets". Proc. of the 36th International Conference on Very Large Data Bases (VLDB).{{cite web}}: CS1 maint: multiple names: authors list (link)

References