XLDB (eXtremely Large Data Bases) is a yearly conference about data processing. "Extremely large" refers to data sets that are too big in terms of volume (too much), and/or velocity (too fast), and/or variety (too many places, too many formats) to be handled using conventional solutions.
In October 2007, experts in extremely large databases gathered at SLAC for the First Workshop on Extremely Large Databases, and the XLDB research community was formed as a result. To meet rapidly growing demand, an open conference, tutorials, and annual satellite events on different continents were added to the original invitational workshop. The main event, held annually at Stanford, gathers over 300 technically savvy attendees. XLDB is one of the premier database events catering to both academic and industrial communities.
The main goals of this community include:
- Identify trends, commonalities and major roadblocks related to building extremely large databases
- Bridge the gap between users trying to build extremely large databases and database solution providers worldwide
- Facilitate development and growth of practical technologies for extremely large data stores
As of 2013, the community consisted of about a thousand members including:
- Scientists from laboratories who develop, use, or plan to develop or use XLDB for their research.
- Commercial users of XLDB.
- Providers of database products, including commercial vendors and representatives from open source database communities.
- Academic database researchers.
XLDB Conferences, Workshops and Tutorials
The community meets annually at Stanford, where the main event is held each fall, usually in September. Those who live too far from California to attend can attend satellite events, organized annually around May or June in either Asia or Europe.
A detailed report is produced after each workshop.
|Year||Location||Event|
|2017||Clermont-Ferrand||10th XLDB Conference|
|2016||Stanford||9th XLDB Conference|
|2015||Stanford||8th XLDB Conference|
|2014||Observatório Nacional, Rio de Janeiro||Satellite XLDB Workshop in South America|
|2013||Stanford||7th XLDB Conference|
|2013||CERN, Geneva, Switzerland||Satellite XLDB Workshop in Europe|
|2012||Stanford||6th XLDB Conference, Workshop & Tutorials|
|2012||Beijing, China||Satellite XLDB Conference in Asia|
|2011||SLAC||5th XLDB Conference and Workshop|
|2011||Edinburgh, UK||Satellite XLDB Workshop in Europe|
|2010||SLAC||4th XLDB Conference and Workshop|
|2009||Lyon, France||3rd XLDB Workshop|
|2008||SLAC||2nd XLDB Workshop|
|2007||SLAC||1st XLDB Workshop|
The XLDB organizers started defining a science benchmark for scientific data management systems called SS-DB.
In 2012, the XLDB organizers announced that two major databases that support arrays as first-class objects (MonetDB SciQL and SciDB) had formed a working group in conjunction with XLDB. This working group is proposing a common syntax (provisionally named “ArrayQL”) for manipulating arrays, including array creation and query.