Jump to content

Extract, load, transform

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Gregorywizard (talk | contribs) at 20:16, 17 December 2015. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

ELT is an alternative to Extract, transform, load (ETL) used with data lake implementations. In ELT models the data is not processed on entry to the data lake which enables faster loading times. But does require sufficient processing within the data processing engine to carry out the transform on demand and return the results to the consumer in a timely manner. Since the data is not processed on entry to the data lake the query and schema do not need to be defined a-priori (often the schema will be available during load since many data sources are extracts from databases or similar structured data systems and hence have an associated schema).