Transitive data skew
Appearance
This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these messages)
|
In distributed computing problems, transitive data skew is an issue of data synchronization.
It arises with the uneven distribution of otherwise evenly distributed data across a number of devices while the data is in transition. If sorted data is being distributed across multiple devices and the column on which that data is sorted is the "key" used to identify the target device, the resulting transitive data skew may be self-correcting.