Fifth normal form
Fifth normal form (5NF), also known as projection–join normal form (PJ/NF), is a level of database normalization designed to remove redundancy in relational databases recording multi-valued facts by isolating semantically related multiple relationships. A table is said to be in the 5NF if and only if every non-trivial join dependency in that table is implied by the candidate keys. It is the final normal form as far as removing redundancy is concerned.
Consider the following example:
|Traveling Salesman||Brand||Product Type|
|Jack Schneider||Acme||Vacuum Cleaner|
|Mary Jones||Robusto||Pruning Shears|
|Mary Jones||Robusto||Vacuum Cleaner|
|Mary Jones||Robusto||Umbrella Stand|
|Louis Ferguson||Robusto||Vacuum Cleaner|
|Louis Ferguson||Acme||Vacuum Cleaner|
|Louis Ferguson||Acme||Lava Lamp|
|Louis Ferguson||Nimbus||Tie Rack|
The table's predicate is: Products of the type designated by Product Type, made by the brand designated by Brand, are available from the traveling salesman designated by Traveling Salesman.
The primary key is the composite of all three columns. Also note that the table is in 4NF, since there are no multivalued dependencies (2-part join dependencies) in the table: no column (which by itself is not a candidate key or a superkey) is a determinant for the other two columns.
In the absence of any rules restricting the valid possible combinations of Traveling Salesman, Brand, and Product Type, the three-attribute table above is necessary in order to model the situation correctly.
Suppose, however, that the following rule applies: A Traveling Salesman has certain Brands and certain Product Types in their repertoire. If Brand B1 and Brand B2 are in their repertoire, and Product Type P is in their repertoire, then (assuming Brand B1 and Brand B2 both make Product Type P), the Traveling Salesman must offer products of Product Type P those made by Brand B1 and those made by Brand B2.
In that case, it is possible to split the table into three:
|Traveling Salesman||Product Type|
|Jack Schneider||Vacuum Cleaner|
|Mary Jones||Pruning Shears|
|Mary Jones||Vacuum Cleaner|
|Mary Jones||Umbrella Stand|
|Louis Ferguson||Vacuum Cleaner|
|Louis Ferguson||Lava Lamp|
|Louis Ferguson||Tie Rack|
In this case, it's impossible for Louis Ferguson to refuse to offer Vacuum Cleaners made by ACME (assuming ACME makes Vacuum Cleaners) if he sells anything else made by Acme (Lava Lamp) and he also sells Vacuum Cleaners made by any other brand (Robusto).
Note how this setup helps to remove redundancy. Suppose that Jack Schneider starts selling Robusto's products Breadboxes and Vacuum Cleaners. In the previous setup we would have to add two new entries one for each product type (<Jack Schneider, Robusto, Breadboxes>, <Jack Schneider, Robusto, Vacuum Cleaners>). With the new setup we need to add only a single entry (<Jack Schneider, Robusto>) in Brands By Traveling Salesman.
Only in rare situations does a 4NF table not conform to 5NF. For instance, when the decomposed tables are cyclic. These are situations in which a complex real-world constraint governing the valid combinations of attribute values in the 4NF table is not implicit in the structure of that table. If such a table is not normalized to 5NF, the burden of maintaining the logical consistency of the data within the table must be carried partly by the application responsible for insertions, deletions, and updates to it; and there is a heightened risk that the data within the table will become inconsistent. In contrast, the 5NF design excludes the possibility of such inconsistencies.
A table T is in fifth normal form (5NF) or projection-join normal form (PJ/NF) if it cannot have a lossless decomposition into any number of smaller tables. The case where all the smaller tables after the decomposition have the same candidate key as the table T is excluded.
- Why is the 5NF the final normal form?
- See the Anchor Modeling website for a website that describes a data warehouse modelling method based on the sixth normal form
- Analysis of normal forms for anchor-tables
- S. Krishna (1991). Introduction to Data Base and Knowledge Base Systems. ISBN 9810206208.
The fifth normal form was introduced by Fagin
- Kent, W. (1983) A Simple Guide to Five Normal Forms in Relational Database Theory, Communications of the ACM, vol. 26, pp. 120–125
- Date, C.J., & Darwen, H., & Pascal, F. Database Debunkings
- Darwen, H.; Date, C. J.; Fagin, R. (2012). "A normal form for preventing redundant tuples in relational databases". Proceedings of the 15th International Conference on Database Theory - ICDT '12 (PDF). pp. 114–126. doi:10.1145/2274576.2274589. ISBN 9781450307918.