Unix File System
Revision as of 06:33, 11 April 2008
Developer(s) | CSRG |
---|---|
Full name | UNIX file system |
Introduced | with 4.2BSD |
The Unix file system (UFS) is a file system used by many Unix and Unix-like operating systems. It is also called the Berkeley Fast File System, the BSD Fast File System or FFS. It is a distant descendant of the original filesystem used by Version 7 Unix.
Design
A UFS volume is composed of the following parts:
- a few blocks at the beginning of the partition reserved for boot blocks (which must be initialized separately from the filesystem)
- a superblock, containing a magic number identifying this as a UFS filesystem, and some other vital numbers describing this filesystem's geometry and statistics and behavioral tuning parameters
- a collection of cylinder groups. Each cylinder group has the following components:
  - a backup copy of the superblock
  - a cylinder group header, with statistics, free lists, etc., about this cylinder group, similar to those in the superblock
  - a number of inodes, each containing file attributes
  - a number of data blocks
Inodes are numbered sequentially. The first several inodes are reserved for historical reasons, followed by the inode for the root directory.
Directory files contain only the list of filenames in the directory and the inode associated with each file. All file metadata is kept in the inode.
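The split between directory entries and inodes described above can be illustrated with a minimal sketch. The `Inode` fields, the table contents, and the root inode number used here are assumptions chosen for illustration, not the on-disk UFS format.

```python
from dataclasses import dataclass, field

ROOT_INO = 2  # assumed root inode number, for illustration only


@dataclass
class Inode:
    """All file metadata lives in the inode, not in the directory."""
    mode: int   # file type and permission bits
    uid: int    # owner
    size: int   # length in bytes
    blocks: list = field(default_factory=list)  # data block numbers


# Hypothetical inode table, keyed by inode number.
inode_table = {
    ROOT_INO: Inode(mode=0o040755, uid=0, size=512, blocks=[100]),
    12: Inode(mode=0o100644, uid=1000, size=1300, blocks=[880, 881]),
}

# A directory file holds only (filename -> inode number) pairs.
root_dir = {".": ROOT_INO, "..": ROOT_INO, "notes.txt": 12}


def lookup(directory, name):
    """Resolve a name to its inode number, then fetch the inode."""
    ino = directory[name]
    return ino, inode_table[ino]


ino, inode = lookup(root_dir, "notes.txt")
print(ino, oct(inode.mode), inode.size)
```

Because the directory stores nothing but names and inode numbers, renaming a file changes only the directory entry, while the inode and its data blocks stay where they are.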
History and evolution
Early versions of Unix used filesystems referred to simply as FS. FS only included the boot block, superblock, a clump of inodes, and the data blocks. This worked well for the small disks early Unixes were designed for, but as technology advanced and disks got larger, moving the head back and forth between the clump of inodes and the data blocks they referred to caused thrashing. BSD optimized this in FFS (Fast File System) by inventing cylinder groups, breaking the disk up into smaller chunks, each with its own inode clump and data blocks.
The intent of BSD FFS is to try to localize associated data blocks and metadata in the same cylinder group, and ideally, all of the contents of a directory (both data and metadata for all the files) in the same or nearby cylinder group, thus reducing fragmentation caused by scattering a directory's contents over a whole disk.
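The locality goal can be sketched as a placement rule: put a new file's inode in the cylinder group of its parent directory, and fall back to the nearest group that still has a free inode. This is a simplified illustration, not the actual FFS allocator; `INODES_PER_GROUP`, the function names, and the nearest-group fallback are all assumptions.

```python
INODES_PER_GROUP = 1024  # hypothetical; real volumes fix this at creation time


def group_of_inode(ino):
    """Inodes are numbered sequentially, so the group is a simple division."""
    return ino // INODES_PER_GROUP


def pick_group_for_file(parent_dir_ino, free_inodes):
    """Prefer the parent directory's group; otherwise take the closest
    group (by index) that still has a free inode.

    free_inodes[g] is the number of free inodes in cylinder group g.
    """
    home = group_of_inode(parent_dir_ino)
    candidates = [g for g, free in enumerate(free_inodes) if free > 0]
    if not candidates:
        raise OSError("no free inodes on this volume")
    # Sort key: distance from the home group, then lowest index on ties.
    return min(candidates, key=lambda g: (abs(g - home), g))


# Parent directory inode 1500 lives in group 1.
print(pick_group_for_file(1500, [10, 3, 5, 7]))  # group 1 has room
print(pick_group_for_file(1500, [10, 0, 5, 7]))  # group 1 is full
```

Keeping a file's inode and data in the parent's group means that listing or reading a directory touches a small region of the disk instead of seeking across all of it.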
Some of the performance parameters in the superblock included number of tracks and sectors, disk rotation speed, head speed, and alignment of the sectors between tracks. In a fully optimized system, the head could be moved between close tracks to read scattered sectors from alternating tracks while waiting for the platter to spin around.
As disks grew larger, sector-level optimization became obsolete (especially with disks that used linear sector numbering and a variable number of sectors per track). With larger disks and larger files, fragmented reads became more of a problem. To combat this, BSD first increased the filesystem block size from one sector to 1 KB in 4.0BSD and then, in FFS, from 1 KB to 8 KB. This has several effects. The chance that a file's sectors are contiguous is much greater. The overhead of listing the file's blocks is reduced. The amount of data addressable with a fixed-width block number is increased (allowing for larger disks).
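The effect of block size on per-file overhead and on addressable volume size can be checked with a little arithmetic. The sketch below assumes a 512-byte sector and a 32-bit block number; both values are assumptions for illustration, since the text only says "one sector" and "fixed bit width".

```python
KIB = 1024
SECTOR = 512       # assumed sector size
POINTER_BITS = 32  # assumed block-number width


def blocks_needed(file_size, block_size):
    """Number of block pointers required to list a file (ceiling division)."""
    return -(-file_size // block_size)


def max_volume_bytes(block_size, pointer_bits=POINTER_BITS):
    """Bytes addressable when every block number fits in pointer_bits."""
    return (2 ** pointer_bits) * block_size


file_size = 1024 * KIB  # a 1 MiB file
for block_size in (SECTOR, 1 * KIB, 8 * KIB):
    print(
        f"block={block_size:5d} B  "
        f"pointers={blocks_needed(file_size, block_size):5d}  "
        f"max volume={max_volume_bytes(block_size) // 2**40} TiB"
    )
```

Under these assumptions, going from 512-byte to 8 KB blocks cuts the number of block pointers for the same file by a factor of 16 and raises the addressable volume size by the same factor.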
With larger block sizes, disks with many small files would waste a lot of space, so BSD added block level fragmentation[1] (also called block suballocation, tail merging or tail packing), where the last partial block of data from several files may be stored in a single "fragment" block instead of multiple mostly empty blocks (Allen 2005).
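The space saved by storing file tails in fragments can be estimated with a short calculation. This sketch assumes 8 KB blocks split into 1 KB fragments and an invented list of file sizes; it models only the space accounting, not how fragments from different files are packed into a shared block.

```python
BLOCK = 8192  # block size from the text (8 KB)
FRAG = 1024   # assumed fragment size (8 fragments per block)


def allocated_bytes(file_size, block_size=BLOCK, frag_size=None):
    """Disk space consumed by one file.

    Without fragments the tail occupies a whole block; with fragments it
    occupies only as many fragments as it needs.
    """
    full_blocks, tail = divmod(file_size, block_size)
    if tail == 0:
        return full_blocks * block_size
    if frag_size is None:
        return (full_blocks + 1) * block_size
    tail_frags = -(-tail // frag_size)  # ceiling division
    return full_blocks * block_size + tail_frags * frag_size


sizes = [300, 1500, 9000, 20000]  # hypothetical file sizes in bytes
data = sum(sizes)
whole_blocks = sum(allocated_bytes(s) for s in sizes)
with_frags = sum(allocated_bytes(s, frag_size=FRAG) for s in sizes)

print("data bytes:        ", data)
print("wasted, no frags:  ", whole_blocks - data)
print("wasted, with frags:", with_frags - data)
```

For small files the difference is large: a 300-byte file costs a full 8 KB block without fragments but only a single 1 KB fragment with them.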
Implementations
Vendors of some commercial Unix systems, such as SunOS/Solaris, System V Release 4, HP-UX, and Tru64 UNIX, have adopted UFS. Most of them adapted UFS to their own uses, adding proprietary extensions that may not be recognized by other vendors' versions of Unix. Surprisingly, many have kept the same block size and data field widths as the original UFS, so some degree of (read) compatibility remains across platforms. Compatibility between implementations as a whole is spotty at best and should be researched before using UFS across multiple platforms where sharing data is a primary goal.
As of Solaris 7, Sun Microsystems included UFS Logging, which brought filesystem journaling to UFS. Solaris UFS also has extensions for large files and large disks and other features.
In 4.4BSD and BSD Unix systems derived from it, such as FreeBSD, NetBSD, OpenBSD, and DragonFlyBSD, the implementation of UFS1 and UFS2 is split into two layers — an upper layer that provides the directory structure and supports metadata (permissions, ownership, etc.) in the inode structure, and lower layers that provide data containers implemented as inodes. This was done to support both the traditional FFS and the LFS log-structured file system with common code for common functions. The upper layer is called "UFS", and the lower layers are called "FFS" and "LFS". In some of those systems, the term "FFS" is used for the combination of the FFS lower layer and the UFS upper layer, and the term "LFS" is used for the combination of the LFS lower layer and the UFS upper layer.
FreeBSD extended the FFS and UFS layers to support a new variant, called UFS2, which adds 64-bit block pointers (allowing volumes to grow up to 8 zettabytes), variable-sized blocks (similar to extents), extended flag fields and extended attribute support. FreeBSD also introduced soft updates and the ability to make file system snapshots for both UFS1 and UFS2. These have since been ported to NetBSD. Note that OpenBSD does not currently support UFS2, and does not support soft updates.[1]
Linux includes a UFS implementation for binary compatibility at the read level with other Unixes, but since there is no standard implementation for the vendor extensions to UFS, Linux does not have full support for writing to UFS. The native Linux ext2 filesystem is inspired by UFS. (In fact, in some 4.4BSD-derived systems, the UFS layer can use an ext2 layer as a container layer, just as it can use FFS and LFS.)
NeXTStep, which was BSD-derived, also used a version of UFS. In Apple Inc.'s Mac OS X up to Mac OS X v10.4, it was available as an alternative to HFS+, Apple's proprietary filesystem. As of Mac OS X v10.5 ("Leopard"), Mac OS X cannot be installed on a UFS-formatted volume, and older versions of Mac OS X installed on UFS-formatted volumes cannot be upgraded to Leopard. Upgrading to Leopard requires reformatting the startup volume.[2]
See also
Notes
- ^ A rather ambiguous name, block level fragmentation is also used as Database terminology - a web search on the term will find plenty of such examples.
References
- Marshall Kirk McKusick, William N. Joy, Samuel J. Leffler and Robert S. Fabry. A Fast File System for UNIX (PDF) (Technical report). Computer Systems Research Group, Computer Science Division, Department of Electrical Engineering and Computer Science, University of California, Berkeley, Berkeley, CA 94720.
- Marshall Kirk McKusick, William N. Joy, Samuel J. Leffler and Robert S. Fabry (1984). "A Fast File System for UNIX" (PDF). ACM Transactions on Computer Systems. 2 (3): 181–197.
- Marshall Kirk McKusick, Keith Bostic, Michael J. Karels, and John S. Quarterman (1996). "Local Filesystems". The Design and Implementation of the 4.4BSD Operating System. Addison-Wesley. ISBN 0-201-54979-4.
- Marshall Kirk McKusick, Keith Bostic, Michael J. Karels, and John S. Quarterman (1996). "Local Filestores". The Design and Implementation of the 4.4BSD Operating System. Addison-Wesley. ISBN 0-201-54979-4.
- Marshall Kirk McKusick and Gregory R. Ganger (1999). "Soft Updates: A Technique for Eliminating Most Synchronous Writes in the Fast Filesystem" (PDF). Proceedings of the FREENIX Track: 1999 USENIX Annual Technical Conference. pp. 1–18.
- Marshall Kirk McKusick (2002). "Running "fsck" in the Background". Proceedings of the BSDCon 2002. pp. 55–64.
- The Linux Documentation Project's Filesystems HOWTO: FFS. Note that the distinction this draws between FFS and UFS is wrong; both terms are used at present and have been used in the past.
- Little UFS2 FAQ: What is the difference between UFS and FFS? Note that this gets the relationship between FFS and UFS backwards; see the "Local Filesystems" chapter of The Design and Implementation of the 4.4BSD Operating System, which refers to the upper layer as UFS, and the "Local Filestores" chapter, which refers to the lower layer as FFS.
- The Sun Solaris UFS implementation chapter of the Solaris™ Internals: Solaris 10 and OpenSolaris Kernel Architecture, Second Edition book by Richard McDougall, Jim Mauro ISBN 0-13-148209-2
- Allen, Hervey (2005-06-20). "Introduction to FreeBSD, PacNOG I Workshop, Additional Topics", Network Startup Resource Center.
External links
- Little UFS2 FAQ
- Linux userspace UFS2 tools.
- Filesystems-HOWTO (part of The Linux Documentation Project, link is not Linux specific.)
- UFS2 Tools: An open source tool for accessing UFS2 (BSD) slices from within Windows
- No UFS installation for Leopard: http://docs.info.apple.com/article.html?artnum=306516