Jump to content

Unix filesystem

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by DemocraticLuntz (talk | contribs) at 20:02, 17 February 2016 (Reverted 1 good faith edit by Lww208phi using STiki). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Version 7 Unix filesystem layout: subdirectories of "/" and "/usr"

In Unix and operating systems inspired by it, the file system is considered a central component of the operating system.[1] It was also one of the first parts of the system to be designed and implemented by Ken Thompson in the first experimental version of Unix, dated 1969.[2]

Like in other operating systems, the filesystem provides information storage and retrieval, as well as interprocess communication, in the sense that the many small programs that traditionally comprise a Unix system can store information in files so that other programs can read these, although pipes complemented it in this role starting with the Third Edition. Additionally, the filesystem provides access to other resources through so-called device files that are entry points to terminals, printers, and mice.

The rest of this article uses "Unix" as a generic name to refer to both the original Unix operating system as well as its many workalikes.

Principles

The filesystem appears as a single rooted tree of directories.[1] Instead of addressing separate volumes such as disk partitions, removable media, and network shares as separate trees (as done in MS-DOS and Windows: each "drive" has a drive letter that denotes the root of its file system tree), such volumes can be "mounted" on a directory, causing the volume's file system tree to appear as that directory in the larger tree.[1] The root of the entire tree is denoted /.

In the original Bell Labs Unix, a two-disk setup was customary, where the first disk contained startup programs, while the second contained users' files and programs. This second disk was mounted at the empty directory named usr on the first disk, causing the two disks to appear as one filesystem, with the second's disks contents viewable at /usr.

Unix directories do not "contain" files. Instead, they contain the names of files paired with references to so-called inodes, which in turn contain both the file and its metadata (owner, permissions, time of last access, etc., but no name). Multiple names in the file system may refer to the same file, a feature known as (hard) linking.[1] If this feature is taken into account, the file system is a limited type of directed acyclic graph, although the directories still form a tree, as they may typically not be hard-linked. (As originally envisioned in 1969, the Unix file system would in fact be used as a general graph with hard links to directories providing navigation, instead of path names.[2])

File types

The original Unix file system supported three types of files: ordinary files, directories, and "special files", also known as device files.[1] The Berkeley Software Distribution (BSD) and System V each added a file type to be used for interprocess communication: BSD added sockets,[3] while System V added FIFO files.

BSD also added symbolic links (often termed "symlinks") to the range of file types, which are files that refer to other files, and complement hard links.[3] Other Unix systems may support additional types of files.[4]

Conventional directory layout

Certain conventions exist for locating particular kinds of files, such as programs, system configuration files and users' home directories. These were first documented in the hier(7) man page since Version 7 Unix;[5] subsequent versions, derivatives and clones typically have a similar man page.[6][7][8][9][10]

The details of the directory layout have varied over time. Although the file system layout is not part of the Single UNIX Specification, several attempts exist to standardize (parts of) it, such as the System V Application Binary Interface, the Intel Binary Compatibility Standard, the Common Operating System Environment and Linux Foundation's Filesystem Hierarchy Standard (FHS).[11]

Here is a generalized overview of common locations of files on a Unix operating system:

Directory or file Description
/ The slash / character alone denotes the root of the filesystem tree.
/bin
Stands for "binaries" and contains certain fundamental utilities, such as ls or cp, that are needed to mount /usr, when that's a separate filesystem, or to run in single-user (administrative) mode when /usr cannot be mounted. In System V.4, this is a symlink to /usr/bin.
/boot
Contains all the files that are required for successful booting process. In Research Unix, this was a single file rather than a directory.[12]
/dev
Stands for "devices". Contains file representations of peripheral devices and pseudo-devices.
/etc
Contains system-wide configuration files and system databases; the name stands for "et cetera".[12] Originally also contained "dangerous maintenance utilities" such as init,[13] but these have typically been moved to /sbin or elsewhere.
/home
Contains user home directories on Linux and some other systems. In the original version of Unix, home directories were in /usr instead.[14] Some systems use or have used different locations still: OS X has home directories in /Users, older versions of BSD put them in /u, FreeBSD has /usr/home.
/lib
Originally "essential libraries": C libraries, but not Fortran ones.[12] On modern systems, contains the shared libraries needed by programs in /bin, and possibly loadable kernel module or device drivers. Linux distributions may have variants /lib32 and /lib64 for multi-architecture support.
/media
Default mount point for removable devices, such as USB sticks, media players, etc.
/mnt
Stands for "mount". Empty directory commonly used by system administrators as a temporary mount point.
/opt
Contains locally installed software. Originated in System V, which has a package manager that installs software to this directory (one subdirectory per package).[15]
/proc
procfs virtual filesystem showing information about processes as files.
/root
The home directory for the superuser "root" - that is, the system administrator. This account's home directory is usually on the initial filesystem, and hence not in /home (which may be a mount point for another filesystem) in case specific maintenance needs to be performed, during which other filesystems are not available. Such a case could occur, for example, if a hard disk drive suffers physical failures and cannot be properly mounted.
/sbin
Stands for "system (or superuser) binaries" and contains fundamental utilities, such as init, usually needed to start, maintain and recover the system.
/srv
Server data (data for services provided by system).
/sys
In some Linux distributions, contains a sysfs virtual filesystem, containing information related to hardware and the operating system. On BSD systems, commonly a symlink to the kernel sources in /usr/src/sys.
/tmp
A place for temporary files not expected to survive a reboot. Many systems clear this directory upon startup or use tmpfs to implement it.
/unix
The Unix kernel in Research Unix and System V.[12] With the addition of virtual memory support to 3BSD, this got renamed /vmunix.
/usr
The "user file system": originally the directory holding user home directories,[14] but already by the Third Edition of Research Unix, ca. 1973, reused to split the operating system's programs over two disks (one of them a 256K fixed-head drive) so that basic commands would either appear in /bin or /usr/bin.[16] It now holds executables, libraries, and shared resources that are not system critical, like the X Window System, KDE, Perl, etc. In older Unix systems, user home directories might still appear in /usr alongside directories containing programs, although by 1984 this depended on "local customs".[12]
/include
Stores the development headers used throughout the system. Header files are mostly used by the #include directive in C programming language, which historically is how the name of this directory was chosen.
/lib
Stores the required libraries and data files for programs stored within /usr or elsewhere.
/libexec
On BSD systems[5] and older distributions of Linux, programs that are meant to be executed by other programs rather than by users directly. E.g., the Sendmail executable may be found in this directory.[17] Not present in the FHS; Linux distributions have typically moved the contents of this directory into /usr/lib, where they also resided in 4.3BSD.
/local
Resembles /usr in structure, but its subdirectories are used for additions not part of the operating system distribution, such as custom programs or files from a BSD Ports collection. Usually has subdirectories such as /usr/local/lib or /usr/local/bin.
/share
Architecture-independent program data. On Linux and modern BSD derivatives, this directory has subdirectories such as man for manpages, that used to appear directly under /usr in older versions.
/var
Short for "variable." A place for files that may change often - especially in size, for example e-mail sent to users on the system, or process-ID lock files.
/log
Contains system log files.
/mail
The place where all the incoming mails are stored. Users (other than root) can access their own mail only. Often, this directory is a symbolic link to /var/spool/mail.
/spool
Spool directory. Contains print jobs, mail spools and other queued tasks.
/tmp
The /var/tmp directory is a place for temporary files which should be preserved between system reboots.

See also

References

  1. ^ a b c d e Ritchie, D.M.; Thompson, K. (July 1978). "The UNIX Time-Sharing System". Bell System Tech. J. 57 (6). USA: AT&T: 1905–1929. doi:10.1002/j.1538-7305.1978.tb02136.x. Archived from the original on 19 January 2013.
  2. ^ a b Ritchie, Dennis M. (1979). The Evolution of the Unix Time-sharing System. Language Design and Programming Methodology Conf.
  3. ^ a b Leffler, Samuel J. (October 1989). The Design and Implementation of the 4.3BSD UNIX Operating System. Addison-Wesley. ISBN 0-201-06196-1. {{cite book}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  4. ^ stat(2) – Linux Programmer's Manual – System Calls
  5. ^ a b "hier(7) man page for OpenBSD". Retrieved 14 September 2013.
  6. ^ "hier(7) man page for FreeBSD".
  7. ^ "hier(7) man page for 2.9.1 BSD".
  8. ^ "hier(7) man page for ULTRIX 4.2".
  9. ^ "hier(7) man page for SunOS 4.1.3".
  10. ^ "hier(7) man page for Linux".
  11. ^ George Kraft IV (1 November 2000). "Where to Install My Products on Linux?". Linux Journal. Retrieved 13 November 2014.
  12. ^ a b c d e Kernighan, Brian W.; Pike, Rob (1984). The UNIX Programming Environment. Prentice-Hall. pp. 63–65.
  13. ^ hier(7) in the Seventh Edition Unix manual.
  14. ^ a b Ritchie, Dennis. "Unix Notes from 1972". Retrieved 4 October 2012.
  15. ^ System V Application Binary Interface Edition 4.1 (1997-03-18)
  16. ^ M. D. McIlroy (1987). A Research Unix reader: annotated excerpts from the Programmer's Manual, 1971–1986. CSTR 139, Bell Labs.
  17. ^ "Chapter 7. sendmail". UNICOS/mp Networking Facilities Administration. Cray. Retrieved 14 September 2013.