mdadm

mdadm
Original author(s)	Neil Brown
Developer(s)	Community contributors, ; Mariusz Tkaczyk
Initial release	2001
Stable release	4.3 / February 15, 2024; 6 months ago
Repository	github.com/md-raid-utilities/mdadm/
Written in	C
Operating system	Linux
Available in	English
Type	Disk utility
License	GNU GPL
Website	raid.wiki.kernel.org

mdadm is a Linux utility used to manage and monitor software RAID devices. It is used in modern Linux distributions in place of older software RAID utilities such as raidtools2 or raidtools.^[3]^[4]^[5]

mdadm is free software originally^[6] maintained by, and copyrighted to, Neil Brown of SUSE, and licensed under the terms of version 2 or later of the GNU General Public License.

Name

The name is derived from the md (multiple device) device nodes it administers or manages, and it replaced a previous utility mdctl.^{[citation needed]} The original name was "Mirror Disk", but was changed as more functions were added.^{[citation needed]} The name is now understood to be short for Multiple Disk and Device Management.^[3]

Overview

Linux software RAID configurations can include anything presented to the Linux kernel as a block device. This includes whole hard drives (for example, /dev/sda), and their partitions (for example, /dev/sda1).

RAID configurations

RAID 0 – Block-level striping. MD can handle devices of different lengths, the extra space on the larger device is then not striped.
RAID 1 – Mirror.
RAID 4 – Like RAID 0, but with an extra device for the parity.
RAID 5 – Like RAID 4, but with the parity distributed across all devices.
RAID 6 – Like RAID 5, but with two parity segments per stripe.
RAID 10 – Take a number of RAID 1 mirrorsets and stripe across them RAID 0 style.

RAID 10 is distinct from RAID 0+1, which consists of a top-level RAID 1 mirror composed of high-performance RAID 0 stripes directly across the physical hard disks. A single-drive failure in a RAID 10 configuration results in one of the lower-level mirrors entering degraded mode, but the top-level stripe performing normally (except for the performance hit). A single-drive failure in a RAID 0+1 configuration results in one of the lower-level stripes completely failing, and the top-level mirror entering degraded mode. Which of the two setups is preferable depends on the details of the application in question, such as whether or not spare disks are available, and how they should be spun up.

Non-RAID configurations

Linear – concatenates a number of devices into a single large MD device – (deprecated since 2021 and removed from the Linux kernel since 2023 ^[7])
Multipath – provides multiple paths with failover to a single device
Faulty – a single device which emulates a number of disk-fault scenarios for testing and development
Container – a group of devices managed as a single device, in which one can build RAID systems

Features

The original (standard) form of names for md devices is /dev/md<n>, where <n> is a number between 0 and 99. More recent kernels have support for names such as /dev/md/Home. Under 2.4.x kernels and earlier these two were the only options. Both of them are non-partitionable.

Since 2.6.x kernels, a new type of MD device was introduced, a partitionable array. The device names were modified by changing md to md_d. The partitions were identified by adding p<n>, where <n> is the partition number; thus /dev/md/md_d2p3 for example. Since version 2.6.28 of the Linux kernel mainline, non-partitionable arrays can be partitioned, the partitions being referred to in the same way as for partitionable arrays – for example, /dev/md/md1p2.

Since version 3.7 of the Linux kernel mainline, md supports TRIM operations for the underlying solid-state drives (SSDs), for linear, RAID 0, RAID 1, RAID 5 and RAID 10 layouts.^[8]

Booting

Since support for MD is found in the kernel, there is an issue with using it before the kernel is running. Specifically it will not be present if the boot loader is either (e)LiLo or GRUB legacy. Although normally present, it may not be present for GRUB 2. In order to circumvent this problem a /boot filesystem must be used either without md support, or else with RAID1. In the latter case the system will boot by treating the RAID1 device as a normal filesystem, and once the system is running it can be remounted as md and the second disk added to it. This will result in a catch-up, but /boot filesystems are usually small.

With more recent bootloaders it is possible to load the MD support as a kernel module through the initramfs mechanism. This approach allows the /boot filesystem to be inside any RAID system without the need of a complex manual configuration.

External metadata

Besides its own formats for RAID volumes metadata, Linux software RAID also supports external metadata formats, since version 2.6.27 of the Linux kernel and version 3.0 of the mdadm userspace utility. This allows Linux to use various firmware- or driver-based RAID volumes, also known as "fake RAID".^[9]

As of October 2013^[update], there are two supported formats of the external metadata:

DDF (Disk Data Format), an industry standard defined by the Storage Networking Industry Association for increased interoperability.^[10]
Volume metadata format used by the Intel Rapid Storage Technology(RST), former Intel Matrix RAID, implemented on many consumer-level motherboards.^[9]

mdmpd

mdmpd was^[11] a daemon used for monitoring MD multipath devices up to Linux kernel 2.6.10-rc1, developed by Red Hat as part of the mdadm package.^[12] The program was used to monitor multipath (RAID) devices, and is usually started at boot time as a service, and afterwards running as a daemon.

Enterprise storage requirements often include the desire to have more than one way to talk to a single disk drive so that in the event of some failure to talk to a disk drive via one controller, the system can automatically switch to another controller and keep going. This is called multipath disk access. The linux kernel implements multipath disk access via the software RAID stack known as the md (Multiple Devices) driver. The kernel portion of the md multipath driver only handles routing I/O requests to the proper device and handling failures on the active path. It does not try to find out if a path that has previously failed might be working again. That's what this daemon does. Upon startup, it reads the current state of the md raid arrays, saves that state, and then waits for the kernel to tell it something interesting has happened. It then wakes up, checks to see if any paths on a multipath device have failed, and if they have then it starts to poll the failed path once every 15 seconds until it starts working again. Once it starts working again, the daemon will then add the path back into the multipath md device it was originally part of as a new spare path.

If one is using the /proc filesystem, /proc/mdstat lists all active md devices with information about them. Mdmpd requires this to find arrays to monitor paths on, to get notification of interesting events and to monitor array reconstruction on Monitor mode.^[13]

Technical details RAID 1

The data on a RAID 1 volume is the same as on a normal partition. The RAID information is stored in the last 128kB of the partition. This means, to convert a RAID 1 volume to normal data partition, it is possible to decrease the partition size by 128kB and change the partition ID from fd to 83 (for Linux).

References

^ "Announcement: mdadm maintainer update". marc.info. 2023-12-14. Retrieved 2024-05-17.
^ "Release mdadm-4.3". 2024-02-15.
^ ^a ^b Bresnahan, Christine; Blum, Richard (2016). LPIC-2: Linux Professional Institute Certification Study Guide. John Wiley & Sons. pp. 206–221. ISBN 9781119150817.
^ Vadala, Derek (2003). Managing RAID on Linux. O'Reilly Media, Inc. p. 140. ISBN 9781565927308. mdadm linux.
^ Nemeth, Evi (2011). UNIX and Linux System Administration Handbook. Pearson Education. pp. 242–245. ISBN 9780131480056.
^ "Mdadm". Archived from the original on 2013-05-03. Retrieved 2007-08-25.
^ "md: Remove deprecated CONFIG_MD_LINEAR". GitHub. 2023-12-19. Retrieved 2024-04-30.
^ "Linux kernel 3.7, Section 5. Block". kernelnewbies.org. 2012-12-10. Retrieved 2014-09-21.
^ ^a ^b "External Metadata". RAID Setup. kernel.org. 2013-10-05. Retrieved 2014-01-01.
^ "DDF Fake RAID". RAID Setup. kernel.org. 2013-09-12. Retrieved 2014-01-01.
^ "117498 – md code missing event interface".
^ "Updated mdadm package includes multi-path device enhancements". RHEA-2003:397-06. Redhat. 2004-01-16.
^ "Mdadm(8): Manage MD devices aka Software RAID - Linux man page".

External links

mdadm source code releases
"Installation/SoftwareRAID". Ubuntu Community Documentation. 2012-03-01.
Lonezor (2011-11-13). "Setting up a RAID volume in Linux with >2TB disks". Archived from the original on 2011-11-19.

[1] "Announcement: mdadm maintainer update". marc.info. 2023-12-14. Retrieved 2024-05-17.

[2] "Release mdadm-4.3". 2024-02-15.

[lpic-2-3] Bresnahan, Christine; Blum, Richard (2016). LPIC-2: Linux Professional Institute Certification Study Guide. John Wiley & Sons. pp. 206–221. ISBN 9781119150817.

[4] Vadala, Derek (2003). Managing RAID on Linux. O'Reilly Media, Inc. p. 140. ISBN 9781565927308. mdadm linux.

[5] Nemeth, Evi (2011). UNIX and Linux System Administration Handbook. Pearson Education. pp. 242–245. ISBN 9780131480056.

[6] "Mdadm". Archived from the original on 2013-05-03. Retrieved 2007-08-25.

[linux-linear-removal-7] "md: Remove deprecated CONFIG_MD_LINEAR". GitHub. 2023-12-19. Retrieved 2024-04-30.

[8] "Linux kernel 3.7, Section 5. Block". kernelnewbies.org. 2012-12-10. Retrieved 2014-09-21.

[linux-kernel-raid-9] "External Metadata". RAID Setup. kernel.org. 2013-10-05. Retrieved 2014-01-01.

[10] "DDF Fake RAID". RAID Setup. kernel.org. 2013-09-12. Retrieved 2014-01-01.

[11] "117498 – md code missing event interface".

[12] "Updated mdadm package includes multi-path device enhancements". RHEA-2003:397-06. Redhat. 2004-01-16.

[13] "Mdadm(8): Manage MD devices aka Software RAID - Linux man page".

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]