# Finite element method in structural mechanics

The finite element method (FEM) is a powerful technique originally developed for numerical solution of complex problems in structural mechanics, and it remains the method of choice for complex systems. In the FEM, the structural system is modeled by a set of appropriate finite elements interconnected at discrete points called nodes. Elements may have physical properties such as thickness, coefficient of thermal expansion, density, Young's modulus, shear modulus and Poisson's ratio.

## History

The origin of finite method can be traced to the matrix analysis of structures [1][2] where the concept of a displacement or stiffness matrix approach was introduced. Finite element concepts were developed based on engineering methods in 1950s. The finite element method obtained its real impetus in the 1960s and 1970s by John Argyris, and co-workers; at the University of Stuttgart, by Ray W. Clough; at the University of California, Berkeley, by Olgierd Zienkiewicz, and co-workers Ernest Hinton, Bruce Irons;[3] at the University of Swansea, by Philippe G. Ciarlet; at the University of Paris; at Cornell University, by Richard Gallagher and co-workers. The original works such as those by Argyris [4] and Clough [5] became the foundation for today’s finite element structural analysis methods.

Straight or curved one-dimensional elements with physical properties such as axial, bending, and torsional stiffnesses. This type of element is suitable for modeling cables, braces, trusses, beams, stiffeners, grids and frames. Straight elements usually have two nodes, one at each end, while curved elements will need at least three nodes including the end-nodes. The elements are positioned at the centroidal axis of the actual members.

• Two-dimensional elements that resist only in-plane forces by membrane action (plane stress, plane strain), and plates that resist transverse loads by transverse shear and bending action (plates and shells). They may have a variety of shapes such as flat or curved triangles and quadrilaterals. Nodes are usually placed at the element corners, and if needed for higher accuracy, additional nodes can be placed along the element edges or even within the element. The elements are positioned at the mid-surface of the actual layer thickness.
• Torus-shaped elements for axisymmetric problems such as membranes, thick plates, shells, and solids. The cross-section of the elements are similar to the previously described types: one-dimensional for thin plates and shells, and two-dimensional for solids, thick plates and shells.
• Three-dimensional elements for modeling 3-D solids such as machine components, dams, embankments or soil masses. Common element shapes include tetrahedrals and hexahedrals. Nodes are placed at the vertexes and possibly in the element faces or within the element.

### Element interconnection and displacement

The elements are interconnected only at the exterior nodes, and altogether they should cover the entire domain as accurately as possible. Nodes will have nodal (vector) displacements or degrees of freedom which may include translations, rotations, and for special applications, higher order derivatives of displacements. When the nodes displace, they will drag the elements along in a certain manner dictated by the element formulation. In other words, displacements of any points in the element will be interpolated from the nodal displacements, and this is the main reason for the approximate nature of the solution.

## Practical considerations

From the application point of view, it is important to model the system such that:

• Symmetry or anti-symmetry conditions are exploited in order to reduce the size of the model.
• Displacement compatibility, including any required discontinuity, is ensured at the nodes, and preferably, along the element edges as well, particularly when adjacent elements are of different types, material or thickness. Compatibility of displacements of many nodes can usually be imposed via constraint relations.
• Elements' behaviours must capture the dominant actions of the actual system, both locally and globally.
• The element mesh should be sufficiently fine in order to produce acceptable accuracy. To assess accuracy, the mesh is refined until the important results shows little change. For higher accuracy, the aspect ratio of the elements should be as close to unity as possible, and smaller elements are used over the parts of higher stress gradient.
• Proper support constraints are imposed with special attention paid to nodes on symmetry axes.

Large scale commercial software packages often provide facilities for generating the mesh, and the graphical display of input and output, which greatly facilitate the verification of both input data and interpretation of the results.

## Theoretical overview of FEM-Displacement Formulation: From elements, to system, to solution

While the theory of FEM can be presented in different perspectives or emphases, its development for structural analysis follows the more traditional approach via the virtual work principle or the minimum total potential energy principle. The virtual work principle approach is more general as it is applicable to both linear and non-linear material behaviours. The virtual work method is an expression of conservation of energy: for conservative systems, the work added to the system by a set of applied forces is equal to the energy stored in the system in the form of strain energy of the structure's components.

The principle of virtual displacements for the structural system expresses the mathematical identity of external and internal virtual work:

${\displaystyle {\mbox{External virtual work}}=\int _{V}\delta {\boldsymbol {\epsilon }}^{T}{\boldsymbol {\sigma }}\,dV\qquad \mathrm {(1)} }$

In other words, the summation of the work done on the system by the set of external forces is equal to the work stored as strain energy in the elements that make up the system.

The virtual internal work in the right-hand-side of the above equation may be found by summing the virtual work done on the individual elements. The latter requires that force-displacement functions be used that describe the response for each individual element. Hence, the displacement of the structure is described by the response of individual (discrete) elements collectively. The equations are written only for the small domain of individual elements of the structure rather than a single equation that describes the response of the system as a whole (a continuum). The latter would result in an intractable problem, hence the utility of the finite element method. As shown in the subsequent sections, Eq.(1) leads to the following governing equilibrium equation for the system:

${\displaystyle \mathbf {R} =\mathbf {Kr} +\mathbf {R} ^{o}\qquad \qquad \qquad \mathrm {(2)} }$

where

${\displaystyle \mathbf {R} }$ = vector of nodal forces, representing external forces applied to the system's nodes.
${\displaystyle \mathbf {K} }$ = system stiffness matrix, which is the collective effect of the individual elements' stiffness matrices :${\displaystyle \mathbf {k} ^{e}}$.
${\displaystyle \mathbf {r} }$ = vector of the system's nodal displacements.
${\displaystyle \mathbf {R} ^{o}}$ = vector of equivalent nodal forces, representing all external effects other than the nodal forces which are already included in the preceding nodal force vector R. These external effects may include distributed or concentrated surface forces, body forces, thermal effects, initial stresses and strains.

Once the supports' constraints are accounted for, the nodal displacements are found by solving the system of linear equations (2), symbolically:

${\displaystyle \mathbf {r} =\mathbf {K} ^{-1}(\mathbf {R} -\mathbf {R} ^{o})\qquad \qquad \qquad \mathrm {(3)} }$

Subsequently, the strains and stresses in individual elements may be found as follows:

${\displaystyle \mathbf {\epsilon } =\mathbf {Bq} \qquad \qquad \qquad \qquad \mathrm {(4)} }$
${\displaystyle \mathbf {\sigma } =\mathbf {E} (\mathbf {\epsilon } -\mathbf {\epsilon } ^{o})+\mathbf {\sigma } ^{o}=\mathbf {E} (\mathbf {Bq} -\mathbf {\epsilon } ^{o})+\mathbf {\sigma } ^{o}\qquad \qquad \qquad \mathrm {(5)} }$

where

${\displaystyle \mathbf {q} }$ = vector of a nodal displacements--a subset of the system displacement vector r that pertains to the elements under consideration.
${\displaystyle \mathbf {B} }$ = strain-displacement matrix that transforms nodal displacements q to strains at any point in the element.
${\displaystyle \mathbf {E} }$ = elasticity matrix that transforms effective strains to stresses at any point in the element.
${\displaystyle \mathbf {\epsilon } ^{o}}$ = vector of initial strains in the elements.
${\displaystyle \mathbf {\sigma } ^{o}}$ = vector of initial stresses in the elements.

By applying the virtual work equation (1) to the system, we can establish the element matrices ${\displaystyle \mathbf {B} }$, ${\displaystyle \mathbf {k} ^{e}}$ as well as the technique of assembling the system matrices ${\displaystyle \mathbf {R} ^{o}}$ and ${\displaystyle \mathbf {K} }$. Other matrices such as ${\displaystyle \mathbf {\epsilon } ^{o}}$, ${\displaystyle \mathbf {\sigma } ^{o}}$, ${\displaystyle \mathbf {R} }$ and ${\displaystyle \mathbf {E} }$ are known values and can be directly set up from data input.

## Interpolation or shape functions

Let ${\displaystyle \mathbf {q} }$ be the vector of nodal displacements of a typical element. The displacements at any other point of the element may be found by the use of interpolation functions as, symbolically:

${\displaystyle \mathbf {u} =\mathbf {N} \mathbf {q} \qquad \qquad \qquad \mathrm {(6)} }$

where

${\displaystyle \mathbf {u} }$ = vector of displacements at any point {x,y,z} of the element.
${\displaystyle \mathbf {N} }$ = matrix of shape functions serving as interpolation functions.

Equation (6) gives rise to other quantities of great interest:

• Virtual displacements that are a function of virtual nodal displacements: ${\displaystyle \delta \mathbf {u} =\mathbf {N} \delta \mathbf {q} \qquad \qquad \qquad \mathrm {(6b)} }$
• Strains in the elements that result from displacements of the element's nodes:${\displaystyle \mathbf {\epsilon } =\mathbf {Du} =\mathbf {DNq} \qquad \qquad \qquad \qquad \mathrm {(7)} }$
where ${\displaystyle \mathbf {D} }$ = matrix of differential operators that convert displacements to strains using linear elasticity theory. Eq.(7) shows that matrix B in (4) is
${\displaystyle \mathbf {B} =\mathbf {DN} \qquad \qquad \qquad \qquad \mathrm {(8)} }$
• Virtual strains consistent with element's virtual nodal displacements: ${\displaystyle \delta {\boldsymbol {\epsilon }}=\mathbf {B} \delta \mathbf {q} \qquad \qquad \qquad \qquad \mathrm {(9)} }$

## Internal virtual work in a typical element

For a typical element of volume ${\displaystyle V^{e}}$, the internal virtual work due to virtual displacements is obtained by substitution of (5) and (9) into (1):

${\displaystyle {\mbox{Internal virtual work}}=\int _{V^{e}}\delta {\boldsymbol {\epsilon }}^{T}{\boldsymbol {\sigma }}\,dV^{e}=\delta \ \mathbf {q} ^{T}\int _{V^{e}}\mathbf {B} ^{T}{\big \{}\mathbf {E} (\mathbf {Bq} -\mathbf {\epsilon } ^{o})+\mathbf {\sigma } ^{o}{\big \}}\,dV^{e}\qquad \mathrm {(10)} }$

### Element matrices

Primarily for the convenience of reference, the following matrices pertaining to a typical elements may now be defined:

Element stiffness matrix ${\displaystyle \mathbf {K} ^{e}=\int _{V^{e}}\mathbf {B} ^{T}\mathbf {E} \mathbf {B} \,dV^{e}\qquad \mathrm {(11)} }$
Equivalent element load vector ${\displaystyle \mathbf {Q} ^{oe}=\int _{V^{e}}-\mathbf {B} ^{T}{\big (}\mathbf {E} \mathbf {\epsilon } ^{o}-\mathbf {\sigma } ^{o}{\big )}\,dV^{e}\qquad \mathrm {(12)} }$

These matrices are usually evaluated numerically using Gaussian quadrature for numerical integration. Their use simplifies (10) to the following:

${\displaystyle {\mbox{Internal virtual work}}=\delta \ \mathbf {q} ^{T}{\big (}\mathbf {K} ^{e}\mathbf {q} +\mathbf {Q} ^{oe}{\big )}\qquad \mathrm {(13)} }$

### Element virtual work in terms of system nodal displacements

Since the nodal displacement vector q is a subset of the system nodal displacements r (for compatibility with adjacent elements), we can replace q with r by expanding the size of the element matrices with new columns and rows of zeros:

${\displaystyle {\mbox{Internal virtual work}}=\delta \ \mathbf {r} ^{T}{\big (}\mathbf {K} ^{e}\mathbf {r} +\mathbf {Q} ^{oe}{\big )}\qquad \mathrm {(14)} }$

where, for simplicity, we use the same symbols for the element matrices, which now have expanded size as well as suitably rearranged rows and columns.

## System virtual work

Summing the internal virtual work (14) for all elements gives the right-hand-side of (1):

${\displaystyle {\mbox{System internal virtual work}}=\sum _{e}\delta \ \mathbf {r} ^{T}{\big (}\mathbf {k} ^{e}\mathbf {r} +\mathbf {Q} ^{oe}{\big )}=\delta \ \mathbf {r} ^{T}{\big (}\sum _{e}\mathbf {k} ^{e}{\big )}\mathbf {r} +\delta \ \mathbf {r} ^{T}\sum _{e}\mathbf {Q} ^{oe}\qquad \mathrm {(15)} }$

Considering now the left-hand-side of (1), the system external virtual work consists of:

• The work done by the nodal forces R: ${\displaystyle \delta \ \mathbf {r} ^{T}\mathbf {R} \qquad \mathrm {(16)} }$
• The work done by external forces ${\displaystyle \mathbf {T} ^{e}}$ on the part ${\displaystyle \mathbf {S} ^{e}}$ of the elements' edges or surfaces, and by the body forces ${\displaystyle \mathbf {f} ^{e}}$
${\displaystyle \sum _{e}\int _{S^{e}}\delta \ \mathbf {u} ^{T}\mathbf {T} ^{e}\,dS^{e}+\sum _{e}\int _{V^{e}}\delta \ \mathbf {u} ^{T}\mathbf {f} ^{e}\,dV^{e}}$
Substitution of (6b) gives:
${\displaystyle \delta \ \mathbf {q} ^{T}\sum _{e}\int _{S^{e}}\mathbf {N} ^{T}\mathbf {T} ^{e}\,dS^{e}+\delta \ \mathbf {q} ^{T}\sum _{e}\int _{V^{e}}\mathbf {N} ^{T}\mathbf {f} ^{e}\,dV^{e}}$
or ${\displaystyle -\delta \ \mathbf {q} ^{T}\sum _{e}(\mathbf {Q} ^{te}+\mathbf {Q} ^{fe})\qquad \mathrm {(17a)} }$
where we have introduced additional element's matrices defined below:
${\displaystyle \mathbf {Q} ^{te}=-\int _{S^{e}}\mathbf {N} ^{T}\mathbf {T} ^{e}\,dS^{e}\qquad \mathrm {(18a)} }$
${\displaystyle \mathbf {Q} ^{fe}=-\int _{V^{e}}\mathbf {N} ^{T}\mathbf {f} ^{e}\,dV^{e}\qquad \mathrm {(18b)} }$
Again, numerical integration is convenient for their evaluation. A similar replacement of q in (17a) with r gives, after rearranging and expanding the vectors ${\displaystyle \mathbf {Q} ^{te},\mathbf {Q} ^{fe}}$:
${\displaystyle -\delta \ \mathbf {r} ^{T}\sum _{e}(\mathbf {Q} ^{te}+\mathbf {Q} ^{fe})\qquad \mathrm {(17b)} }$

## Assembly of system matrices

Adding (16), (17b) and equating the sum to (15) gives: ${\displaystyle \delta \ \mathbf {r} ^{T}\mathbf {R} -\delta \ \mathbf {r} ^{T}\sum _{e}(\mathbf {Q} ^{te}+\mathbf {Q} ^{fe})=\delta \ \mathbf {r} ^{T}{\big (}\sum _{e}\mathbf {k} ^{e}{\big )}\mathbf {r} +\delta \ \mathbf {r} ^{T}\sum _{e}\mathbf {Q} ^{oe}}$

Since the virtual displacements ${\displaystyle \delta \ \mathbf {r} }$ are arbitrary, the preceding equality reduces to:

${\displaystyle \mathbf {R} ={\big (}\sum _{e}\mathbf {k} ^{e}{\big )}\mathbf {r} +\sum _{e}{\big (}\mathbf {Q} ^{oe}+\mathbf {Q} ^{te}+\mathbf {Q} ^{fe}{\big )}}$

Comparison with (2) shows that:

• The system stiffness matrix is obtained by summing the elements' stiffness matrices:
${\displaystyle \mathbf {K} =\sum _{e}\mathbf {k} ^{e}}$
• The vector of equivalent nodal forces is obtained by summing the elements' load vectors:
${\displaystyle \mathbf {R} ^{o}=\sum _{e}{\big (}\mathbf {Q} ^{oe}+\mathbf {Q} ^{te}+\mathbf {Q} ^{fe}{\big )}}$

In practice, the element matrices are neither expanded nor rearranged. Instead, the system stiffness matrix ${\displaystyle \mathbf {K} }$ is assembled by adding individual coefficients ${\displaystyle {k}_{ij}^{e}}$ to ${\displaystyle {K}_{kl}}$ where the subscripts ij, kl mean that the element's nodal displacements ${\displaystyle {q}_{i}^{e},{q}_{j}^{e}}$ match respectively with the system's nodal displacements ${\displaystyle {r}_{k},{r}_{l}}$. Similarly, ${\displaystyle \mathbf {R} ^{o}}$ is assembled by adding individual coefficients ${\displaystyle {Q}_{i}^{e}}$ to ${\displaystyle {R}_{k}^{o}}$ where ${\displaystyle {q}_{i}^{e}}$ matches ${\displaystyle {r}_{k}}$. This direct addition of ${\displaystyle {k}_{ij}^{e}}$ into ${\displaystyle {K}_{kl}}$ gives the procedure the name Direct Stiffness Method.