Halstead complexity measures

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Halstead complexity measures are software metrics introduced by Maurice Howard Halstead in 1977[1] as part of his treatise on establishing an empirical science of software development. Halstead made the observation that metrics of the software should reflect the implementation or expression of algorithms in different languages, but be independent of their execution on a specific platform. These metrics are therefore computed statically from the code.

Halstead's goal was to identify measurable properties of software, and the relations between them. This is similar to the identification of measurable properties of matter (like the volume, mass, and pressure of a gas) and the relationships between them (analogous to the gas equation). Thus his metrics are actually not just complexity metrics.

Calculation[edit]

For a given problem, Let:

  • \,\eta_1 = the number of distinct operators
  • \,\eta_2 = the number of distinct operands
  • \,N_1 = the total number of operators
  • \,N_2 = the total number of operands

From these numbers, several measures can be calculated:

  • Program vocabulary: \eta = \eta_1 + \eta_2 \,
  • Program length: N = N_1 + N_2 \,
  • Calculated program length: \hat{N} = \eta_1 \log_2 \eta_1 + \eta_2 \log_2 \eta_2
  • Volume: V = N \times \log_2 \eta
  • Difficulty : D = { \eta_1 \over 2  } \times { N_2 \over \eta_2 }
  • Effort: E =  D \times V

The difficulty measure is related to the difficulty of the program to write or understand, e.g. when doing code review.

The effort measure translates into actual coding time using the following relation,

  • Time required to program: T = {E \over 18} seconds

Halstead's delivered bugs (B) is an estimate for the number of errors in the implementation.

  • Number of delivered bugs : B = {E^{2 \over 3} \over 3000} or, more recently, B = {V \over 3000} is accepted[citation needed].

Example[edit]

Let us consider the following C program:

main()
{
    int a, b, c, avg;
    scanf("%d %d %d", &a, &b, &c);
    avg = (a + b + c) / 3;
    printf("avg = %d", avg);
} 

The unique operators are: main, (), {}, int, scanf, &, =, +, /, printf

The unique operands are: a, b, c, avg, "%d %d %d", 3, "avg = %d"

  • \eta_1 = 10, \eta_2 = 7, \eta = 17
  • N_1 = 16, N_2 = 15, N = 31
  • Calculated Program Length: \hat{N} = 10 \times log_2 10 + 7 \times log_2 7 = 52.9
  • Volume: V = 31 \times log_2 17 = 126.7
  • Difficulty: D = { 10 \over 2 } \times { 15 \over 7 } = 10.7
  • Effort: E = 10.7 \times 126.7 = 1,355.7
  • Time required to program: T = { 1,355.7 \over 18 } = 75.3 seconds
  • Number of delivered bugs: B = { 1,355.7 ^ { 2 \over 3 } \over 3000 } = 0.04

References[edit]

  1. ^ Halstead, Maurice H. (1977). Elements of Software Science. Amsterdam: Elsevier North-Holland, Inc. ISBN 0-444-00205-7. 

See also[edit]

External links[edit]