Sylvester matrix

In mathematics, a Sylvester matrix is a matrix associated to two univariate polynomials with coefficients in a field or a commutative ring. The entries of the Sylvester matrix of two polynomials are coefficients of the polynomials. The determinant of the Sylvester matrix of two polynomials is their resultant, which is zero when the two polynomials have a common root (in case of coefficients in a field) or a non-constant common divisor (in case of coefficients in an integral domain).

Sylvester matrices are named after James Joseph Sylvester.

Definition[edit]

Formally, let p and q be two nonzero polynomials, respectively of degree m and n. Thus:

p(z)=p_{0}+p_{1}z+p_{2}z^{2}+\cdots +p_{m}z^{m},\;q(z)=q_{0}+q_{1}z+q_{2}z^{2}+\cdots +q_{n}z^{n}.

The Sylvester matrix associated to p and q is then the $(n+m)\times (n+m)$ matrix constructed as follows:

if n > 0, the first row is:

{\begin{pmatrix}p_{m}&p_{m-1}&\cdots &p_{1}&p_{0}&0&\cdots &0\end{pmatrix}}.

the second row is the first row, shifted one column to the right; the first element of the row is zero.
the following n − 2 rows are obtained the same way, shifting the coefficients one column to the right each time and setting the other entries in the row to be 0.
if m > 0 the (n + 1)th row is:

{\begin{pmatrix}q_{n}&q_{n-1}&\cdots &q_{1}&q_{0}&0&\cdots &0\end{pmatrix}}.

the following rows are obtained the same way as before.

Thus, if m = 4 and n = 3, the matrix is:

S_{p,q}={\begin{pmatrix}p_{4}&p_{3}&p_{2}&p_{1}&p_{0}&0&0\\0&p_{4}&p_{3}&p_{2}&p_{1}&p_{0}&0\\0&0&p_{4}&p_{3}&p_{2}&p_{1}&p_{0}\\q_{3}&q_{2}&q_{1}&q_{0}&0&0&0\\0&q_{3}&q_{2}&q_{1}&q_{0}&0&0\\0&0&q_{3}&q_{2}&q_{1}&q_{0}&0\\0&0&0&q_{3}&q_{2}&q_{1}&q_{0}\end{pmatrix}}.

If one of the degrees is zero (that is, the corresponding polynomial is a nonzero constant polynomial), then there are zero rows consisting of coefficients of the other polynomial, and the Sylvester matrix is a diagonal matrix of dimension the degree of the non-constant polynomial, with the all diagonal coefficients equal to the constant polynomial. If m = n = 0, then the Sylvester matrix is the empty matrix with zero rows and zero columns.

A variant[edit]

The above defined Sylvester matrix appears in a Sylvester paper of 1840. In a paper of 1853, Sylvester introduced the following matrix, which is, up to a permutation of the rows, the Sylvester matrix of p and q, which are both considered as having degree max(m, n).^[1] This is thus a $2\max(n,m)\times 2\max(n,m)$ -matrix containing $\max(n,m)$ pairs of rows. Assuming $m>n,$ it is obtained as follows:

the first pair is:

{\begin{pmatrix}p_{m}&p_{m-1}&\cdots &p_{n}&\cdots &p_{1}&p_{0}&0&\cdots &0\\0&\cdots &0&q_{n}&\cdots &q_{1}&q_{0}&0&\cdots &0\end{pmatrix}}.

the second pair is the first pair, shifted one column to the right; the first elements in the two rows are zero.
the remaining $max(n,m)-2$ pairs of rows are obtained the same way as above.

Thus, if m = 4 and n = 3, the matrix is:

{\begin{pmatrix}p_{4}&p_{3}&p_{2}&p_{1}&p_{0}&0&0&0\\0&q_{3}&q_{2}&q_{1}&q_{0}&0&0&0\\0&p_{4}&p_{3}&p_{2}&p_{1}&p_{0}&0&0\\0&0&q_{3}&q_{2}&q_{1}&q_{0}&0&0\\0&0&p_{4}&p_{3}&p_{2}&p_{1}&p_{0}&0\\0&0&0&q_{3}&q_{2}&q_{1}&q_{0}&0\\0&0&0&p_{4}&p_{3}&p_{2}&p_{1}&p_{0}\\0&0&0&0&q_{3}&q_{2}&q_{1}&q_{0}\\\end{pmatrix}}.

The determinant of the 1853 matrix is, up to sign, the product of the determinant of the Sylvester matrix (which is called the resultant of p and q) by $p_{m}^{m-n}$ (still supposing $m\geq n$ ).

Applications[edit]

These matrices are used in commutative algebra, e.g. to test if two polynomials have a (non-constant) common factor. In such a case, the determinant of the associated Sylvester matrix (which is called the resultant of the two polynomials) equals zero. The converse is also true.

The solutions of the simultaneous linear equations

{S_{p,q}}^{\mathrm {T} }\cdot {\begin{pmatrix}x\\y\end{pmatrix}}={\begin{pmatrix}0\\0\end{pmatrix}}

where $x$ is a vector of size $n$ and $y$ has size $m$ , comprise the coefficient vectors of those and only those pairs $x,y$ of polynomials (of degrees $n-1$ and $m-1$ , respectively) which fulfill

x(z)\cdot p(z)+y(z)\cdot q(z)=0,

where polynomial multiplication and addition is used. This means the kernel of the transposed Sylvester matrix gives all solutions of the Bézout equation where $\deg x<\deg q$ and $\deg y<\deg p$ .

Consequently the rank of the Sylvester matrix determines the degree of the greatest common divisor of p and q:

\deg(\gcd(p,q))=m+n-\operatorname {rank} S_{p,q}.

Moreover, the coefficients of this greatest common divisor may be expressed as determinants of submatrices of the Sylvester matrix (see Subresultant).

References[edit]

^ Akritas, A.G., Malaschonok, G.I., Vigklas, P.S.:Sturm Sequences and Modified Subresultant Polynomial Remainder Sequences. Serdica Journal of Computing, Vol. 8, No 1, 29--46, 2014

Weisstein, Eric W. "Sylvester Matrix". MathWorld.

[amv2014-1] Akritas, A.G., Malaschonok, G.I., Vigklas, P.S.:Sturm Sequences and Modified Subresultant Polynomial Remainder Sequences. Serdica Journal of Computing, Vol. 8, No 1, 29--46, 2014

[1]

v t e Matrix classes
Explicitly constrained entries	Alternant Anti-diagonal Anti-Hermitian Anti-symmetric Arrowhead Band Bidiagonal Bisymmetric Block-diagonal Block Block tridiagonal Boolean Cauchy Centrosymmetric Conference Complex Hadamard Copositive Diagonally dominant Diagonal Discrete Fourier Transform Elementary Equivalent Frobenius Generalized permutation Hadamard Hankel Hermitian Hessenberg Hollow Integer Logical Matrix unit Metzler Moore Nonnegative Pentadiagonal Permutation Persymmetric Polynomial Quaternionic Signature Skew-Hermitian Skew-symmetric Skyline Sparse Sylvester Symmetric Toeplitz Triangular Tridiagonal Vandermonde Walsh Z
Constant	Exchange Hilbert Identity Lehmer Of ones Pascal Pauli Redheffer Shift Zero
Conditions on eigenvalues or eigenvectors	Companion Convergent Defective Definite Diagonalizable Hurwitz Positive-definite Stieltjes
Satisfying conditions on products or inverses	Congruent Idempotent or Projection Invertible Involutory Nilpotent Normal Orthogonal Unimodular Unipotent Unitary Totally unimodular Weighing
With specific applications	Adjugate Alternating sign Augmented Bézout Carleman Cartan Circulant Cofactor Commutation Confusion Coxeter Distance Duplication and elimination Euclidean distance Fundamental (linear differential equation) Generator Gram Hessian Householder Jacobian Moment Payoff Pick Random Rotation Seifert Shear Similarity Symplectic Totally positive Transformation
Used in statistics	Centering Correlation Covariance Design Doubly stochastic Fisher information Hat Precision Stochastic Transition
Used in graph theory	Adjacency Biadjacency Degree Edmonds Incidence Laplacian Seidel adjacency Tutte
Used in science and engineering	Cabibbo–Kobayashi–Maskawa Density Fundamental (computer vision) Fuzzy associative Gamma Gell-Mann Hamiltonian Irregular Overlap S State transition Substitution Z (chemistry)
Related terms	Jordan normal form Linear independence Matrix exponential Matrix representation of conic sections Perfect matrix Pseudoinverse Row echelon form Wronskian
Mathematics portal List of matrices Category:Matrices

Definition[edit]

A variant[edit]

Applications[edit]

See also[edit]

References[edit]