# Quantum algorithm for linear systems of equations

The quantum algorithm for linear systems of equations, designed by Aram Harrow, Avinatan Hassidim, and Seth Lloyd, is a quantum algorithm formulated in 2009 for solving linear systems. The algorithm estimates the result of a scalar measurement on the solution vector to a given linear system of equations.[1]

The algorithm is one of the main fundamental quantum algorithms expected to provide a speedup over its classical counterparts, along with Shor's factoring algorithm, Grover's search algorithm and quantum simulation. Provided the linear system is sparse and has a low condition number ${\displaystyle \kappa }$, and that the user is interested in the result of a scalar measurement on the solution vector rather than the values of the solution vector itself, the algorithm has a runtime of ${\displaystyle O(\log(N)\kappa ^{2})}$, where ${\displaystyle N}$ is the number of variables in the linear system. This offers an exponential speedup over the fastest classical algorithm, which runs in ${\displaystyle O(N\kappa )}$ (or ${\displaystyle O(N{\sqrt {\kappa }})}$ for positive semidefinite matrices).

An implementation of the quantum algorithm for linear systems of equations was first demonstrated in 2013 by Cai et al., Barz et al. and Pan et al. in parallel. The demonstrations consisted of simple linear equations on specially designed quantum devices.[2][3][4]

Due to the prevalence of linear systems in virtually all areas of science and engineering, the quantum algorithm for linear systems of equations has the potential for widespread applicability.[5]

## Procedure

The problem we are trying to solve is: given an ${\displaystyle N\times N}$ Hermitian matrix ${\displaystyle A}$ and a unit vector ${\displaystyle {\overrightarrow {b}}}$, find the solution vector ${\displaystyle {\overrightarrow {x}}}$ satisfying ${\displaystyle A{\overrightarrow {x}}={\overrightarrow {b}}}$. This algorithm assumes that the user is not interested in the values of ${\displaystyle {\overrightarrow {x}}}$ itself, but rather in the result of applying some operator ${\displaystyle M}$ to x, namely ${\displaystyle \langle x|M|x\rangle }$.

First, the algorithm represents the vector ${\displaystyle {\overrightarrow {b}}}$ as a quantum state of the form:

${\displaystyle |b\rangle =\sum _{i\mathop {=} 1}^{N}b_{i}|i\rangle .}$

Next, Hamiltonian simulation techniques are used to apply the unitary operator ${\displaystyle e^{iAt}}$ to ${\displaystyle |b\rangle }$ for a superposition of different times ${\displaystyle t}$. The ability to decompose ${\displaystyle |b\rangle }$ into the eigenbasis of ${\displaystyle A}$ and to find the corresponding eigenvalues ${\displaystyle \lambda _{j}}$ is facilitated by the use of quantum phase estimation.

The state of the system after this decomposition is approximately:

${\displaystyle \sum _{j\mathop {=} 1}^{N}\beta _{j}|u_{j}\rangle |\lambda _{j}\rangle ,}$

where the ${\displaystyle |u_{j}\rangle }$ form the eigenvector basis of ${\displaystyle A}$, and ${\displaystyle |b\rangle =\sum _{j\mathop {=} 1}^{N}\beta _{j}|u_{j}\rangle }$.

We would then like to perform the linear map taking ${\displaystyle |\lambda _{j}\rangle }$ to ${\displaystyle C\lambda _{j}^{-1}|\lambda _{j}\rangle }$, where ${\displaystyle C}$ is a normalizing constant. The linear mapping operation is not unitary and thus will require a number of repetitions as it has some probability of failing. After it succeeds, we uncompute the ${\displaystyle |\lambda _{j}\rangle }$ register and are left with a state proportional to:

${\displaystyle \sum _{j\mathop {=} 1}^{N}\beta _{j}\lambda _{j}^{-1}|u_{j}\rangle =A^{-1}|b\rangle =|x\rangle ,}$

where ${\displaystyle |x\rangle }$ is a quantum-mechanical representation of the desired solution vector x. Reading out all components of x would require the procedure to be repeated at least N times. However, it is often the case that one is not interested in ${\displaystyle x}$ itself, but rather in some expectation value of a linear operator M acting on x. By mapping M to a quantum-mechanical operator and performing the quantum measurement corresponding to M, we obtain an estimate of the expectation value ${\displaystyle \langle x|M|x\rangle }$. This allows a wide variety of features of the vector x to be extracted, including normalization, weights in different parts of the state space, and moments, without actually computing all the values of the solution vector x.
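The linear algebra behind this procedure can be emulated classically for a small system. The following is a minimal NumPy sketch, not a quantum implementation: it diagonalizes ${\displaystyle A}$ directly, which phase estimation would only do implicitly, and the function name `hhl_expectation_classical` and the example matrices are illustrative assumptions.

```python
import numpy as np

def hhl_expectation_classical(A, b, M):
    """Classically emulate the linear algebra that the algorithm performs."""
    A = np.asarray(A, dtype=complex)
    b = np.asarray(b, dtype=complex)
    b = b / np.linalg.norm(b)             # |b> must be a unit vector
    lam, U = np.linalg.eigh(A)            # eigenvalues lambda_j, eigenvectors u_j (columns)
    beta = U.conj().T @ b                 # beta_j = <u_j|b>
    x = U @ (beta / lam)                  # sum_j beta_j lambda_j^{-1} |u_j> = A^{-1}|b>
    x_state = x / np.linalg.norm(x)       # normalized state |x>
    expectation = float(np.real(np.vdot(x_state, M @ x_state)))
    return x, expectation

# Illustrative 2x2 Hermitian system with eigenvalues 1 and 2.
A = np.array([[1.5, 0.5], [0.5, 1.5]])
b = np.array([1.0, 0.0])
M = np.diag([1.0, -1.0])                  # example observable
x, expectation = hhl_expectation_classical(A, b, M)
print(x.real, expectation)
```

Note that the expectation value is taken on the normalized state, since a quantum register can only hold ${\displaystyle |x\rangle }$ up to normalization.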

## Explanation of the algorithm

### Initialization

Firstly, the algorithm requires that the matrix ${\displaystyle A}$ be Hermitian so that it can be converted into a unitary operator. In the case where ${\displaystyle A}$ is not Hermitian, define

${\displaystyle \mathbf {C} ={\begin{bmatrix}0&A\\A^{\dagger }&0\end{bmatrix}}.}$

As ${\displaystyle C}$ is Hermitian, the algorithm can now be used to solve ${\displaystyle Cy={\begin{bmatrix}b\\0\end{bmatrix}}}$ to obtain ${\displaystyle y={\begin{bmatrix}0\\x\end{bmatrix}}}$.
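As a hedged illustration of this embedding, the sketch below builds ${\displaystyle C}$ from a non-Hermitian square matrix using the conjugate transpose in the lower-left block, solves the enlarged system classically, and reads ${\displaystyle x}$ off the lower half of ${\displaystyle y}$. The helper name `hermitian_embedding` and the example matrix are assumptions made for illustration.

```python
import numpy as np

def hermitian_embedding(A):
    """Return the Hermitian block matrix [[0, A], [A^dagger, 0]]."""
    A = np.asarray(A, dtype=complex)
    n = A.shape[0]
    zero = np.zeros((n, n), dtype=complex)
    return np.block([[zero, A], [A.conj().T, zero]])

# Illustrative invertible, non-Hermitian matrix.
A = np.array([[1.0, 2.0], [0.0, 1.0j]])
b = np.array([1.0, 1.0], dtype=complex)

C = hermitian_embedding(A)
rhs = np.concatenate([b, np.zeros(2, dtype=complex)])  # the vector (b, 0)
y = np.linalg.solve(C, rhs)                            # classical stand-in for the quantum solve
x = y[2:]                                              # lower half of y holds x
```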

Secondly, the algorithm requires an efficient procedure to prepare ${\displaystyle |b\rangle }$, the quantum representation of b. It is assumed either that there exists some linear operator ${\displaystyle B}$ that can take some arbitrary quantum state ${\displaystyle |\mathrm {initial} \rangle }$ to ${\displaystyle |b\rangle }$ efficiently, or that this algorithm is a subroutine in a larger algorithm and is given ${\displaystyle |b\rangle }$ as input. Any error in the preparation of state ${\displaystyle |b\rangle }$ is ignored.

Finally, the algorithm assumes that the state ${\displaystyle |\psi _{0}\rangle }$ can be prepared efficiently, where

${\displaystyle |\psi _{0}\rangle :={\sqrt {2/T}}\sum _{\tau \mathop {=} 0}^{T-1}\sin \pi \left({\tfrac {\tau +{\tfrac {1}{2}}}{T}}\right)|\tau \rangle }$

for some large ${\displaystyle T}$. The coefficients of ${\displaystyle |\psi _{0}\rangle }$ are chosen to minimize a certain quadratic loss function which induces error in the ${\displaystyle U_{\mathrm {invert} }}$ subroutine described below.
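The amplitudes of ${\displaystyle |\psi _{0}\rangle }$ are easy to tabulate numerically. The short sketch below (the function name `psi0_amplitudes` and the choice ${\displaystyle T=64}$ are illustrative assumptions) evaluates the formula above and checks that the state is normalized, which holds for ${\displaystyle T\geq 2}$.

```python
import numpy as np

def psi0_amplitudes(T):
    """Amplitudes sqrt(2/T) * sin(pi * (tau + 1/2) / T) for tau = 0, ..., T-1."""
    tau = np.arange(T)
    return np.sqrt(2.0 / T) * np.sin(np.pi * (tau + 0.5) / T)

amps = psi0_amplitudes(64)
norm_squared = float(np.sum(amps ** 2))   # should equal 1 up to rounding
```

The amplitudes are symmetric about the middle of the register and vanish smoothly towards both ends.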

### Hamiltonian simulation

Hamiltonian simulation is used to transform the Hermitian matrix ${\displaystyle A}$ into a unitary operator, which can then be applied at will. This is possible if A is s-sparse and efficiently row computable, meaning it has at most s nonzero entries per row and given a row index these entries can be computed in time O(s). Under these assumptions, quantum Hamiltonian simulation allows ${\displaystyle e^{iAt}}$ to be simulated in time ${\displaystyle O(\log(N)s^{2}t)}$.
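For intuition only, the unitary ${\displaystyle e^{iAt}}$ can be formed classically for a small Hermitian matrix by exponentiating its eigenvalues. The sketch below is not a Hamiltonian simulation algorithm and does not achieve the stated scaling; the helper names `evolution_operator` and `row_sparsity` and the tridiagonal test matrix are assumptions chosen for illustration.

```python
import numpy as np

def evolution_operator(A, t):
    """Return exp(i A t) for Hermitian A via its eigendecomposition."""
    lam, U = np.linalg.eigh(A)
    # U * phases scales column j of U by exp(i * lambda_j * t).
    return (U * np.exp(1j * lam * t)) @ U.conj().T

def row_sparsity(A, tol=1e-12):
    """Maximum number of nonzero entries in any row (the s in s-sparse)."""
    return int(np.max(np.sum(np.abs(A) > tol, axis=1)))

# Illustrative 3-sparse Hermitian matrix (tridiagonal).
A = 2.0 * np.eye(4) - np.eye(4, k=1) - np.eye(4, k=-1)
s = row_sparsity(A)
W = evolution_operator(A, 0.7)
lam, U = np.linalg.eigh(A)
```

Each eigenvector of ${\displaystyle A}$ only picks up the phase ${\displaystyle e^{i\lambda _{j}t}}$ under this operator, which is what phase estimation exploits.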

### ${\displaystyle U_{\mathrm {invert} }}$ subroutine

The key subroutine to the algorithm, denoted ${\displaystyle U_{\mathrm {invert} }}$, is defined as follows and incorporates a phase estimation subroutine:

1. Prepare ${\displaystyle |\psi _{0}\rangle ^{C}}$ on register C

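2. Apply the conditional Hamiltonian evolution ${\displaystyle \sum _{\tau \mathop {=} 0}^{T-1}|\tau \rangle \langle \tau |^{C}\otimes e^{iA\tau t_{0}/T}}$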

3. Apply the Fourier transform to the register C. Denote the resulting basis states with ${\displaystyle |k\rangle }$ for k = 0, ..., T − 1. Define ${\displaystyle \lambda _{k}:=2\pi k/t_{0}}$.

4. Adjoin a three-dimensional register S in the state

${\displaystyle |h(\lambda _{k})\rangle ^{S}:={\sqrt {1-f(\lambda _{k})^{2}-g(\lambda _{k})^{2}}}|\mathrm {nothing} \rangle ^{S}+f(\lambda _{k})|\mathrm {well} \rangle ^{S}+g(\lambda _{k})|\mathrm {ill} \rangle ^{S},}$

5. Reverse steps 1–3, uncomputing any garbage produced along the way.

The phase estimation procedure in steps 1-3 allows for the estimation of eigenvalues of A up to error ${\displaystyle \epsilon }$.

The ancilla register in step 4 is necessary to construct a final state with inverted eigenvalues corresponding to the diagonalized inverse of A. In this register, the functions f and g are called filter functions. The states 'nothing', 'well' and 'ill' are used to instruct the loop body on how to proceed: 'nothing' indicates that the desired matrix inversion has not yet taken place, 'well' indicates that the inversion has taken place and the loop should halt, and 'ill' indicates that part of ${\displaystyle |b\rangle }$ is in the ill-conditioned subspace of A and the algorithm will not be able to produce the desired inversion. Producing a state proportional to the inverse of A requires 'well' to be measured, after which the overall state of the system collapses to the desired state by the extended Born rule.
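The text above does not spell out the filter functions, so the following sketch uses one plausible choice, modeled on the piecewise form commonly attributed to the original paper: ${\displaystyle f(\lambda )=1/(2\kappa \lambda )}$ on the well-conditioned part of the spectrum, a smooth sine and cosine hand-over between ${\displaystyle 1/(2\kappa )}$ and ${\displaystyle 1/\kappa }$, and ${\displaystyle g=1/2}$ below that. These definitions, the restriction to positive ${\displaystyle \lambda }$, and the name `filter_amplitudes` are assumptions made for illustration.

```python
import numpy as np

def filter_amplitudes(lam, kappa):
    """Amplitudes (nothing, well, ill) of register S for a positive eigenvalue lam.

    The piecewise filter functions used here are an assumed example.
    """
    lo, hi = 1.0 / (2.0 * kappa), 1.0 / kappa
    if lam >= hi:                       # well-conditioned: amplitude proportional to 1/lam
        f, g = 1.0 / (2.0 * kappa * lam), 0.0
    elif lam >= lo:                     # smooth transition region
        angle = 0.5 * np.pi * (lam - lo) / (hi - lo)
        f, g = 0.5 * np.sin(angle), 0.5 * np.cos(angle)
    else:                               # ill-conditioned: flag only
        f, g = 0.0, 0.5
    nothing = np.sqrt(1.0 - f ** 2 - g ** 2)
    return np.array([nothing, f, g])

kappa = 4.0
samples = [filter_amplitudes(lam, kappa) for lam in (0.05, 0.2, 0.25, 0.5, 1.0)]
```

With this choice the 'well' amplitude is proportional to ${\displaystyle \lambda ^{-1}}$ wherever ${\displaystyle \lambda \geq 1/\kappa }$, so post-selecting on 'well' applies the inverted eigenvalues to that part of ${\displaystyle |b\rangle }$.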

### Main loop

The body of the algorithm follows the amplitude amplification procedure: starting with ${\displaystyle U_{\mathrm {invert} }B|\mathrm {initial} \rangle }$, the following operation is repeatedly applied:

${\displaystyle U_{\mathrm {invert} }BR_{\mathrm {init} }B^{\dagger }U_{\mathrm {invert} }^{\dagger }R_{\mathrm {succ} }}$

where

${\displaystyle R_{\mathrm {succ} }=I-2|\mathrm {well} \rangle \langle \mathrm {well} |,}$

and

${\displaystyle R_{\mathrm {init} }=I-2|\mathrm {initial} \rangle \langle \mathrm {initial} |.}$

After each repetition, ${\displaystyle S}$ is measured and will produce a value of 'nothing', 'well', or 'ill' as described above. This loop is repeated until 'well' is measured, which occurs with a probability ${\displaystyle p}$. Rather than repeating ${\displaystyle {\frac {1}{p}}}$ times to minimize error, amplitude amplification is used to achieve the same error resilience using only ${\displaystyle O\left({\frac {1}{\sqrt {p}}}\right)}$ repetitions.
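The saving from amplitude amplification can be seen in a two-dimensional toy model in which the state is described only by its 'failure' and 'success' amplitudes. The sketch below is an illustration under that simplifying assumption: it applies the two reflections from the loop body to a real two-component vector, and the function names are hypothetical.

```python
import numpy as np

def amplified_success_probability(p, rounds):
    """Success probability after `rounds` applications of R_init R_succ."""
    theta = np.arcsin(np.sqrt(p))
    psi = np.array([np.cos(theta), np.sin(theta)])   # (failure, success) amplitudes
    r_succ = np.diag([1.0, -1.0])                    # I - 2|success><success|
    r_init = np.eye(2) - 2.0 * np.outer(psi, psi)    # I - 2|psi><psi|
    state = psi.copy()
    for _ in range(rounds):
        state = r_init @ (r_succ @ state)
    return float(state[1] ** 2)

def optimal_rounds(p):
    """Number of rounds bringing (2k + 1) * theta closest to pi / 2."""
    theta = np.arcsin(np.sqrt(p))
    return max(0, int(round(np.pi / (4.0 * theta) - 0.5)))

p = 0.01
rounds = optimal_rounds(p)
boosted = amplified_success_probability(p, rounds)
```

For ${\displaystyle p=0.01}$ a handful of rounds already brings the success probability close to 1, compared with the roughly ${\displaystyle 1/p=100}$ repetitions expected from naive repeat-until-success.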

### Scalar measurement

After successfully measuring 'well' on ${\displaystyle S}$ the system will be in a state proportional to:

${\displaystyle \sum _{j\mathop {=} 1}^{N}\beta _{j}\lambda _{j}^{-1}|u_{j}\rangle =A^{-1}|b\rangle =|x\rangle .}$

Finally, we perform the quantum measurement corresponding to M and obtain an estimate of the value of ${\displaystyle \langle x|M|x\rangle }$.

## Run time analysis

### Classical efficiency

The best classical algorithm which produces the actual solution vector ${\displaystyle {\overrightarrow {x}}}$ is Gaussian elimination, which runs in ${\displaystyle O(N^{3})}$ time.

If A is s-sparse and positive semi-definite, then the conjugate gradient method can be used to find the solution vector ${\displaystyle {\overrightarrow {x}}}$ in ${\displaystyle O(Ns\kappa )}$ time by minimizing the quadratic function ${\displaystyle |A{\overrightarrow {x}}-{\overrightarrow {b}}|^{2}}$.

When only a summary statistic of the solution vector ${\displaystyle {\overrightarrow {x}}}$ is needed, as is the case for the quantum algorithm for linear systems of equations, a classical computer can find an estimate of ${\displaystyle {\overrightarrow {x}}^{\dagger }M{\overrightarrow {x}}}$ in ${\displaystyle O(N{\sqrt {\kappa }})}$.
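For comparison with the quantum procedure, a textbook conjugate gradient iteration is sketched below. It assumes a real symmetric positive-definite matrix, and the function name, tolerance and test matrix are illustrative assumptions rather than part of any cited method.

```python
import numpy as np

def conjugate_gradient(A, b, tol=1e-10, max_iter=1000):
    """Solve A x = b for real symmetric positive-definite A.

    Returns the approximate solution and the number of iterations used.
    """
    x = np.zeros_like(b, dtype=float)
    r = b - A @ x                 # residual
    p = r.copy()                  # search direction
    rs_old = r @ r
    for iteration in range(max_iter):
        if np.sqrt(rs_old) < tol:
            return x, iteration
        Ap = A @ p
        alpha = rs_old / (p @ Ap)
        x = x + alpha * p
        r = r - alpha * Ap
        rs_new = r @ r
        p = r + (rs_new / rs_old) * p
        rs_old = rs_new
    return x, max_iter

# Illustrative well-conditioned, 3-sparse system.
n = 20
A = 4.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
b = np.ones(n)
x, iterations = conjugate_gradient(A, b)
```

Each iteration costs one sparse matrix-vector product, which is where the factor ${\displaystyle Ns}$ in the classical running time comes from; the number of iterations grows with the condition number.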

### Quantum efficiency

The quantum algorithm for solving linear systems of equations originally proposed by Harrow et al. was shown to run in ${\displaystyle O(\kappa ^{2}\log N)}$ time. The runtime of this algorithm was subsequently improved to ${\displaystyle O(\kappa \log ^{3}\kappa \log N)}$ by Andris Ambainis.[6] Since the HHL algorithm maintains its logarithmic scaling only for sparse or low-rank matrices, Wossnig et al.[7] extended the HHL algorithm based on a quantum singular value estimation technique and provided a linear system algorithm for dense matrices which runs in ${\displaystyle O({\sqrt {N}}\log N\kappa ^{2})}$ time compared to the ${\displaystyle O(N\log N\kappa ^{2})}$ of the standard HHL algorithm.

### Optimality

An important factor in the performance of the matrix inversion algorithm is the condition number ${\displaystyle \kappa }$ of ${\displaystyle A}$, which represents the ratio of ${\displaystyle A}$'s largest and smallest eigenvalues in absolute value. As the condition number increases, the ease with which the solution vector can be found using gradient descent methods such as the conjugate gradient method decreases, as ${\displaystyle A}$ becomes closer to a matrix which cannot be inverted and the solution vector becomes less stable. This algorithm assumes that all singular values of the matrix ${\displaystyle A}$ lie between ${\displaystyle {\frac {1}{\kappa }}}$ and 1, in which case the claimed run-time proportional to ${\displaystyle \kappa ^{2}}$ will be achieved. Therefore, the speedup over classical algorithms is increased further when ${\displaystyle \kappa }$ is ${\displaystyle \mathrm {poly} (\log(N))}$.[1]
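The condition number and the spectral normalization can be checked numerically for a small matrix. The sketch below is an illustration with hypothetical helper names: it computes ${\displaystyle \kappa }$ from the singular values and rescales the matrix so that its largest singular value is 1, which places the whole spectrum in the interval from ${\displaystyle 1/\kappa }$ to 1.

```python
import numpy as np

def condition_number(A):
    """Ratio of the largest to the smallest singular value of A."""
    sv = np.linalg.svd(A, compute_uv=False)
    return float(sv.max() / sv.min())

def rescale_to_unit_spectrum(A):
    """Scale A so that its largest singular value equals 1."""
    sv = np.linalg.svd(A, compute_uv=False)
    return A / sv.max()

A = np.array([[2.0, 1.0], [1.0, 2.0]])       # eigenvalues 1 and 3
kappa = condition_number(A)
A_scaled = rescale_to_unit_spectrum(A)
scaled_sv = np.linalg.svd(A_scaled, compute_uv=False)
```

Rescaling does not change ${\displaystyle \kappa }$; it only changes the solution by a known scalar factor.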

If the run-time of the algorithm were made poly-logarithmic in ${\displaystyle \kappa }$ then problems solvable on n qubits could be solved in poly(n) time, causing the complexity class BQP to be equal to PSPACE.[1]

## Error analysis

Performing the Hamiltonian simulation, which is the dominant source of error, is done by simulating ${\displaystyle e^{iAt}}$. Assuming that ${\displaystyle A}$ is s-sparse, this can be done with an error bounded by a constant ${\displaystyle \varepsilon }$, which will translate to the additive error achieved in the output state ${\displaystyle |x\rangle }$.

The phase estimation step errs by ${\displaystyle O\left({\frac {1}{t_{0}}}\right)}$ in estimating ${\displaystyle \lambda }$, which translates into a relative error of ${\displaystyle O\left({\frac {1}{\lambda t_{0}}}\right)}$ in ${\displaystyle \lambda ^{-1}}$. If ${\displaystyle \lambda \geq 1/\kappa }$, taking ${\displaystyle t_{0}=O(\kappa /\varepsilon )}$ induces a final error of ${\displaystyle \varepsilon }$. This increases the overall run time by a factor of ${\displaystyle O\left({\frac {1}{\varepsilon }}\right)}$.
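The relationship between ${\displaystyle t_{0}}$ and the error in ${\displaystyle \lambda ^{-1}}$ can be checked with a few lines of arithmetic. The sketch below assumes the eigenvalue estimate is off by exactly ${\displaystyle 1/t_{0}}$ and that ${\displaystyle t_{0}}$ is chosen as ${\displaystyle \kappa /\varepsilon }$ so that the relative error stays below ${\displaystyle \varepsilon }$; the function name and the numerical values are illustrative.

```python
def relative_inverse_error(lam, t0):
    """Relative error in 1/lam when lam is overestimated by 1/t0."""
    delta = 1.0 / t0
    return abs(1.0 / (lam + delta) - 1.0 / lam) * lam

kappa = 10.0
eps = 0.01
t0 = kappa / eps                                        # assumed choice: t0 proportional to kappa / eps
worst_case = relative_inverse_error(1.0 / kappa, t0)    # smallest allowed eigenvalue
best_case = relative_inverse_error(1.0, t0)             # largest allowed eigenvalue
```

The smallest eigenvalue dominates the error, which is why ${\displaystyle t_{0}}$ has to grow with ${\displaystyle \kappa }$.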

## Experimental realization

While there does not yet exist a quantum computer that can truly offer a speedup over a classical computer, implementation of a "proof of concept" remains an important milestone in the development of a new quantum algorithm. Demonstrating the quantum algorithm for linear systems of equations remained a challenge for years after its proposal until 2013 when it was demonstrated by Cai et al., Barz et al. and Pan et al. in parallel.

### Cai et al.

Published in Physical Review Letters 110, 230501 (2013), Cai et al. reported an experimental demonstration of the simplest meaningful instance of this algorithm, that is, solving 2×2 linear equations for various input vectors. The quantum circuit is optimized and compiled into a linear optical network with four photonic quantum bits (qubits) and four controlled logic gates, which is used to coherently implement every subroutine for this algorithm. For various input vectors, the quantum computer gives solutions for the linear equations with reasonably high precision, ranging from fidelities of 0.825 to 0.993.[8]

### Barz et al.

On February 5, 2013, Barz et al. demonstrated the quantum algorithm for linear systems of equations on a photonic quantum computing architecture. This implementation used two consecutive entangling gates on the same pair of polarization-encoded qubits. Two separately controlled NOT gates were realized where the successful operation of the first was heralded by a measurement of two ancillary photons. Barz et al. found that the fidelity in the obtained output state ranged from 64.7% to 98.1% due to the influence of higher-order emissions from spontaneous parametric down-conversion.[3]

### Pan et al.

On February 8, 2013, Pan et al. reported a proof-of-concept experimental demonstration of the quantum algorithm using a 4-qubit nuclear magnetic resonance quantum information processor. The implementation was tested using simple linear systems of only 2 variables. Across three experiments they obtained the solution vector with over 96% fidelity.[4]

## Applications

Quantum computers are devices that harness quantum mechanics to perform computations in ways that classical computers cannot. For certain problems, quantum algorithms supply exponential speedups over their classical counterparts, the most famous example being Shor's factoring algorithm. Few such exponential speedups are known, and those that are (such as the use of quantum computers to simulate other quantum systems) have so far found limited use outside the domain of quantum mechanics. This algorithm provides an exponentially faster method of estimating features of the solution of a set of linear equations, which is a problem ubiquitous in science and engineering, both on its own and as a subroutine in more complex problems.

### Electromagnetic scattering

Clader et al. introduced a preconditioned version of the linear systems algorithm that offered two advances. First, they demonstrated how a preconditioner could be included within the quantum algorithm. This expands the class of problems that can achieve the promised exponential speedup, since the scaling of HHL and the best classical algorithms are both polynomial in the condition number. The second advance was the demonstration of how to use HHL to solve for the radar cross-section of a complex shape. This was one of the first end-to-end examples of how to use HHL to solve a concrete problem exponentially faster than the best known classical algorithm.[9]

### Linear differential equation solving

Dominic Berry proposed a new algorithm for solving linear time-dependent differential equations as an extension of the quantum algorithm for solving linear systems of equations. Berry provides an efficient algorithm for solving the full-time evolution under sparse linear differential equations on a quantum computer.[10]

### Least-squares fitting

Wiebe et al. provide a new quantum algorithm to determine the quality of a least-squares fit in which a continuous function is used to approximate a set of discrete points by extending the quantum algorithm for linear systems of equations. As the number of discrete points increases, the time required to produce a least-squares fit using even a quantum computer running a quantum state tomography algorithm becomes very large. Wiebe et al. find that in many cases, their algorithm can efficiently find a concise approximation of the data points, eliminating the need for the higher-complexity tomography algorithm.[11]

### Machine learning and big data analysis

Machine learning is the study of systems that can identify trends in data. Tasks in machine learning frequently involve manipulating and classifying a large volume of data in high-dimensional vector spaces. The runtime of classical machine learning algorithms is limited by a polynomial dependence on both the volume of data and the dimensions of the space. Quantum computers are capable of manipulating high-dimensional vectors using tensor product spaces and are thus well suited to machine learning algorithms.[12]

The quantum algorithm for linear systems of equations has been applied to a support vector machine, which is an optimized linear or non-linear binary classifier. A support vector machine can be used for supervised machine learning, in which a training set of already classified data is available, or unsupervised machine learning, in which all data given to the system is unclassified. Rebentrost et al. show that a quantum support vector machine can be used for big data classification and achieve an exponential speedup over classical computers.[13]

## References

1. ^ a b c Harrow, Aram W; Hassidim, Avinatan; Lloyd, Seth (2008). "Quantum algorithm for solving linear systems of equations". Physical Review Letters. 103 (15): 150502. arXiv:. Bibcode:2009PhRvL.103o0502H. doi:10.1103/PhysRevLett.103.150502. PMID 19905613.
2. ^ Cai, X.-D; Weedbrook, C; Su, Z.-E; Chen, M.-C; Gu, Mile; Zhu, M.-J; Li, Li; Liu, Nai-Le; Lu, Chao-Yang; Pan, Jian-Wei (2013). "Experimental Quantum Computing to Solve Systems of Linear Equations". Physical Review Letters. 110 (23): 230501. arXiv:. Bibcode:2013PhRvL.110w0501C. doi:10.1103/PhysRevLett.110.230501. PMID 25167475.
3. ^ a b Barz, Stefanie; Kassal, Ivan; Ringbauer, Martin; Lipp, Yannick Ole; Dakić, Borivoje; Aspuru-Guzik, Alán; Walther, Philip (2014). "A two-qubit photonic quantum processor and its application to solving systems of linear equations". Scientific Reports. 4: 6115. arXiv:. Bibcode:2014NatSR...4E6115B. doi:10.1038/srep06115. ISSN 2045-2322. PMC . PMID 25135432.
4. ^ a b Pan, Jian; Cao, Yudong; Yao, Xiwei; Li, Zhaokai; Ju, Chenyong; Peng, Xinhua; Kais, Sabre; Du, Jiangfeng (2013). "Experimental realization of quantum algorithm for solving linear systems of equations". Physical Review A. 89 (2): 022313. arXiv:. Bibcode:2014PhRvA..89b2313P. doi:10.1103/PhysRevA.89.022313.
5. ^
6. ^ Ambainis, Andris (2010). "Variable time amplitude amplification and a faster quantum algorithm for solving systems of linear equations". arXiv: [quant-ph].
7. ^ Wossnig, Leonard; Zhao, Zhikuan; Prakash, Anupam (2017). "A quantum linear system algorithm for dense matrices". Physical Review Letters. 120 (5): 050502. arXiv:. Bibcode:2018PhRvL.120e0502W. doi:10.1103/PhysRevLett.120.050502. PMID 29481180.
8. ^ Cai, X. -D; Weedbrook, Christian; Su, Z. -E; Chen, M. -C; Gu, Mile; Zhu, M. -J; Li, L; Liu, N. -L; Lu, Chao-Yang; Pan, Jian-Wei (2013). "Experimental Quantum Computing to Solve Systems of Linear Equations". Physical Review Letters. 110 (23): 230501. arXiv:. Bibcode:2013PhRvL.110w0501C. doi:10.1103/PhysRevLett.110.230501. PMID 25167475.
9. ^ Clader, B. D; Jacobs, B. C; Sprouse, C. R (2013). "Preconditioned Quantum Linear System Algorithm". Physical Review Letters. 110 (25): 250504. arXiv:. Bibcode:2013PhRvL.110y0504C. doi:10.1103/PhysRevLett.110.250504. PMID 23829722.
10. ^ Berry, Dominic W (2010). "High-order quantum algorithm for solving linear differential equations". Journal of Physics A: Mathematical and Theoretical. 47 (10): 105301. arXiv:. Bibcode:2014JPhA...47j5301B. doi:10.1088/1751-8113/47/10/105301.
11. ^ Wiebe, Nathan; Braun, Daniel; Lloyd, Seth (2012). "Quantum Data Fitting". Physical Review Letters. 109 (5): 050505. arXiv:. Bibcode:2012PhRvL.109e0505W. doi:10.1103/PhysRevLett.109.050505. PMID 23006156.
12. ^ Lloyd, Seth; Mohseni, Masoud; Rebentrost, Patrick (2013). "Quantum algorithms for supervised and unsupervised machine learning". arXiv: [quant-ph].
13. ^ Rebentrost, Patrick; Mohseni, Masoud; Lloyd, Seth (2013). "Quantum support vector machine for big feature and big data classification". arXiv: [quant-ph].