Perspectives of quantum annealing: methods and implementations

Quantum annealing is a computing paradigm that has the ambitious goal of efficiently solving large-scale combinatorial optimization problems of practical importance. However, many challenges have yet to be overcome before this goal can be reached. This perspectives article first gives a brief introduction to the concept of quantum annealing, and then highlights new pathways that may clear the way towards feasible and large scale quantum annealing. Moreover, since this field of research is to a strong degree driven by a synergy between experiment and theory, we discuss both in this work. An important focus in this article is on future perspectives, which complements other review articles, and which we hope will motivate further research.


Introduction
The last two decades have seen tantalizing progress in the engineering of quantum devices, raising the hope to realize quantum technologies based on precise control Thanks to the simplicity and elegance of this approach as well as its potential impact, several organizations including private companies are heavily investing efforts and resources toward its realization as large-scale quantum devices, mainly based on superconducting qubits [37,38,39,40,41]. Since the first quantum annealing device, the D-Wave device, has become commercially available in 2011 [38], a large number of proof-of-principle demonstrations have appeared, see, for example, Refs. [42,43,44,45,46,47,48,49,50] and references therein.
Nevertheless, there is an ongoing scientific debate whether such quantum annealing devices can offer an actual speedup over classical computers [51,52,53,54]. For the case of the simple transverse-field Ising model, where we choose H 1 as the transverse field Hamiltonian − N i=1 σ x i , with σ x i being the Pauli matrix at qubit i, and where H 0 is written only in terms of {σ z i }, an essential speedup over classical algorithms may not be easy to achieve because equilibrium properties of the model, as well as certain aspects of dynamics, can usually be simulated efficiently by quantum Monte Carlo methods [55], although with some exceptions, e.g., Ref. [56]. This property is shared by a class of models called 'stoquastic Hamiltonians' [57], which are defined as those having a matrix representation with all off-diagonal elements being non-positive in a product basis, usually the computational basis to diagonalize {σ z i }, leading to a classical representation without the sign problem for simulations. The situation is further complicated by the presence of noise and a variety of imperfections in real devices, all of which degrade the performance, sometimes significantly.
In the present article, we aim to highlight new pathways and perspectives for the development of quantum annealing in the hope to clear the way towards establishment of feasible approaches for efficiently solving large-scale problems of practical importance. Therefore, an important focus of this article is on future perspectives, making its characteristics rather complementary to typical review articles [58,59,60]. Correspondingly, the list of references may be far from complete, for which we apologize to many authors.
This article is organized as follows. In Sec. 2, we lay some theoretical framework of quantum annealing, starting from the representation of combinatorial optimization problems as classical Ising spin glasses, followed by an explanation of the essence of the adiabatic condition and the connection of the minimum energy gap to computational complexity. We also discuss how quantum annealing performs as compared to classical algorithms, and we point out the open question regarding the role of quantum entanglement in any possible quantum speedup. The core of the article consists of Secs. 3 and 4. In Sec. 3, we highlight several promising routes towards enhancing the performance of quantum annealing based on improvements in algorithms. These include judicious choices of the quantum driving term H 1 and non-adiabatic schemes. In Sec. 4, we discuss perspectives on the superconducting qubit platform and illustrate potential alternatives based on ultracold Rydberg atoms and trapped ions. Finally, we briefly summarize our discussions in Sec. 5.

Theoretical framework
In this section, we recapitulate some of the theoretical framework that underlies the concept of quantum annealing, which will be useful for subsequent discussions.

From optimization problems to spin glasses
As stated in the Introduction, the main motivation for building quantum annealing machines such as the D-Wave device is to solve combinatorial optimization problems, in particular in the form of binary optimization problems. While the restriction to binary variables might seem narrow at first, a wide variety of NP-hard optimization problems of interest in industry fall under this category. Because in most cases only heuristic approaches to tackle these hard problems are known, there has been much interest in harnessing quantum effects in an attempt to find better solutions. Here, a 'better solution' can signify a cost closer to optimality, faster reaching of optimality at fixed cost, or a more diverse solution pool (if the problem has more than one minimizing configuration). Prominent potential applications range from spin glasses [61], to lattice protein models [62], circuit fault diagnosis [63,64], planning [65], jobshop scheduling [66], machine learning [67], molecular similarity in chemistry [68], or the optimal trading trajectory problem [69], to name a few in addition to those listed in the Introduction. These problems can be cast in a form expressed in terms of binary (Boolean) variables x i ∈ {0, 1}, generally as high-order polynomial unconstrained binary optimization problems (HOBO) written as k-local interactions with k ≥ 3 . However, due to manufacturing constraints, current experimental devices can only handle 2-local interactions, i.e., quadratic unconstrained binary optimization problems (QUBOs) with a cost function of the form The problem to be optimized is then fully specified by Q ij and c i . A broad class of paradigmatic optimization problems from vertex covers to the traveling salesperson problem have been mapped to QUBO form. For a comprehensive study, see Ref. [27]. If the problem of interest has a cost function of high-order interactions (k-local with k ≥ 3), one should reduce it to QUBO format by using ancillary variables to implement it on a real device. For example, a 3-local expression x 1 x 2 x 3 is reduced to x 1 x 4 if we define x 4 = x 2 x 3 . The latter condition can be imposed by an additional term in the cost function This expression is 0 only when x 4 = x 2 x 3 and is positive (higher cost) otherwise. Other expressions than Eq. (3) are possible for the same purpose with an appropriate coefficient for each term. While this technique has the advantage that any problem can in principle be studied on a currently-available device, it usually leads to a large overhead in the number of variables. Therefore, effort should further be made in studying combinatorial problems in their native HOBO form. We concentrate on QUBO in this article. Equation (2) can be conveniently mapped to an Ising-like expression with the transformation σ z i = 1 − 2x i : where A k-local interaction is a term in the cost function, the function to be minimized, proportional to a product of k binary variables, and σ z i is regarded at this stage as a classical variable taking the values ±1. The Hamiltonian H SG is a typical example of the classical Ising Hamiltonian H 0 introduced in the Introduction. In Eq. (4) the constant (physically irrelevant) energy shift has been neglected. If the couplers J ij are chosen from a random distribution, the Ising model given in Eq. (4) is also known as a spin glass. In what follows and without loss of generality, quantum annealing is discussed from the point of view of spin-glass Hamiltonians. We note, however, that the approach can be extended to other physical systems. There is a solid body of work on the disordered magnetic systems described as spin glasses, see, e.g., Refs. [28,29,30,70] and references therein. Yet, despite decades of study, our theoretical understanding is still insufficient, and simulations are restricted to small system sizes because the inherent frustration and disorder result in optimization problems of superb difficulty. This challenge has led to the development of sophisticated algorithms to study the thermodynamic behavior of classical spin glasses and, more recently, the adaptation of these methods as heuristics to minimize the energy of these systems [71,72,73]. We later discuss the performance of quantum annealing compared to these new, 'quantum inspired' optimization techniques. In quantum annealing, the classical Ising variables σ z i are promoted to quantum spin operators, i.e., Pauli matrices. The solution to the optimization problem encoded in the ground state of Eq. (4) is then sought by slowly sweeping the system from a simple Hamiltonian H 1 , whose ground state can be easily prepared and which does not commute with H SG , to H 0 = H SG , see Eq. (1). During this procedure, the probability to find a given classical configuration converges from a uniform distribution to a distribution that is ideally strongly peaked at the ground state of H SG , see Fig. 2. In this context, the difficulty of solving the spin-glass problem translates into the presence of exponentially many energy gaps that are exponentially small once the annealing procedure enters the spin-glass phase [51,74]. This makes quantum annealing a formidable task, for theoretical simulation and experimental realization alike.
Moreover, because the connectivity graph of the currently-available quantum annealers is limited, the study of any problem that does not happen to match, e.g., the underlying Chimera graph structure of the D-Wave device (see Fig. 3) requires 'embedding' [75], i.e., a representation of the logical optimization problem in the physically available graph structure. This embedding typically consumes a considerable overhead in terms of spin variables [76]. For example, in the extreme case of a fullyconnected graph, approximately 30 logical spin variables can be embedded in the 2000 variables of the D-Wave 2000Q quantum annealer. This overhead severely limits any asymptotic scaling analysis. As such, random spin-glass problems defined on the native  Chimera lattice of the D-Wave device [37] have been extensively used to benchmark these machines, as well as quantum annealing in general [43,44,45,46,47,48,49,50,42]. An alternative encoding of optimization problems for quantum annealing is the LHZ (Lechner-Hauke-Zoller) scheme [77,78]. In this architecture, the optimization problem is encoded in the local fields acting on the individual qubits, and all interactions are problem independent 4-body terms among nearest neighbors on a two dimensional grid. See Sec. 3.2 for more details. Using LHZ, all-to-all models can be implemented in various platforms including transmon qubits [79], Kerr non-linear oscillators [80,81], flux qubits [82] and Rydberg atoms [83] which will be discussed in Section 4.3 in detail.

Adiabatic approximation and adiabatic theorem
As already suggested, the existence of quantum phase transitions and small energy gaps is one of the main sources of difficulties in quantum annealing. We describe this point in this and the following part of this section.
One of the important theoretical bases of quantum annealing is the adiabatic approximation [32,33]. The statement is that a quantum system with slowly timedependent parameters in the Hamiltonian closely follows the instantaneous eigenstate in its time evolution if we start the dynamics in one of the eigenstates of the initial Hamiltonian. ¶ Let us write the instantaneous eigenvalue equation of a Hamiltonian as where j = 0 denotes the instantaneous ground state and j ≥ 1 represents instantaneous excited states. We assume for simplicity that states are discrete and non-degenerate. In Eq. (8), the time variable t is fixed and is regarded as a parameter. The adiabatic approximation is concerned with the solution |ψ(t) to the timedependent Schrödinger equation generated by H(t) with t now a dynamical variable. Our interest focuses mostly on the ground state, and thus we assume that the time evolution of the system starts from the ground state of the initial Hamiltonian H(0) = H 1 , |ψ(0) = |0(0) . Then, the adiabatic approximation is that the wave function is considered to stay close to the instantaneous ground state after time evolution to t = τ , | 0(τ )|ψ(τ ) | 2 ≈ 1 , as long as the adiabatic condition holds max 0≤t≤τ j(t)| dH(t) dt |0(t) ∆ j,0 (t) 2 1 (j ≥ 1) .
Here, ∆ j,0 (t) = j (t) − 0 (t) is the energy gap between the ground state and the jth excited state. The adiabatic condition, Eq. (9), is derived from the evaluation of excitation probabilities and is not necessarily mathematically rigorous. Indeed, counterexamples are known in which the state of the system deviates significantly from the instantaneous ground state even when the adiabatic condition is satisfied. In particular, when the Hamiltonian has a time-dependent parameter with a time scale different from the intrinsic time scale related to the energy scale of the system, the system sometimes displays an oscillatory behavior even when the adiabatic condition, Eq. (9), is satisfied [33].
This last possibility is excluded if the time dependence of the Hamiltonian is parametrized by a single variable s = t/τ , where t changes from 0 to τ and accordingly s runs from 0 to 1,H(s) ≡ H(t). If the Hamiltonian has this property, which is the case in most theoretical models of quantum annealing, the adiabatic condition Eq. (9) ¶ We distinguish the term 'adiabatic theorem' from 'adiabatic approximation', the former being reserved for rigorous mathematical theorems [34,35,36] whereas the latter is for approximations [32,33].
for j = 1 is rewritten as, This equation gives an estimate of the computation time τ in terms of the minimum energy gap ∆ min = min s ∆ 1,0 (s) and is frequently used in theoretical analyses. There exist several mathematically rigorous statements (adiabatic theorems) that give sufficient conditions for the system to stay sufficiently close to the instantaneous ground state [85,34,35,36]. Nevertheless, since our interest is in the qualitative properties of the computation time τ as a function of the system size N , and rigorous adiabatic theorems lead to qualitatively the same conclusions, either polynomial or exponential time dependence of τ , we simply use the adiabatic condition of Eq. (10) in the following discussion.

Adiabatic quantum computing and quantum annealing
Often, the framework of quantum computation that imposes the adiabatic condition Eq. (10), such that the system is supposed to always follow the instantaneous ground state, is called adiabatic quantum computing. The idea was first proposed in the context of a typical optimization problem of Exact Cover [86], and provides a framework that is known in its most general form to be theoretically equivalent to the traditional circuit(gate)-based model of quantum computing [87,88].
Quantum annealing is a related but broader concept [18] + . Quantum annealing allows not only strict adiabatic processes but also non-ideal situations including diabatic (non-adiabatic) transitions due to fast changes of Hamiltonian parameters as well as noisy environments of thermal and other origins. Nevertheless, the term quantum annealing is often used in the restricted sense of adiabatic computing, and we follow this tradition as long as it does not lead to a confusion.

Energy gap and computational complexity
As seen in Eq. (10), performance of quantum annealing strongly depends on the properties of the energy gap between ground and first excited state, ∆ 1,0 , which should not be too small lest the necessary computation time τ becomes large or the system departs from the instantaneous ground state. However, the ground states of the initial and final Hamiltonians are qualitatively quite different because the initial state is supposed to be trivial, while the final state should encode a highly non-trivial optimization problem. This means that a phase transition occurs during the annealing procedure, and a quantum phase transition is known to be characterized by a vanishing energy gap in the thermodynamic limit [94]. + Several papers had proposed the term 'quantum annealing' [89,90,91] (see also [92,93]) before Ref. [18] formulated quantum annealing in its current style. Those early papers used classical processes following, typically, the imaginary-time Schrödinger equation.
Let us denote the minimum gap min s ∆ 1,0 (s) encountered during the annealing protocol as ∆ min . Suppose that this gap closes exponentially as a function of the system size, ∆ min ∝ e −cN (c > 0), which occurs at a first-order quantum phase transition * . The order of magnitude of the numerator on the left-hand side of Eq. (10) is the same as that of the Hamiltonian, i.e., O(N ), and thus can be ignored in comparison with the muchstronger exponential dependence of the denominator. The computational complexity (the size dependence of the computation time) is then exponential, τ ∝ e 2cN . In this case, the problem is considered hard to solve. If, on the other hand, the minimum gap closes polynomially ∆ min ∝ N −l (l > 0), as is usually observed in a second-order phase transition according to finite-size scaling [98], then the computation time is polynomial τ ∝ N 2l+1 and the problem is considered easy. Therefore the size dependence of the gap and the order of transition play important roles in the analysis of quantum annealing in the adiabatic framework. Notice here that a phase transition is a phenomenon that occurs only in the thermodynamic limit. Nevertheless, the order of transition in this limit gives us important information on the computational complexity for large but finite-size problems . Consequently, one approach to improving the performance of quantum annealers is by mitigating the gap closing, e.g., by judicious choice of the annealing schedule [99] or by inhomogeneous driving of the transverse fields [100,101], see Sec. 3.1.2.

Quantum annealing vs classical simulation
A driving force for the development of quantum annealers is the hope to outperform classical devices, a question that is intrinsically tied to the scaling of the minimum gap. Before one can compare quantum annealing to classical approaches, however, one must define the metrics to be used. There are two main tracks to compare the performance of algorithms: Resource estimation and performance for specific tasks.
Resource estimation [102] amounts to determining the necessary resources a particular method needs to achieve a target value. These resources can be, for example, time, energy, or memory. The target value can either be a particular energy threshold or the true optimum of a benchmark instance. In the case of quantum annealing, it has been common to use spin-glass-like Hamiltonians as benchmark problems with the resources measured being the time it takes to solve an ensemble of problems as a function of the size of the input (here, the number of variables). Most recently, it has been argued [50] that energy should be included in such metrics. In addition, careful definitions of speedup are needed between quantum and classical paradigms. The first careful definition of quantum speedup was done in Ref. [103], later extended in Ref. [48] to add more granularity. * A few exceptions are known [95,96,97].
There are known instances of relevant optimization problems where the minimum gap, e.g., is polynomial for small systems and becomes a stretched exponential only for sufficiently large problem instances [74].
One can also use specific tasks to evaluate the performance of an algorithm. For example, one could analyze the quality of the solutions, as well as how correlated or uncorrelated a set of solutions are when the algorithms are run multiple times. Multiple studies [104,105] have shown that quantum annealing is intrinsically biased and is therefore not a good sampler. As such, we focus here on resource requirements. However, it is worth emphasizing that advanced control techniques as outlined in this review, as well as non-stoquastic drivers, might alleviate the sampling problem for selected problems.
There have been multiple studies on the performance of quantum annealing and quantum annealing machines [106,107,39,108,109,110,44,43,111,112,113,52,114,115,45,116,117,46,61,118,119,120]. However, to date, it remains controversial if there is any "quantum speedup" or not. Such a statement strongly depends on the benchmark problem used and, to date, there is no industrial application where quantum annealing outperforms classical heuristics to the best of our knowledge. To extend the scaling regime of the different benchmark studies, researchers have shifted their focus to simple basic spin-glass problems, ideally on the native topology of quantum annealing machines. Unfortunately, it was demonstrated that random spin-glass benchmarks [44] on sparse graphs might not be complex enough to observe any advantage [43,45]. Efforts have recently been focused on synthetic benchmark problems constructed using planting methods [46,120], post-selection [45,119,121], and gadgets [122,50]. Despite all these efforts, only Ref. [50] has demonstrated a constant speedup of quantum annealing over the best available heuristics for a synthetic benchmark. However, a constant speedup can easily be overcome by better implementations of the classical heuristics on better classical hardware [123,124].
Reference [48] performed a comprehensive analysis of quantum annealing compared to multiple classical heuristics using synthetic benchmark problems. Quantum annealing was outperformed by parallel tempering Monte Carlo [125,126] with isoenergetic cluster updates [127,128], as well as the Hamze-de Freitas-Selby heuristic [129,130], and the hybrid cluster method [61]. Nevertheless, within the class of sequential algorithms, quantum annealing on the D-Wave device as well as simulated quantum annealing outperformed simulated annealing [131] and population annealing Monte Carlo [132,133,134,135], therefore being the most efficient sequential optimization method to date. However, if the best-known quantum inspired optimization methods [136,137,138,48,49,50] are included, a different picture emerges. We emphasize that this statement concerns standard quantum annealing with a simple transverse field driver and a simple annealing schedule.
Thus, a definite quantum speedup remains to be found, and it is an important goal of this article to present possible pathways for quantum annealing using advanced control parameters and more complex drivers that may point to a possible future direction toward unlocking the long sought-after quantum speedup over classical hardware.

Role of entanglement in quantum annealing
For various quantum technologies, especially quantum cryptography [139] and quantumenhanced metrology [140], entanglement has often been regarded as a resource that enables tasks beyond the capabilities of classical devices.
In the case of quantum annealers, the presence of entanglement has been witnessed [141,142], but these experiments did not show any direct connection to the success probability or any other figure of merit of the quantum annealer. As several theoretical investigations studying different measures for entanglement indicate, the maximal entanglement achieved during the sweep has no implication on the success probability or a quantum speedup [143]. However, Ref. [144] found that the entanglement that survives at the end of the sweep does give an upper bound on the success probability. That is, if the system does not manage to get rid of the entanglement, the final state cannot assume a unique classical configuration, and the probability to reach the Ising ground state is diminished. A second indication for the role of entanglement in quantum annealing comes from Ref. [145], where it was shown that matrix product states with a larger cutoff for the allowed entanglement (as measured by the bond dimension) perform better in a simulation of the annealing procedure. Nevertheless, much further work is needed in order to understand the role of entanglement and quantumness towards any quantum speedup in a quantum annealing device.

Perspectives on Methods
Due to the difficulty of benchmarking quantum annealers for spin-glass problems numerically and experimentally, the ultimate gain possible by these quantum devices over classical machines is an unresolved question of current research [53,54]. There are, however, various pathways towards boosting their performance, on the conceptual as well as on the experimental side. These represent the bulk of the rest of this article. This section treats perspectives on methods while those on physical implementations are discussed in Sec. 4.

Improvements by non-traditional terms and parameter control
This section explains recent approaches to the improvement of the performance of quantum annealing, which use methods that lie out of the traditional formulation of quantum annealing based on the control of a simple transverse field applied uniformly over all qubits.

Non-stoquastic Hamiltonians.
Most theoretical and experimental studies of quantum annealing refer to the transverse-field Ising model with the Hamiltonian where H 0 is the classical Ising model of Eq. (4). The Hamiltonian in Eq. (11) with nonnegative B(t) has all its off-diagonal elements real and non-positive on the standard basis to diagonalize σ z i (∀i) (the computational basis). A Hamiltonian satisfying this condition is called 'stoquastic', a compound of 'stochastic' and 'quantum' [146]. † A stoquastic Hamiltonian can generally be simulated efficiently on a classical computer by the standard method of Trotter decomposition (see [149,150] for exceptions), which leads to non-negative effective local Boltzmann factors [151]. This means that there is no sign problem, which plagues classical simulations of many nontrivial quantum-mechanical systems [152,153,151]. Another interesting property of a stoquastic Hamiltonian is that all coefficients of the ground-state wave function can be chosen to be non-negative in the standard basis, due to the Perron-Frobenius theorem, which excludes quantum interference effects in the wave function. These facts suggest the possibility that a stoquastic Hamiltonian may be devoid of essential quantum effects necessary for quantum advantage over classical algorithms. Related is the absence of a formal proof that the computational power of quantum annealing based on stoquastic Hamiltonians exceeds that of classical algorithms for specific problems with a few exceptions including the glued-tree problem [154], in which one ingeniously takes advantage of intermediate diabatic transitions. See also [155], where the nonstandard adaptive measurements in the final state are shown to be useful to achieve quantum enhancement in stoquastic Hamiltonians. This latter development is worth further scrutiny to clarify the power of stoquastic Hamiltonians.
A non-stoquastic Hamiltonian has arbitrary signs (and even complex values) in off-diagonal elements in the standard basis. A typical example is the Hamiltonian where Γ i ≥ 0 and some of the coefficients γ ij are positive ‡. Also, quantum annealing based on a non-stoquastic Hamiltonian of the form of Eq. (12) is known to be equivalent to gate-based quantum computation [88]. See also Ref. [87].
We are thus motivated to study both theoretically and experimentally the properties of non-stoquastic Hamiltonians. There exist, however, many obstacles because, for example, it is difficult to study numerically large-size spin-glass-like systems by classical simulations due to a large relaxation time even in the absence of the sign problem, see Sec. 2.1. Experimentally, realizing strong, tunable transverse interactions is also challenging, particularly for artificial atoms that are engineered to emulate spin-1/2 systems. For example, in the case of superconducting qubits (see Sec. 4.1) using inductive couplings, the engineered magnetic moments I x,i used to realize the † More precisely, a Hamiltonian is stoquastic if the off-diagonal elements can be chosen real and nonpositive in a basis which is a product of local bases [147]. The existence/absence of such a basis is not necessarily trivial [147,148]. γ ij σ x i σ x j term are themselves dependent on the external control fields applied during the annealing protocol. This can be contrasted with an ideal spin-1/2 system, for which the magnetic moment µ x,z is independent of the externally-applied magnetic fields B x,y,z . Even when the I x,i are appropriately compensated, the magnitude of the resulting engineered coupling may not be as large as desired without increased engineering complexity, for example, through additional coupling elements to enhance the maximal value of γ ij .
Nevertheless, several theoretical results have been reported that reveal enhanced computational capabilities of non-stoquastic Hamiltonians. In Refs. [156,157,158,159], it was shown that, in simple mean-field-type models, an additional XX term with positive coefficient as in Eq. (12) reduces first-order phase transitions present in the conventional transverse-field Ising model of Eq. (11) to a second order transition. As described in Sec. 2.4, this change of the order of the phase transition means an exponential speedup relative to the stoquastic case.
More specifically, let us adopt the p-spin model as the cost function, where p ≥ 3 is an integer. Although the ground state is trivial for this model-all spin up σ z i = 1 (and all spin down σ z i = −1 for p even)-the system undergoes a firstorder phase transition if we use the conventional transverse field as the quantum driving Hamiltonian H 1 . This means that quantum annealing cannot solve this simple problem [160]. It is nevertheless possible to avoid this difficulty by the introduction of an XX term with a positive coefficient [156], where s and λ are time-dependent parameters that change from s = 0, λ arbitrary at t = 0 to s = λ = 1 at t = τ . The last term in Eq. (14) makes the Hamiltonian non-stoquastic in the standard computational basis. Conventional quantum annealing is recovered for λ = 1, and λ < 1 represents the introduction of non-stoquastic effects. The phase diagram derived from statistical-mechanical calculations is displayed in Fig. 4. This figure shows that the line of first-order transitions (red solid line) terminates in the middle of the phase diagram and is replaced by a line of second-order transitions (blue dashed). It is thus possible to choose a path connecting the starting point s = 0, λ arbitrary and the final point s = λ = 1 without crossing a first-order transition. This means an exponential speedup relative to the conventional method that runs along the line λ = 1. Note that this speedup is not against best classical algorithms and also that an ingenious trick enables one to simulate classically this particular nonstoquastic Hamiltonian [162]. These notes notwithstanding, it is encouraging that an example exists that explicitly shows exponential performance enhancement by nonstoquastic Hamiltonians relative to the stoquastic counterpart. Also, a small-size spin  [161]. Based on [156].
glass problem was shown to have enhanced performance by a non-stoquastic Hamiltonian [163]. Though some efforts have begun to understand how non-stoquastic effects lead to enhanced performance [164], our knowledge is still primitive and more extensive studies should follow to reveal the computational power of non-stoquastic Hamiltonians, which may lead to essential quantum-mechanical enhancement even against classical algorithms.

Inhomogeneous driving of the transverse field.
In conventional quantum annealing, the amplitude of a transverse field is controlled uniformly (homogeneously) over all qubits as in Eq. (11). As discussed in Sec. 2.4, an adiabatic control of the homogeneous field from the initial value B(0) = 1 to B(τ ) = 0 causes a phase transition, which is one of the sources of difficulties. A control of the field that is inhomogeneous over space, where the amplitude Γ i depends on i, may weaken the effects of a phase transition, because the number of qubits with the right critical values of coefficients for a phase transition to happen, A(t)/(B(t)Γ i ) = critical, is reduced from macroscopic N for the uniform case (Γ i = 1 ∀i) to a much smaller number. In other words, only a small portion of the system is at the phase transition point, and the effects of phase transition may be weakened. This is the basic idea of inhomogeneous driving of the transverse field. References [165,166,167,168] studied the one-dimensional Ising model with an inhomogeneous field analytically based on the Kibble-Zurek mechanism [169,170] and found evidence for better performance as measured by the number of defects (misaligned spins) in the final state. Numerical [171] and analytical [172,173] work on typical combinatorial optimization problems of 3-SAT and maximum independent set problems also indicated improvements by inhomogeneity of the transverse field. Equilibrium statistical mechanical analysis of the p-spin model showed that the first-order phase transition is removed by the inhomogeneity of the transverse field [100,101], which means an exponential speedup over the homogeneous protocol. Following Refs. [100,101], let us choose in Eq. (15) the coefficients as A(t) = t/τ (≡ s), B(t) = 1, and the amplitude Γ i = 1 for i = 1, 2, · · · , N (τ − 1) (0 ≤ τ ≤ 1) for qubits with field and Γ i = 0 for i = N (τ − 1) + 1, · · · , N for qubits without field, i.e., where H 0 is the cost function of the p-spin model Eq. (13). Here, the parameter τ is for the ratio of qubits without transverse field (not to be confused with the computation time), and we start the annealing process with s = τ = 0 (H = − N i=1 σ x i ) and end with s(> 0) arbitrary and τ = 1. The phase diagram is drawn in terms of these two parameters as in Fig. 5, where τ = 0 represents the conventional protocol with transverse field homogeneously applied to all qubits. Each line represents a series of first-order transitions for a given p, starting from the τ = 0 line and ending in the middle of the phase diagram. It is clearly possible to choose a path connecting the starting point s = τ = 0 and ending at s arbitrary and τ = 1, implying the existence of a way to switch off the field qubit by qubit as time s proceeds without hitting a phase transition. This means an exponential speedup as compared to the conventional method (τ = 0) in the same sense as in the previous section for non-stoquastic Hamiltonians.
Experimentally, the current version of the D-Wave device features the option of an 'anneal offset,' which is to change the amplitude of the transverse field individually to some extent. Although it is not a complete realization of inhomogeneous field driving as described above, it nevertheless shows improved performance for some hard optimization problems [174]. In addition, it improves the biased sampling [105] that has been observed with standard transverse-field annealing protocols [141].

Reverse annealing.
Reverse annealing is another protocol to control the amplitude of the transverse field in a non-traditional way. As proposed in Ref. [175] (see also [176]), we may be able to take advantage of partial knowledge on the correct solution, and use it as a candidate state expected to be close to the solution. Such a candidate may have been obtained by another method like classical simulations. We then start from this specific classical state under zero transverse field and increase the amplitude of the transverse field to a pre-assigned finite value and then decrease it again to zero. The system explores the space of states near the candidate state, which may be more efficient than the whole-space search from the tabula rasa initial condition in the traditional method.
Numerical results on a typical combinatorial optimization problem of 3-SAT indicate encouraging results [175]. An analysis of the p-spin model revealed that a firstorder phase transition in the traditional formulation disappears by reverse annealing if the initial condition is chosen close to the correct solution [177]. In order to enforce the candidate state as the initial state, we add a term to the Hamiltonian of the p-spin model where s and λ are parameters to control the time evolution, running from s = λ = 0 initially to s = λ = 1 finally. The parameter i (= ±1) represents the candidate state, and the final term in the Hamiltonian Eq. (17) constrains the initial state to σ z i = i (∀i) at s = λ = 0. The ground state is trivial for the p-spin model (σ z i = 1 (∀i) for p odd and also σ z i = −1 (∀i) for p even), and i is expected to be 1 with a probability c reasonably close to 1. In the phase diagram depicted in Fig. 6, the conventional quantum annealing (along the vertical line λ = 1) is seen to encounter a first order transition (blue curve) during the time evolution along the s axis. The line of first order transitions drawn blue terminates in the middle of the phase diagram if the initial condition is fairly close to the final solution, i.e. c close to 1 as in panels (b) and (c). This signifies an exponential speedup over the conventional approach along the line λ = 1.
Further theoretical, numerical, and experimental studies would help us understand when and how reverse annealing is effective.

Non-adiabatic schemes
As discussed in Sec. 2.2, the adiabatic condition poses a fundamental speed limit to adiabatic quantum computing given by near-crossings of energy levels. A strategy to overcome this fundamental limitation is to detach from the adiabatic approximation and introduce non-adiabatic methods, so-called counter-diabatic driving [178, 179, 180,

181, 182]
and/or short-cut to adiabaticity [183,184,185]. Let us summarize the main idea of counter-diabatic driving.
We consider the frame rotating with respect to a unitary operator U and define the Hamiltonian in the rotating frameĤ = U † HU and the corresponding wave function |ψ = U † |ψ . The Schrödinger equation is then where have defined the adiabatic gauge potential as In the laboratory frame the gauge potential eliminates the off-diagonal terms of the moving Hamiltonian and thus any transition. This can be seen from differentiatinĝ H(λ) with respect to λ. We obtain as [∂ λĤ ,Ĥ] = 0. The adiabatic gauge potential A λ from this equation is an exact formulation of the diabatic transition due to a general unitary transformation. The idea of counter diabatic driving is to cancel this term by introducing it with a negative sign in the Hamiltonian Here, H(t) is the original time-dependent Hamiltonian from the adiabatic quantum computing protocol. In adding the exact counter-diabatic gauge potential, transitions get suppressed and the system remains in its instantaneous ground state for all times and sweeps [179]. However, from an experimental point of view, the exact gauge potential A λ is of limited use, because the terms that appear in the Hamiltonian are highly non-local and include all orders of k-body interactions [179,181]. It is thus a natural question whether one can find approximate counter-diabatic Hamiltonians that can be built with the available resources and still sufficiently improve the ground-state fidelity.
Recently, a variational method to find approximate counter-diabatic Hamiltonians has been introduced by Sels and Polknovnikov [181]. In this scheme, the recipe to construct approximate counter-diabatic terms is to use an ansatz that includes the experimental available interactions, which is then variationally optimized with respect to the adiabatic gauge potential. Here, we introduce A * λ as the approximate Gauge potential which we variationally optimize. The idea is to minimize the operator distance between the exact Gauge potential A λ and the approximate ansatz Gauge potential A * λ defined as where G is the Hilbert-Schmidt norm The aim is to minimize the operator distance with respect to the variational parameters of A * λ . In Ref. [181] it was shown that operator distance is equivalent to the minimum of the action associated with the approximate adiabatic gauge potential A * λ , i.e., where δ denotes the partial derivative. In Ref. [181], this variational approach was demonstrated for a range of model systems, including an Ising spin chain. In this model, the counter-diabatic terms are derived from the ansatz Here, the switching functions α, β and γ describes a smooth function of the form where τ is the sweep time and λ 0 and λ f the initial and finals values, respectively. This particular form of the switching function is chosen to satisfy the condition that first and second derivatives at t = 0 and t = τ vanish. Adding a Hamiltonian term of the form of Eq. (27) with a switching function as in Eq. (28) to the standard annealing Hamiltonian improves the ground-state fidelity considerably. However, Eq. (27) consists of all-to-all σ y σ x couplings, which is experimentally challenging. In Ref. [186], the variational counter diabatic terms have been combined with the LHZ implementation of all-to-all spin models [77]. The LHZ architecture encodes the optimization problem in local fields, as compared to the conventional Ising spin representation where the problem is encoded in pair interactions. The LHZ Hamiltonian reads as where A, B, and C are switching protocols, K is the number of connections (e.g., K = N (N − 1)/2 for all-to-all models) and the indices in the last term run over four qubits in each plaquette of a square lattice. The main result of Ref. [186] is a protocol to find optimal counter-diabatic terms with local qubit operations from a single-qubit ansatz for the counter-diabatic terms of the form A * λ = i α i σ y i . The complete protocol is a quantum-classical hybrid approach where a variational parameter is updated from the outcome of a measurement from a quantum protocol. This protocol consists of two distinct variational optimizations: First, the analytical result from the gauge potential and second, the iterative optimization of a single optimization parameter from repeated measurements.
The possibility to loosen the strict adiabatic condition by using additional information about the system, be it variationally or analytically, may open the door to a wide range of theoretical and practical advances in adiabatic quantum computing. Approximate counter-diabatic Hamiltonians may allow one to trade sweep time for available information about the problem and experimentally available resources.

Perspectives on Implementations
Further theoretical advances are just one missing piece in order to arrive at quantum annealers that solve problems of practical relevance. Just as important will be a strong continued progress in the design of quantum devices. In this section, we discuss the state of the art and future prospective of superconducting qubits, the paradigm platform for quantum annealing, as well as ideas for alternative platforms based on Rydberg atoms and trapped ions.

Superconducting qubits
Superconducting qubits are artificial spin-1/2 systems comprising lithographically defined inductors, capacitors, wires, and Josephson tunnel junctions. Their lithographic scalability, compatibility with electrical control signals, potential for high coherence, and the existence of a superconducting classical digital logic all converge to make the superconducting qubit modality the most advanced quantum annealers available today.  Despite D-Wave's truly remarkable engineering achievement, it remains unknown if the D-wave quantum annealer -or quantum annealers in general -will afford quantum advantage for a general class of problems. The limitations of contemporary superconducting quantum annealers include: • Short coherence times (compared with gate-model quantum computers).
• Limited connectivity (thus limiting the optimization problems that can be solved, see Sec. 2.1, or requiring mappings such as the LHZ approach, Eq. (29)).
• Limited ability to tailor the annealing schedule for different qubits (hence limiting inhomogeneous-driving schemes, see Sec. 3.1.2).
At the time of this writing, it is not known if remedying these limitations will practically lead to quantum advantage, but addressing them is a good place to start.

Introduction to superconducting qubits.
As mentioned, superconducting qubits are lithographically defined circuits comprising inductors, capacitors, interconnects, and Josephson junctions [187]. When cooled to milliKelvin temperatures, these circuits exhibit quantized energy levels corresponding to quantized states of electric charge or magnetic flux, depending on the parameter values of these various circuit elements.
In the context of gate-model quantum computation, the lowest-two of these energy levels are generally used as a quantum bit to represent logical states |0 and |1 .
Here, coherence within this reduced manifold is a premium, and, over the past 20 years, research groups worldwide have devoted significant resources to improving qubit coherence through a combination of materials, fabrication, and design advances. State of the art for gate-model qubits stands at approximately 100 µs for both the energy decay time (longitudinal relaxation time T 1 ) and coherence time (transverse relaxation time T 2 ) [188,189,190,12,191].
The approaches taken to achieve this level of coherence are often incompatible with the needs for quantum annealing. For example, qubits with fixed frequencies (determined during fabrication) or qubits with only σ z tunability reduce the number of noise channels that can impact coherence, but do not allow for fully tunable transverse Ising Hamiltonians with σ z and σ x terms. Additionally, the materials (typically aluminum) and fabrication approaches (typically single-metal layers) are kept as simple and minimal as possible to avoid introducing unwanted defects that reduce T 1 and T 2 , at the expense of circuit complexity. There are several types of gate-model qubits in common use today, including the transmon, capacitively shunted flux qubit, and fluxonium.
In contrast, the superconducting qubits used for quantum annealing are generally more complex by necessity. The most common type is called a "X-and Z-tunable flux qubit," comprising superconducting loops interrupted by Josephson junctions (indicated by the symbol "X" in Fig. 7A) [192,193]. In its conventional form, the Z-field is tunable by threading a magnetic field Φ z through the σ z -loop (blue). Clockwise and counterclockwise circulating currents are the two classical (diabatic) states of this circuit and serve to help screen Φ z . When Φ z = Φ 0 /2, where Φ 0 = h/2e is the superconducting quantum unit of flux, both circulating-current states have the same energy and the system is frustrated. The green loop and its junctions serve to couple the diabatic currents, opening an avoided crossing at this degeneracy point. The size of this avoided crossing, the strength of the X-field, is tunable by a magnetic flux Φ x applied to the σ x loop (green). In essence, the potential profile for this circuit is a two-dimensional double well. Each well corresponds to a circulating current, and the applied flux Φ z tilts the wells. The wells are tunnel-coupled through a potential barrier, and the height of this barrier (and, thus, the tunneling rate) is tuned by the applied flux Φ x .
Thus, a X-and Z-tunable flux qubit behaves as a spin-1/2 system in a magnetic field. The applied magnetic flux Φ x,z and magnetic moment I x,z ×A x,z of the flux qubit, where A x,z is the loop area, correspond to the applied field B x,z and magnetic moment µ x,z of the natural spin-1/2, respectively. One important distinction, however, is that the magnetic moment I x of the flux qubit is generally dependent on the applied flux Φ x . This should be contrasted with an ideal spin-1/2, for which the magnetic moment is constant and independent of the applied field. This difference must be compensated to realize ideal spin-1/2 behavior by either a careful calibration of the X-field and, more likely, additional complexity to realize the full range and strength of desired σ x i σ x j coupling.

Approaches to coupling.
Qubit coupling is achieved using mutual inductance between qubits and the qubit circulating currents. Although a direct interaction between qubits is possible in principle, it is not useful in practice, as the coupling strength cannot be tuned independently of the single-qubit field strengths. Rather, achieving large and tunable σ z i σ z j and σ x i σ x j coupling is implemented using intermediate qubits called "couplers" (see Fig. 7B, C).
The couplers are qubits operated in a regime where they behave as tunable inductors with effective inductance L eff . The X-field for the coupling is biased large enough that the coupler always remains in its ground state. Taking σ z i σ z j coupling for example, the coupling energy J between two qubits, each with circulating current I z and mutual coupling M , is J = (I z M ) 2 /L eff . In turn, the inductance of the coupler is tunable using Φ c z applied to the coupler Z-loop, and the coupling can be tuned from ferromagnetic (J < 0) to antiferromagnetic (J > 0). For a detailed description of how this works, see Refs. [194,40]. A similar approach can be used to couple the X loops and achieve σ x i σ x j . By tuning the sign of the σ x i σ x j coupling, one can realize both stoquastic and non-stoquastic Hamiltonians. Furthermore, the couplers themselves can be cascaded to achieve fanout [195]. We note that σ y i σ y j coupling can also be realized using capacitive coupling.
There is a fundamental trade-off between strong coupling and high coherence. Large circulating currents lead to strong coupling, since J ∝ I 2 x,z . However, coherence is severely impacted by the size of the circulating current and its corresponding magnetic moment. For example, the energy relaxation time scales as 1/I 2 x,z and the dephasing time scales as 1/I x,z [40]. Thus, quantum annealers that use large circulating currents generally have low coherence times (around 10 ns), whereas gate-model qubits with high coherence (around 100 µs) all use small circulating currents [12]. The couplers can be used to compensate a weak coupling due to small circulating currents, and coupling strengths J = 1 GHz have been demonstrated using annealing qubits with coherence times around 1 µs [40].
4.1.4. 3D integration: coherence, connectivity, and tailored annealing. Current approaches to building quantum annealers generally use monolithic, multi-layer fabrication processes. The advantage of this approach is that it supports high levels of complexity, high connectivity, and the annealing protocol can be implemented on chip using superconducting control electronics, such as is done for the D-Wave 2000Q. However, this type of multi-layer fabrication exacerbates low coherence times due to the many fabrication steps required and the use of interwiring layer amorphous dielectrics. The result is an abundance of materials and fabrication-induced defects that sap coherence from the qubits.
An alternative approach was proposed in Ref. [41], and is called the "three stack" (see Fig. 8). In this approach, there are three layers with unique functionalities, and each is fabricated separately. The qubit chip is fabricated in a high-coherence process similar to those used for gate-model qubits. The readout and interconnect layer is a multi-layer process similar to conventional annealing processes; it can support dense interconnects, high connectivity, and active devices such as parametric amplifiers [196] and SFQ electronics [197]. To isolate this layer from the qubit chip, an interposer layer is introduced. The interposer is made of low-loss, intrinsic silicon and features superconducting through-silicon vias to bring signals from the readout and interconnect layer up to the qubit plane. By keeping the readout and interconnect layer far from the qubit chip, the electric and magnetic fields associated with the qubits are constrained to the low-loss qubit and interposer layers. The layers are then joined together using standard indium bump-bonding techniques.
With the three-stack or similar approaches, one can controllably study the impact of high coherence, high connectivity, and non-stoquastic coupling on the quantum annealing and its potential to achieve quantum advantage. Each layer can be fabricated independently and then combined in a manner that is itself extensible. This approach also supports both on-chip and off-chip control electronics, to enable annealing protocols that are tailored to each qubit and coupler individually. Once the key to achieving quantum advantage is understood, one can then focus on building larger machines using the most appropriate approach(es).

Trapped ions
While the mainstream platform for quantum annealing is superconducting qubits, with impressive progress in recent years, there do exist alternative systems. In particular, since atomic levels do not suffer from fabrication errors, quantum simulation platforms based on neutral or charged atoms represent promising alternatives in order to implement quantum annealers for practical applications or as highly-controlled test beds for fundamental questions. Examples include Rydberg atoms (see next section) and trapped ions, which we discuss in this section.
Charged ions [1] can be trapped by electromagnetic confining potentials in high vacuum, e.g., in linear Paul traps or Penning traps. The ions can then be cooled almost to the absolute ground state of their motional degrees of freedom, through which they arrange in regular crystal patterns due to their mutual Coulomb repulsion. The constituent qubits are encoded as two electronic hyperfine states, on which singlequbit rotations can be realized with extremely high fidelity by laser light or microwaves, depending on the chosen hyperfine states. These single-qubit terms can have arbitrary spatial dependence, thanks to single-ion addressing [198,199,200]. Due to the strength of the Coulomb repulsion, the distance between individual ions is typically on the order of a few micrometers, over which direct interactions between the hyperfine qubits are negligible. To introduce effective qubit-qubit interactions, one can couple the qubits off-resonantly via laser or microwave radiation to the collective phononic vibrations of the ion crystal [201,202,203]. Eliminating the phonons in second-order perturbation theory, the resulting interaction yields an Ising Hamiltonian with long-ranged coupling terms J ij [204,205,206,198,207,208].
Thus, including single-qubit rotations, the natural Hamiltonian for trapped-ion systems is Time dependence of the Hamiltonian parameters is controlled simply by ramping laser intensities. This Hamiltonian thus provides all the ingredients for a basic annealing protocol.
As one limiting factor, the spatial dependence of the interactions J ij is determined by the laser parameters, dimensionality of the crystal, and the phonon modes. If a single frequency µ is used to generate the interactions, they are typically given by [206] Here, Ω i is the laser Rabi frequency at ion i, ω q is the frequency of the phonon mode q, and η q i is determined by the amplitude of the vibrational mode q at ion i. In typical implementations, the frequency that generates the spin-spin interactions is tuned outside of the phonon band. For small systems, can then be approximated by a tunable powerlaw in the range α ∈ [0, 3] [208], while for larger systems it is better represented by a combination of an exponential and a dipolar decrease with distance [207]. Following a groundbreaking adiabatic protocol A) B) Figure 9. Adiabatic preparation of a spin-model ground in trapped ions, as reported in Ref. [206]. The model is described by the Hamiltonian Energy levels E relative to the ground state E g , computed for N = 6. The thick red line denotes the first excited level that gets coupled to the ground state and which defines the minimum gap ∆ c . The ramp begins at B(t = 0) ≡ B 0 max(J ij ), which also sets the energy scale. (B) For experimentally accessible ramp times, the overlap of the final state with the ground state rapidly decreases with system size. This effect can be mitigated by optimizing the ramp, for example, by being slower close to the minimum gap (denoted local adiabatic). Taken from Ref. [206].
involving only two spins [209], quantum annealing with such long-range interactions and homogeneous fields has been realized with up to 18 qubits [205,206] (although phrased in the context of quantum simulation and adiabatic state preparation of spin-model ground states). Though the trade-off between annealing speed and decoherence limited the overlap to the ground state at the end of the sweep, see Fig. 9, these experiments opened the road towards adiabatic protocols using trapped ions.
To proceed further on the road towards applications in quantum annealing, one requires besides engineering improvements (see below), the ability to freely program a desired optimization problem. In Ref. [144], it was shown that Hamiltonian Eq. (30) is sufficient to encode Coulomb glass problems as well as a variety of NP-complete models in a completely programmable manner, including the knapsack problem, number partitioning, and instances of the max-cut problem. To do this, one requires in addition to the programmable local rotations B β i only the ability to locally adjust the intensity Ω i in Eq. (31). Both are experimentally feasible with existing singlesite addressing-techniques using acousto-optic modulators or digital-mirror devices [199,200]. An alternative approach to obtain benchmark models for quantum annealing is by generating deterministic instances of disordered spin-glass problems. This can be realized by tuning the radiation frequency µ into the phonon band [210], which generate interactions with high randomness and thus glassy behavior. Although the models resulting in this way may not be freely programmable, they can nevertheless provide valuable tests for quantum annealing protocols. Finally, if a number of frequencies that scales with the number of qubits is used to induce the qubit-qubit interactions, arbitrary interaction patterns J ij can be programmed [211].
Regarding trap design and qubit coherence, there has been strong progress since the seminal experiments of Ref. [209,205,206]. Quantum simulations in linear Paul traps are now performed with tens of ions [212,213]. Cryogenic traps will allow experimentalists to scale to 100 ions or more [214]. Numbers of this order are already being achieved in Penning traps [204], where adiabatic protocols in the Dicke model (an infinite-range spin model coupled to a bosonic mode) have recently been attempted for ∼ 70 ions [215]. Segmented traps [216] and microfabricated surface traps [217,218,219,220] also open new possibilities for the design of interaction patterns. All the while, trapped-ion experiments are characterized by extremely large degrees of coherence. Life-times of the qubit levels are on the order of several seconds. Current limiting factors are fluctuations of magnetic fields and laser intensities. Often, these limit coherence times during the dynamics to several milliseconds, a time scale that is comparable to the one set by typical interaction strengths on the order of several kHz [212,215]. By use of decoherence-free subspaces, these times can be increased to tens of milliseconds [198,200].
As an interesting feature in this context, trapped ions allow for testing the effect of decoherence on the annealing protocol in a controlled manner. For example, by letting the local fields fluctuate in a temporally random way, dephasing noise can be simulated [221,144]. Recently, it has been demonstrated that practically arbitrary noise spectral functions can be realized in this way, which has been used to study the effect of noise on quantum transport in a toy model for a photosynthetic complex [200]. It may be interesting to extend such analyses to true "quantum baths", e.g., by resonant coupling to phonon modes as has been realized in a small toy model in Ref. [222]. Thus, the trapped-ion platform may provide a valuable test bed to benchmark quantum annealing subject to designed forms of decoherence.

Neutral atoms
Another ideal platform for quantum simulation in and out of equilibrium is Rydberg atoms, which offer large coherence times and individual control of qubits [223]. Using so-called dressing schemes [224], Rydberg atoms can implement Ising-spin models [225,226,227,228] with nearest neighbor interactions. In the context of adiabatic quantum computing, a natural question is which models can be directly implemented in current neutral atom platforms, given that interactions are induced by laser-fields, which are challenging to tune individually.
In Ref. [83], the authors propose a Rydberg implementation for coherent quantum annealing based on the LHZ scheme [77], see Eq. (29). In this square lattice scheme, interactions are problem independent [C n in Eq. (29)] and can thus be applied using a few lasers globally among all atoms. Local fields, which encode the optimization problem [J i in Eq. (29)], can be applied with digital mirror devices with single site resolution. The main challenge in implementing LHZ is the realization of the required 4-body ZZZZ interactions. In Ref. [83], this issue is solved by a decomposition of the 4body interactions into 2-body interactions with an overhead of one ancilla per plaquette.
These ancilla atoms are required to be of a different species (e.g., by combining Rb and Cs atoms), which poses an experimental challenge.
A direct implementation of the maximum independent set problem in neutral atoms was recently introduced in Ref. [229]. The maximum independent set of a graph is defined as the maximum set of vertices where no pair of vertices in each subset is connected by an edge. An experimental simplification can be achieved when restricting to unit disk graphs, which still leaves the problem NP-complete. These are planar graphs where vertices are placed on a two-dimensional grid and edges between vertices with a pair-wise distance smaller than a certain threshold. This problem can be directly mapped to Rydberg atoms using the so-called blockade mechanism, by which there exists a blockade radius around any atom that is excited to a Rydberg state within which no other atoms can be excited. Thus, the proposal serves as a direct test of quantum optimization without overhead from embedding.
As pointed out in Sec. 2.1, a main challenge in quantum annealing is the infinite range interactions typical for hard optimization problems. In Ref. [230], the authors propose a direct implementation of an infinite range system with atoms in a cavity. The encoded problem is a variation of the N -Queens problem. The N -Queens problem is: given an N × N chess-board, place N queens such that no pieces attack each other (queens attack along rows, columns, and diagonals). A particular variation, the blocked-diagonals variation, is NP-complete and cannot be solved with current classical algorithms for N > 21. The problem can be mapped to an annealing protocol with N N -level systems. Each N -level system represents a column of the chess-board and each level a row. For example, if the second level of the third N -level system is occupied it means that a queen is placed on the second row of the third column on the chess board. The N -level systems are realized with atoms in one-dimensional optical lattices and the non-attacking condition is implemented with strong repulsive interactions along diagonals and rows. These interactions are mediated by the optical cavity and infinite range. The proposal is an attractive test bed for direct implementations of optimization problems. In particular, for this problem classical algorithms fail already for small numbers of atoms that will be available in near term. The proposal of Ref. [230] is based on coherence times limited to the order of 100ms and interaction strengths of Jt = 50, which is the same order of magnitude as already demonstrated in experiment [231].

Summary
With the advent of intermediate scale quantum devices, we are currently witnessing the start of the second quantum revolution, where quantum mechanics will be used as a tool for computations tasks. With its relative simplicity and reduced demands on resources, quantum annealing may serve as one of the first crucial stepping stones towards fullfledged quantum computing. In this perspectives article, we have summarized possible routes towards solving the open questions in the field, including the question of possible quantum speedup from novel protocols, new encoding strategies that enable high coherent qubit platforms for quantum annealing, and novel hardware architectures that push the limits of coherence and control. The field is still open and evolving rapidly, and we hope this article serves as a guideline for future research paving the way for quantum technology as a computational tool.