← Back to digest

Asymptotic optimality of Grover–Radhakrishnan–Korepin algorithm

Kun Zhang, Kang-Yuan Chen, Xiao-Hui Wang, Vladimir Korepin · arXiv:2604.15886 · new submission 2026-05-24 · score 8/10

Abstract

Grover's algorithm is a cornerstone of quantum algorithms and is strictly optimal in oracle-query complexity. While the full search problem admits no further improvement, one may trade accuracy for speed in the partial search problem, where the task is to identify only the block containing the target item. The best known quantum algorithm for the partial search problem is the Grover–Radhakrishnan–Korepin (GRK) algorithm, whose optimality has long been conjectured but not proved. In this work, we prove the optimality of GRK in the large-block limit. We formulate partial search as a time-optimal control problem and apply the Pontryagin maximum principle to derive the switching-function dynamics, establish the bang–bang structure of regular extremals, and exclude non-optimal switching patterns. As a result, we show that the optimal regular extremal has the global–local–global form, which yields a control-theoretic proof of the asymptotic optimality of the GRK algorithm in oracle-query complexity.

Executive summary

Zhang, Chen, Wang, and Korepin (the same Korepin in the algorithm's name) settle a long-open conjecture: the Grover–Radhakrishnan–Korepin (GRK) algorithm is asymptotically optimal for partial search in the large-block limit. The paper recasts partial search as a time-optimal control problem on the 3-dimensional reduced space spanned by |t⟩, |ntt⟩, |u⟩, applies the Pontryagin maximum principle to get a bang-bang structure between the two skew-symmetric generators X, Y ∈ 𝔰𝔬(3) corresponding to global and local Grover, and proves that the only optimal pattern is XYX — exactly the global–local–global structure of GRK. For Yuan, the directly relevant theme is Y4: the Grover-family techniques and the design of structured oracles/diffusion stages are the same ingredients underlying Y4's cardinality-constrained algorithm, and the control-theoretic optimality framework presented here is a methodological template that could be adapted to argue (in)optimality of multi-stage Grover algorithms with structured feasible spaces.

Main contribution

The paper proves a structural optimality theorem (Theorem 5.4) for partial quantum search: within the asymptotic continuous-limit formulation, the only nontrivial time-optimal operator ordering is G_n G_m^{k₂} G_n^{k₁} (global–local–global), with the GRK parameters tan(2η_K/√K) = √(3K−4)/(K−2) and cos(2α_K) = (K−2)/(2(K−1)). Mechanically: in the large-block limit each elementary Grover step is a near-identity rotation, so the discrete operator sequence becomes a time-optimal switching problem between two SO(3) generators. The Pontryagin maximum principle then forces a bang-bang structure on regular extremals. Two key compression theorems exclude all longer alternating sequences: any interior three-bang block of the form YXY or XYX can be replaced by a single shorter bang with the same reduced endpoint data. Together with endpoint arc lemmas in the appendix (the optimal extremal must start and end with X), this collapses the admissible patterns down to X (trivial) and XYX (the GRK structure).

Key theorems / lemmas / algorithms

Detailed walkthrough

The partial-search problem asks: in an unstructured database of N = 2n items with a unique target |t⟩, decompose t = t₁ t₂ with |t₁| = n−m, partition the database into K = 2n−m blocks of size b = 2m, and identify the block t₁ without recovering t₂. Grover & Radhakrishnan (2005) showed this is faster than full search; the GRK algorithm G_n G_m^{k₂} G_n^{k₁}|s_n⟩ with optimised k₁, k₂ achieves an asymptotic query count of (π/4)√N − (η_K − α_K)√b + O(1). Multiple alternative orderings (global-local, local-global-local-global, etc.) were studied but no one proved XYX was best.

The 3D reduction (Sec. 2.2) collapses the entire dynamics to the basis {|t⟩, |ntt⟩, |u⟩}, where |ntt⟩ is the equal superposition over non-target states inside the target block and |u⟩ is the equal superposition over states in non-target blocks. With sin θ₂ = 1/√b and sin γ = 1/√K, the initial state is |s_n⟩ = sin γ sin θ₂ |t⟩ + sin γ cos θ₂ |ntt⟩ + cos γ |u⟩ and the success condition is simply that the |u⟩-amplitude vanish. In the large-block limit, both θ_1 = arcsin(1/√N) and θ_2 = arcsin(1/√b) are infinitesimal, so the global and local Grover operators become exponentials of skew-symmetric generators X, Y ∈ 𝔰𝔬(3): X = [[0, s², sc], [−s², 0, 0], [−sc, 0, 0]], Y = [[0, 1, 0], [−1, 0, 0], [0, 0, 0]], with s = sin γ, c = cos γ.

The time-optimal control problem (Sec. 3) is then: starting from |s_n⟩, choose a piecewise control A(t) ∈ {X, Y} to reach the terminal plane Σ = {⟨u|ψ⟩ = 0} in minimum total time. The Pontryagin maximum principle gives necessary conditions through a switching function ϕ_X(t) − ϕ_Y(t); positive ⇒ use X, negative ⇒ use Y. Switching can happen only at zeros of the switching difference.

The technical core is Sec. 4. The authors derive a "universal reduced switching map": although the individual switching functions ϕ_X, ϕ_Y, ϕ_Z (with Z = [X, Y]) are nontrivial along each arc, at each switching point they collapse to canonical coordinates (0, a, b) (with a ≠ 0 for regular switchings), and one full switching-to-switching arc — of either type X or type Y — sends (0, a, b) ↦ (0, −a, b). This is the key observation: both arc types implement the same reduced map on the switching surface; only their durations differ. The X-arc has duration τ_X = π/sin γ (Lemma 4.2); the Y-arc has duration in (0, 2π), and consecutive Y-arcs separated by one bang satisfy ℓ₁ + ℓ₂ = 2π (Lemma 4.3).

The compression argument (Sec. 5.1) then becomes a simple time-comparison. An interior YXY block sends (0,a,b) ↦ (0,−a,b) overall and has total length 2π + π/sin γ; a single X-arc induces the same map in time π/sin γ. Hence the YXY block is strictly suboptimal. The dual argument excludes interior XYX blocks (overall length 2π/sin γ + ℓ > ℓ, beaten by a single Y-arc). Combined with two appendix lemmas showing the first and last bang must be X, every regular normal time-optimal extremal has the structure X (trivial) or XYX (non-trivial). The XYX pattern is precisely the global–local–global structure of GRK.

The final step (Theorem 5.4) is to optimise arc lengths within the XYX family; this reduces to the standard GRK parameter optimisation, recovering tan(2η_K/√K) = √(3K−4)/(K−2) and cos(2α_K) = (K−2)/(2(K−1)). So GRK is asymptotically optimal in oracle-query complexity in the large-block regime.

The discussion (Sec. 6) raises two caveats Yuan should note. First, the optimality is in oracle-query count; circuit depth — the actually-relevant resource on NISQ hardware — is a different question, and Zhang et al. explicitly flag that "GRK remains optimal in depth only in the regime where the oracle depth dominates the total circuit cost." Second, a recent companion paper (Jiang et al., 2026) shows that for m > n/2 GRK is not efficient relative to classical, and the authors leave open whether alternative diffusion operators can recover an advantage in that regime — a natural opening for follow-up.

Figures

No figures extracted from source (the paper is purely analytical and contains no figures).

Citations to Yuan's papers

No direct citation to any of Y1–Y6 found in bibliography.

Overlap with Y1–Y6

Recommended action for Yuan