← Back to digest

A Penalty-Free Pipeline for Direct Quantum-Annealer Portfolio Optimization

Luis Lozano · arXiv:2605.17628 · new submission 2026-05-19 · score 10/10

Abstract

Direct quantum-annealer portfolio optimization is commonly formulated as a penalty-encoded QUBO and submitted to D-Wave hardware. We show that this standard formulation fails on current devices and identify the structural reason: the cardinality penalty contributes a dense rank-one term proportional to the all-ones matrix that makes the logical interaction graph complete regardless of the covariance structure. On Pegasus and Zephyr, chain-break fractions reach 83 percent at N equal to 24 and 92 percent at N equal to 49, producing no feasible samples. Attempting to fix this through topology-aware sparsification reveals a second problem: any sparsifier that removes off-diagonal entries also dilutes the cardinality constraint, so raw samples remain infeasible even when chains no longer break, and an ablation shows that for structurally favorable cases such as betting with settlement-graph priors the classical feasibility projector alone explains the result rather than the QPU. We propose dropping the penalty entirely: build an objective-only QUBO from the expected returns and the risk-scaled covariance, sample it on hardware, and enforce the cardinality constraint classically as a post-processing step. On D-Wave Advantage and Advantage2 for equities up to N equal to 49 and betting up to N equal to 48, mean chain-break fractions per sample averaged over reads drop from the range of 71 to 92 percent down to at most 0.04 percent. The QPU returns lower-energy feasible portfolios than the greedy heuristic on betting at N equal to 39 and 48, which is an energy comparison and not a proof of optimality, and the equity post-processed regret is at most 0.03 percent at all tested scales. These results establish that the penalty encoding, not the sparse hardware topology, is the binding constraint for direct QPU portfolio optimization at currently accessible scales.

Executive summary

This paper diagnoses why the standard penalty-encoded cardinality-constrained portfolio QUBO fails on D-Wave hardware (the rank-one penalty term A·1·1ᵀ makes the logical graph complete regardless of the covariance structure), shows that the obvious fix (topology-aware sparsification) creates a second failure mode (constraint dilution), and proposes a penalty-free pipeline: objective-only QUBO on hardware + classical cardinality projection. Chain-break fractions drop from 71–92% (penalty) to ≤ 0.04% (penalty-free) on Pegasus and Zephyr up to N = 49 equities and N = 48 betting. This is the closest method-level match in today's batch to Yuan's Y2: both papers independently identify the cardinality penalty as the structural bottleneck and respond by removing it (Y2 via hard mixer, this paper via classical projector). Reading this paper today is a high-leverage cross-paradigm validation of the Y2 design choice.

Main contribution

Three findings, in order of importance: (i) the cardinality penalty is the binding bottleneck on current direct-QPU portfolio optimization, not the sparse hardware topology — proved by ablation against random projection; (ii) sparsifying the penalty-encoded QUBO does not work because any off-diagonal entry removed is also constraint mass removed; (iii) the objective-only QUBO with classical post-processing yields essentially-zero chain breaks and competitive regret on live D-Wave Advantage and Advantage2 at the full FF49 universe. The paper is paired with companion 2605.17623, which extends the same diagnosis to the hybrid CQM/BQM layer.

Key theorems / experimental results

Detailed walkthrough

This is the most direct method-level match in today's batch to Yuan's Y2 (quasi-binary encoding with hard mixer, no penalty terms). The author and Y2 reach independently the same structural conclusion: penalty terms for hard combinatorial constraints are the binding bottleneck on current quantum hardware, and the right response is to remove them entirely. The paper attacks the cardinality-constrained portfolio problem on D-Wave Advantage and Advantage2 — exactly the hardware substrate adjacent to Yuan's QAOA-portfolio work in Y3.

The structural diagnosis (§1.1, §3.1) is sharper than I've seen before. The cardinality penalty A(1ᵀx − K)² expands to A·1·1ᵀ + diagonal + constant. The 1·1ᵀ rank-one matrix adds the value A to every off-diagonal entry of Q. For betting (block-diagonal natural covariance, one 3-clique per match) this turns a 3M-edge instance into a (3M choose 2)-edge instance. For equities (dense natural covariance) it compounds the existing density. Either way, after the penalty is added, the logical interaction graph is K_N and minor-embedding cost on Pegasus / Zephyr scales catastrophically.

Section 4 (Topology-Aware Sparsification) is the failed-fix arc. The author considers four sparsification families: threshold, top-k, domain-prior, and domain-prior with residual edges. Sparsification reliably reduces chain-break fractions to near zero — but Propositions A.1 / A.2 quantify the inevitable side effect: any off-diagonal entry removed is also penalty mass removed, so the cardinality constraint is diluted. The raw samples from a sparsified penalty-encoded QUBO are infeasible (cardinality ≠ K) even when chains hold. This is exactly the failure mode Y2 predicts a priori: any penalty-encoded approach to a hard constraint will fight the encoding's own constraint enforcement against the device's connectivity budget.

The §4.4 ablation is the key honesty move. On betting with settlement-graph priors, the sparsify-and-project pipeline produces zero-regret feasible portfolios. But when the author substitutes random projection for QPU sampling, the random-projection result degrades with N while QPU+projection stays competitive — and on block-diagonal favourable cases, the zero-regret outcome is explained by the projector's backward-elimination behavior alone, not by the QPU. The author writes (§4.4): "Sparsification plus projection is a working pipeline, but the QPU contribution is unclear and the result is dominated by classical post-processing." This is the cleanest published statement of the "where is the quantum actually contributing?" question on this problem class.

Section 5.10 (Penalty-Free Pipeline) is the constructive contribution. The proposal: drop the penalty entirely. Build Q_obj = −diag(μ) + λΣ (objective only). Sample on QPU. Enforce 1ᵀx = K classically via greedy backward elimination. Live hardware results (Tables 5–6, Figures 6–8): chain-break fractions drop from 71–92% (penalty) to ≤ 0.04% (penalty-free) at N up to 49 on Pegasus and Zephyr. For betting at N ∈ {39, 48}, post-processed QPU portfolios are lower-energy than the greedy reference (an energy comparison, not a proof of optimality). For equities, post-processed regret is ≤ 0.03% across all scales including full FF49.

The author is careful not to claim quantum advantage: §1.4 explicitly says "we do not claim quantum advantage; we claim that the penalty encoding, not the hardware, is the binding constraint." This is the right level of claim, and it is structurally identical to Y2's framing of quasi-binary encoding as a constraint-aware design rather than a speed-up.

For Yuan, the cross-paradigm parallel is striking. Y2 attacks the same problem on gate-based QAOA via a hard-constraint mixer; this paper attacks it on annealer hardware via classical post-processing of the projector. Both work because they avoid representing the hard constraint in the Hamiltonian itself. The natural Y2 / Y3 follow-up is a unified comparison: quasi-binary + hard-mixer QAOA vs. objective-only-QUBO + classical projector annealing, on the same DGMVP instances. The paper does not run this comparison, but provides all the apparatus needed.

Figures

Figure 1
Figure 1. Three-stage direct-QPU pipeline for penalty-encoded portfolio QUBOs: sparsification improves embeddability, but induces constraint dilution, so feasibility-aware post-processing is required to recover exact-K portfolios.
Figure 2
Figure 2. Raw QPU sample cardinality collapse under penalty encoding. Left: target cardinality K relative to the all-ones (N) and random (N/2) references. Right: pre-projection feasibility rate collapses toward zero with increasing N, confirming that the QPU does not enforce the cardinality constraint at these chain-break rates.
Figure 3
Figure 3. Mean chain length versus problem size for dense and best-sparse (top-k, k = 1 for equities; settlement graph for betting) QUBOs on both topologies. Dense chain lengths grow with N; the plotted sparse variants maintain unit chains at all scales. Other sparsifiers achieve near-unit chains (see Table~).
Figure 4
Figure 4. QPU vs.\ projector ablation. Random projection (mean) degrades with N, while all-ones projection and greedy construction achieve near-zero regret, indicating that the classical projector contributes significantly to the pipeline's output quality on penalty-encoded dense QUBOs.
Figure 5
Figure 5. Logical edge counts after sparsification for equity and betting instances. The dense baseline (\binom{N}{2} edges) is not shown; all four sparsifiers reduce the edge count substantially, with domain-prior methods achieving the largest reduction while preserving the domain interaction structure.
Figure 6
Figure 6. Objective regret vs.\ qubit overhead ratio. Lower-left is better. Domain-prior methods achieve low regret at low overhead, while threshold and top-k trade more aggressively.
Figure 7
Figure 7. Physical qubit count and mean chain length on Pegasus vs.\ Zephyr for the same logical graphs. Zephyr consistently achieves lower overhead.
Figure 8
Figure 8. Qubit overhead ratio and mean chain length as a function of logical graph density. All instances up to N = 49 embed successfully on both topologies, but denser graphs require more physical qubits and longer chains. Zephyr achieves lower overhead than Pegasus throughout.

Citations to Yuan's papers

No direct citation to any of Y1–Y6 found in bibliography.

Overlap with Y1–Y6

Recommended action for Yuan