Design proposal — analytic SCF-energy gradient w.r.t. basis parameters¶

Status: Phase 0 complete and verified. Maintainer selected the phased approach (§4): Phase 0 = integral-level FD + analytic assembly. The build-independent assembly (vibeqc.basis_optimization.energy_gradient) has landed and is verified against a self-contained mock RHF (tests/basisset_dev/test_energy_gradient_assembly.py). The real VibeqcIntegralProvider path was verified on a build — mars, 2026-06-17 (planetx was administratively down), via scripts/basisset_dev/verify_energy_gradient.py: on closed-shell H₂ / pob-TZVP / RHF (a multi-centre case), the frozen-P/W assembly matched the full re-SCF total-energy finite difference to |Δ| ≤ 1.5×10⁻⁷ Ha (diffuse-s exponent 1.4×10⁻¹⁰; contracted-s coefficient 1.5×10⁻⁷). Phase 1 (analytic shell-incrementation) is now fully specced and grounded (§4) — confirmed C++/libint with the gradient.cpp pattern — but is build-gated: it cannot be developed via the vq path that served Phase 0 (§4 “Build constraint”), so it awaits a build environment.

1. Why¶

optimize_bdiis currently finite-differences the SCF energy term of the objective Ω = E + penalty. FD costs 2·n_params SCF solves per gradient and is noisy. The penalty term already has an analytic gradient (gradients.py, commit 50d73b99). The remaining piece is the analytic energy gradient dE/dη w.r.t. each free basis parameter η (a Gaussian exponent α or a contraction coefficient c). This is what makes BDIIS cheap on real periodic SCF and is the headline accuracy claim for the basis-optimisation paper.

2. The math — it is the nuclear gradient with ∂/∂R → ∂/∂η¶

For RHF (closed shell), with density P = 2·Σ_i^occ CᵢCᵢᵀ and energy-weighted density W = 2·Σ_i^occ εᵢ CᵢCᵢᵀ:

dE/dη = Σ_μν P_μν (∂T_μν/∂η + ∂V_μν/∂η)
      + ½ Σ_μνλσ P_μν P_λσ ∂(μν‖λσ)/∂η
      − Σ_μν W_μν ∂S_μν/∂η
      + ∂E_xc/∂η            [KS only]

where (μν‖λσ) = (μν|λσ) − ½(μσ|λν) for RHF (hybrid-scaled for KS), and the nuclear repulsion E_nn does not depend on basis parameters. The −Σ W ∂S/∂η Pulay term captures the wavefunction response exactly (variational HF/KS ⇒ no CPHF needed), the same reason it works for nuclear gradients.

This is structurally identical to the existing nuclear gradient. vibe-qc already assembles W and the Pulay term in cpp/src/gradient.cpp (W = 2 Σ ε_i C_μi C_νi, line ~607) and contracts P/W against libint deriv_order=1 integral derivatives (gradient.hpp). The periodic analogue is periodic_gradient.cpp (lattice-sum Pulay). We reuse all of that. Only the integral-derivative source changes: ∂/∂R → ∂/∂η.

3. The only new piece — integral derivatives w.r.t. η¶

libint provides geometric derivatives ∂/∂R (deriv_order=1) natively; it does not provide exponent derivatives. Two sub-cases:

Contraction coefficients (easy). A contracted integral is linear in its contraction coefficients: I = Σ_p c_p·(primitive_p term). So ∂I/∂c_p is just the integral re-contracted with a unit coefficient on primitive p and zero elsewhere (carrying the per-primitive normalisation). No new integral type — reuse the existing libint path with a modified contraction. This already matches the closed-form coefficient gradient validated for the penalty in gradients.py.
Exponents (the real work). ∂/∂α of a normalised primitive splits into a normalisation-derivative term plus −r²·(primitive). Since r²·g_l raises angular momentum, ∂I/∂α is a linear combination of integrals over the same shells with the varied shell promoted l → l+2 (plus l and lower terms from normalisation). libint computes arbitrary-L integrals, so the l+2 integrals are available; the exponent derivative is assembled in our layer from them. This is the standard shell-incrementation route used by derivative-Gaussian theory.

4. Decision point — integral-derivative strategy (needs maintainer input)¶

Option	What	Cost / accuracy	Effort
A. Integral-level central FD	perturb α, recompute integrals via libint, FD	~2 integral builds/param; FD-grade	low
B. Analytic shell-incrementation	assemble ∂I/∂α from libint l+2 integrals + normalisation term	exact; reusable	high (C++)
C. Coefficient derivatives	unit-recontraction (both A and B need this; trivial)	exact	trivial

Recommendation: ship in two phases.

Phase 0 — Option A (integral-level FD) + the analytic assembly. Gets a correct, validated W-Pulay assembly pipeline end-to-end fast, reusing gradient.cpp’s P/W contraction. Even at FD-grade integral derivatives this validates everything except the exponent-derivative integrals themselves, and is the reference for Phase 1.
Phase 1 — Option B (analytic shell-incrementation) for the exponent integrals → the production gradient. Coefficient derivatives (C) are exact in both phases.

Resolved (investigation 2026-06-17 against the C++ tree):

A-then-B — A (Phase 0) is done and verified; B is Phase 1.
Where: C++, confirmed required. The exponent derivative needs per-shell / per-primitive control over libint2::Shell objects (build l+2-promoted auxiliary shells, combine with the −r² + normalisation terms). vibe-qc drives libint2 directly — basis.libint() returns the libint2::BasisSet, gradient.cpp builds libint2::Engine(Operator, max_nprim, max_l, deriv_order) and loops shell pairs, and ao_eval.cpp reads shell.alpha / shell.contr. The Python bindings expose only contracted, whole-molecule integrals (compute_overlap(basis) etc.) — no per-shell hook — so a Python l+2 route is not available. Phase 1 lands in a new cpp/src/basis_param_gradient.cpp reusing the gradient.cpp engine+shell-loop pattern, exposed via a binding, behind an AnalyticIntegralProvider swappable for VibeqcIntegralProvider.
First cut: HF molecular, deferring KS (∂E_xc/∂η) and periodic (reuse periodic_gradient’s lattice-sum Pulay).
libint capability: yes. The engine is constructed with max_l, and shells are built/iterated as libint2::Shell (see gradient.cpp, ao_eval.cpp, schwarz.hpp shift_shells_to_cell), so arbitrary-L / ad-hoc shells are available in C++.

Build constraint (the gate on Phase 1)¶

Phase 1 is integral-engine C++ that must compile + pass tests before it lands on main (CLAUDE.md § 14: the build-test CI job is the build-break gate). Verifying C++ on planetx/mars via vq does not help here: vq runs the already-deployed build, and --refresh rebuilds from main — so the C++ would have to be pushed to main before it could be compiled there, which is exactly the pattern § 14 forbids (an un-compiling push reds the build gate for everyone). Phase 0 was developable build-free because it is pure Python; Phase 1 is not.

Therefore Phase 1 needs a build environment for develop+validate (a local toolchain, or the maintainer building on a dev host with CI gating), not the vq payload path that served Phase 0. The design above is complete and grounded; the implementation is the build-gated next step. The Phase-0 integral-FD gradient is the ready-made numerical reference to validate it against.

5. Integration with what exists¶

The result plugs straight into the optimiser as a grad= callable. The composed gradient is energy_grad(x) + penalty_grad(x), where:

penalty_grad = vibeqc.basis_optimization.penalty_gradient(...) (already landed, analytic).
energy_grad = new analytic energy gradient (this proposal), mapped to optimiser space by the same log/linear chain rule the penalty gradient uses, and to the per-(shell, primitive, field) free-spec layout of BasisParametrisation.

The reusable kernel from gradients.py — the analytic same-center ∂S/∂α — is exactly the atomic block of the molecular ∂S/∂η Pulay term, so it doubles as an independent unit check on the new multi-centre overlap derivative.

6. Validation plan¶

Integral level: chosen ∂I/∂η vs central FD of libint integrals; the gradients.py same-center analytic ∂S/∂α as an independent check on the atomic block.
Energy level: dE/dη vs central FD of the SCF energy (the current BDIIS FD path) to ≤ 1e-6 Ha per unit η, on single-atom HF then a small molecule.
Parity: optimise a known case and compare to CRYSTAL OPTBASIS (the vibe-basis parity runs drive CRYSTAL out-of-process for the reference, per CLAUDE.md § 10).

7. Phasing summary¶

P0  HF, integral-level FD exponent derivs + analytic W-Pulay assembly  (validates pipeline)
P1  HF, analytic shell-incrementation exponent derivs                  (production gradient)
P2  KS (add ∂E_xc/∂η)  →  periodic (lattice-sum Pulay)                  (coverage)

Each phase is independently landable and FD-validated before the next. All phases need a build; implementation waits on the §4 decisions.