In the previous chapter we met the Josephson junction and used it to build the charge qubit, the Cooper-pair box, a tiny superconducting island whose state is set by how many Cooper pairs have tunnelled across the junction. It made a beautiful two-level system, but it had a fatal flaw, it was painfully sensitive to stray electric charge. A single fluctuating charge near the island would smear out the qubit frequency and kill coherence in microseconds. The transmon is the fix, and it turned out to be such a good fix that it became the workhorse of nearly every superconducting quantum processor built today.
The whole trick is one design choice: make the Josephson energy
flowchart TD
A["Big shunt capacitor → small E_C → large E_J/E_C (~50+)"] --> B["Phase φ localized in a deep cosine well"]
B --> C["Charge n delocalized over many charge states"]
C --> D["Charge dispersion ~ exp(−√(8 E_J/E_C)) : exponentially suppressed"]
D --> E["Long T₂, offset charge n_g ignorable"]
A --> F["Potential nearly harmonic (Duffing oscillator)"]
F --> G["Weak anharmonicity α ≈ −E_C"]
G --> H["Gate-speed limit / leakage to |2⟩ → DRAG"]
A --> I["ω_q = √(8 E_J E_C) − E_C : a few GHz"]
The single most important point: the transmon is not a new circuit. It is the same Josephson-junction-plus-capacitor as the Cooper-pair box, with the same Hamiltonian. We only change one number, the ratio
$E_J/E_C$ , by adding a large shunt capacitor that lowers$E_C$ .
Here
Where does the factor of
The convention
Charge qubits live around
Think of the
The rigorous statement is a Bloch-band argument:
The decisive feature is the factor
Intuition aside. It's like tuning a heavy bell versus a light one. A featherweight bell rings at a pitch that changes with every breeze (every stray charge). A massive cathedral bell rings at essentially the same note no matter what the air does. We traded a nimble, twitchy qubit for a heavy, steady one, and steadiness is what coherence needs.
Misconception check.
$n_g$ does not disappear from the Hamiltonian. Its effect on the frequency is exponentially suppressed. The qubit still has an$n_g$ in it; you just can't measure the resulting frequency wobble.
Deep in the well the potential is nearly parabolic, so the transmon is a weakly anharmonic (Duffing) oscillator: a harmonic ladder with a small quartic softening. Let's derive its whole spectrum and read everything off one line.
Step 1: drop
Step 2: expand the cosine. Near the bottom of the well,
Step 3: identify the harmonic oscillator. The piece
Step 4: introduce ladder operators. Writing
Step 5: treat the quartic term as a perturbation. Here is the magic cancellation:
Step 6: collect. Putting it together (Koch et al. 2007):
The three terms are exactly the three pieces of physics:
Now read off the observables:
-
Qubit frequency.
$\hbar\omega_{01} = E_1-E_0 = \sqrt{8E_JE_C},[(1+\tfrac12)-(0+\tfrac12)] - \tfrac{E_C}{12}[(6{+}6{+}3)-(0{+}0{+}3)]$ $= \sqrt{8E_JE_C} - \tfrac{E_C}{12}(12) = \sqrt{8E_JE_C} - E_C.$ -
Absolute anharmonicity.
$E_{12}=E_2-E_1=\sqrt{8E_JE_C}-2E_C$ , and$E_{01}=\sqrt{8E_JE_C}-E_C$ , so$$\alpha \equiv E_{12}-E_{01} \simeq -E_C.$$ -
Relative anharmonicity.
$\displaystyle \alpha_r \equiv \frac{\alpha}{\omega_{01}} \simeq \frac{-E_C}{\sqrt{8E_JE_C}} = -\sqrt{\frac{E_C}{8E_J}} = -\left(\frac{8E_J}{E_C}\right)^{-1/2}.$
Here is the energy ladder, with the crucial detail that
energy
│
│ ┌────────────── |3⟩
│ │ spacing ≈ √(8E_J E_C) − 3E_C (smallest)
│ ├────────────── |2⟩
│ │ E_12 = √(8E_J E_C) − 2E_C
│ ├────────────── |1⟩ ── α = E_12 − E_01 = −E_C
│ │ E_01 = √(8E_J E_C) − E_C
│ └────────────── |0⟩
│
│ \ /
│ \ dashed parabola / ← harmonic / Duffing approx
│ \____________________ _____/
│ true −E_J cos φ well (depth −E_J)
└────────────────────────────────────────► φ
The central trade-off. Distinguish the two anharmonicities carefully:
-
Absolute
$\alpha\approx -E_C$ is roughly constant, it sets a fixed, ns-scale gate-speed limit. -
Relative
$\alpha_r = -(8E_J/E_C)^{-1/2}$ is the quantity that slowly shrinks as we push the ratio up.
And
| charge dispersion |
|||
|---|---|---|---|
| 1 | 2.8 | ||
| 10 | 8.9 | ||
| 20 | 12.6 | ||
| 50 | 20.0 | ||
| 100 | 28.3 |
All values illustrative. As
Misconception check. Bigger
$E_J/E_C$ is not always better. By$E_J/E_C\sim 50$ -$100$ the charge dispersion is already utterly negligible; pushing higher buys essentially no extra charge protection but keeps eroding anharmonicity and slowing your gates. There is an optimum window, not a monotonic "more is better."
Pick generic teaching values:
1 · Qubit frequency.
2 · Anharmonicity. Absolute:
3 · Gate-speed intuition. A square pulse of duration
4 · Charge dispersion. Exponent
The quantitative heart of the chapter. At
A fixed junction gives a fixed
Two junctions in parallel give
with
Why asymmetry on purpose? Real junctions are never perfectly matched, so
Sweet spots. Since
At a sweet spot, expanding gives
ω_q Φ₀/2 Φ₀/2
│ ●●●●● (symmetric (symmetric
│ ●● ●● dips to 0) dips to 0)
│ ● ● ●●● ●●●
│ ● SWEET ● ● ● ● ● ← asymmetric d>0:
│ ● SPOT ● ● ● ● ● min never reaches 0,
│ ● dω/dΦ=0, ● ● ● ● ● flatter / less tuning
│● best T₂ ●● ●● ●●●●
│ steep flanks = ●●●●●●
│ high flux-noise sensitivity
└────┼────────┼────────┼────────┼────────► Φ/Φ₀
−1.0 −0.5 0.0 0.5 1.0
Misconception check. A sweet spot removes only the first-order flux sensitivity (slope
$=0$ ); the curvature still couples second-order flux noise. That is why tunable transmons generally trail fixed-frequency ones in$T_2$ , and why a symmetric SQUID tunes$E_{J,\mathrm{eff}}$ to zero at$\Phi_0/2$ but an asymmetric one never does.
-
"It's a different circuit from the CPB." No, same circuit, same Hamiltonian; only
$E_J/E_C$ changes. -
"Maximize
$E_J/E_C$ ." No, there's an optimum window (~50-100); beyond it you only lose anharmonicity and gate speed. - "$\alpha<0$ is a defect." No, it just means upper levels are more closely spaced; the sign matters for choosing DRAG parameters.
- "$n_g$ vanishes." No, only its effect on the frequency is exponentially suppressed.
-
"These formulas are exact." No,
$\omega_q=\sqrt{8E_JE_C}-E_C$ and$\alpha=-E_C$ are leading-order teaching approximations; for design you diagonalize the full Hamiltonian (Mathieu functions / numerics).
- The transmon is a Cooper-pair box run at large
$E_J/E_C$ ($\gtrsim50$ ), achieved with a big shunt capacitor, same circuit, one ratio changed. - One perturbative line,
$E_m\simeq -E_J+\sqrt{8E_CE_J}(m+\tfrac12)-\tfrac{E_C}{12}(6m^2+6m+3)$ , yields$\omega_q$ ,$\alpha$ , and the level crowding. - Charge dispersion is suppressed exponentially as
$e^{-\sqrt{8E_J/E_C}}$ ; anharmonicity falls only as a power law$\alpha_r=-(8E_J/E_C)^{-1/2}$ , exponential beats power law, opening a wide usable window. -
Absolute
$\alpha\approx-E_C$ (fixed, sets the ns gate-speed limit) differs from relative$\alpha_r$ (slowly shrinks). Finite$\alpha$ ⇒ leakage to$|2\rangle$ ⇒ DRAG. - A SQUID makes
$E_{J,\mathrm{eff}}(\Phi)$ flux-tunable; junction asymmetry$d$ trades tuning range for flux-noise robustness; operate at a sweet spot where$\partial\omega_q/\partial\Phi=0$ for best$T_2$ .
- J. Koch et al., "Charge-insensitive qubit design derived from the Cooper pair box," Phys. Rev. A 76, 042319 (2007), the original transmon paper and source of every formula here, arXiv:cond-mat/0703002.
- P. Krantz et al., "A Quantum Engineer's Guide to Superconducting Qubits," Appl. Phys. Rev. 6, 021318 (2019), readable engineering review; best for the Duffing picture, flux tunability, and asymmetric SQUIDs, arXiv:1904.06560.
- A. Blais, A. L. Grimsmo, S. M. Girvin, A. Wallraff, "Circuit Quantum Electrodynamics," Rev. Mod. Phys. 93, 025005 (2021), authoritative modern review with rigorous derivations of the charge/flux Hamiltonians and the harmonic-plus-quartic expansion, arXiv:2005.12667.