Statistics of electron-multiplying charge-coupled devices

Brian M. Sutin

doi:10.1117/1.JATIS.9.2.028001

Journal of Astronomical Telescopes, Instruments, and Systems, Vol. 9, Issue 2, 028001 (6 April 2023). https://doi.org/10.1117/1.JATIS.9.2.028001

Abstract

Electron-multiplying charge-coupled devices are efficient imaging devices for low-surface-brightness ultraviolet astronomy from space. The large amplification allows photon counting (PC), the detection of events versus nonevents. This paper provides the statistics of the observation process, the photon-counting process, the amplification process, and the compression. The expression for the signal-to-noise ratio of PC is written in terms of the polygamma function. The optimal exposure time is a function of the clock-induced charge. The exact distribution of the amplification process is a simple-to-compute powered matrix. The optimal cutoff for thresholding against the read noise is a strong function of the read noise and a weak function of the electron-multiplying gain and photon rate. A formula gives the expected compression rate.

1. Science Motivation

The Earth’s atmosphere is essentially opaque to ultraviolet (UV) radiation at wavelengths shorter than about 300 nm. Many interesting astrophysical phenomena can be imaged in the UV, such as emission lines from interstellar gas, redshifted Ly-$α$ from galaxies, and zodiacal light caused by Rayleigh scatter of sunlight. Most of these UV sources have very low UV surface brightness, so observation requires an imaging telescope above the Earth’s atmosphere that integrates for long periods, coupled with an imaging detector with very high efficiency. Typical photon rates are on the order of one photon per 1000 pixels per second. As space-based missions are limited in the data they can return, compression of data consisting mostly of nondetections is relevant.

Electron-multiplying charge-coupled devices (EMCCDs) are the enabling technology for space-based UV imaging, especially with enhanced UV quantum efficiency. The Teledyne e2v CCD201-20 EMCCD has flight heritage from the Faint Intergalactic Redshifted Emission Balloon.¹⁻³ The CCD201-20 has also been extensively tested and qualified for the Roman coronagraph instrument.⁴ Other space applications using the CCD201-20 are SPARCS,⁵⁻⁷ currently in production, and the Polarized Zodiacal Light Experiment concept.⁸

2. What is an EMCCD?

The EMCCD is a CCD modified to achieve high signal-to-noise ratio (SNR) by rendering the read noise effectively zero. Compared to conventional CCDs, EMCCDs have an additional serial register (604 extra charge-coupled “pixels” [see Fig. 1]), where one of the register clocks is replaced by a high-voltage clock (25 to 50 V). The higher voltage causes a multiplication process that stochastically turns one electron into many, resulting in thousands of electrons at the output amplifier. This allows detection of single-photon events by thresholding above amplifier read noise.¹⁰

Fig. 1

Device architecture diagram for the e2v CCD201-20 EMCCD.⁹ The device has two separate output amplifiers, one of which is amplified with the 604 multiplication elements.

In the amplification process, if the charge transfer from one register to the next is done with enough input energy, a signal electron will knock loose another electron. This is the avalanche photodiode (APD) effect. Repeating this process makes an “APD staircase,”¹¹ magnifying the signal by having some extra CCD-like charge transfers explicitly run using overdrive. Since the digitization takes place after magnification, the read noise is proportionally smaller when compared to the original, unamplified signal.

EMCCDs are generally run in “photon counting” (PC) mode, meaning that the detector is read out often enough that the expected signal counts in each pixel are $≪ 1$ . After magnification, the final counts are generally either very large or very small, signifying detecting or not detecting signal photons. This is similar to a photomultiplier tube (PMT). The downside, as compared to a PMT or an intensified CCD, is that the dark current and clock-induced charge (CIC) may also add significant noise. The EMCCD data output is then a combination of science signal, dark current, CIC, and readout noise. Dark current decreases with temperature, but sufficient cooling is not always available to make the dark current negligible. CIC is fixed per exposure and is discussed in Sec. 6. Without PC, sufficient EM amplification will make the read noise comparatively small, but the discussion in Sec. 11 shows that in PC mode, the read noise is eliminated with the proper choice of parameters.

3. Probability Notation

The rest of this paper mainly consists of probability theory related to processing and understanding the signal from these detectors. As the intended audience of this paper is engineers and scientists rather than mathematicians, neither mathematical rigor nor mathematically rigorous notation is prioritized. The notation used is as follows:

$X, X_{Y}$ : a random variable; i.e., a variable that can take on randomly generated values
$P (X)$ : a probability distribution
$E (X)$ : the expected value of $X$ , also known as the first moment or mean
$V (X)$ : the variance of $X$ , equal to the second moment of $X - E (X)$

The various symbols used are, in order of definition:

$λ$ : the number of counts expected in a pixel’s well for a given exposure [Eq. (1)]
$Q (n, λ)$ : regularized gamma function [Eq. (3)]
$ε$ : probability of one or more signal counts in a pixel’s well [Eq. (6)]
$N$ : number of exposures in an observation for a single pointing [Eq. (7)]
${PC}_{N}$ : number of exposures out of $N$ that have one or more counts [Eq. (7)]
$c$ : an observed value of ${PC}_{N}$ [Eq. (7)]
$B (α, β)$ : Beta function with parameters $α$ and $β$ [Eq. (9)]
$M_{n} (λ)$ : $n$ ’th moment of the probability distribution of $λ$ [Eq. (11)]
$ψ, ψ_{1}$ : digamma and trigamma functions [Eqs. (13) and (14)]
$T$ : total exposure time in seconds for series of exposures [Eq. (15)]
$λ_{CIC}$ : CIC count rate in counts per exposure [Eq. (15)]
$η$ : photon count rate from the signal source in counts per second [Eq. (15)]
$\bar{ε}$ : expected value of $ε$ [Eq. (17)]
$λ_{opt}$ : the counts per exposure that maximize SNR [Eq. (19)]
$\hat{μ}$ : readout noise when not using PC [Eq. (21)]
$P_{m} (n)$ : the probability of $n$ counts after $m$ stages of amplification [Eq. (25)]
$B_{n k}$ : a matrix containing the probability of $k$ counts multiplying to $n$ counts [Eq. (27)]
$G$ : amplification gain of a set of multiplication elements [Eq. (30)]
$μ$ : readout noise when using PC [Eq. (33)]
$τ$ : cutoff criterion on the amplified counts for declaring a detection [Eq. (36)]
$\hat{τ}$ : optimal value for photon-counting cutoff $τ$ [Eq. (41)]

4. Poisson Distributions

For independent events arriving at a detector, the Poisson distribution is appropriate to use. For a Poisson-distributed random variable $X$ , $E (X) = V (X) = λ$ , and $X$ is distributed as

Eq. (1)

P (X = k) = \frac{λ^{k} e^{- λ}}{k!} .

Similar to the Gaussian assumption, the SNR goes as the square root of the counts as a result of $E (X) = V (X)$ . For no events,

Eq. (2)

P (X = 0) = e^{- λ}, P (X > 0) = 1 - e^{- λ} .

Useful is the cumulative distribution function, which is

Eq. (3)

F_{X} (n) = P (X \leq n) = \sum_{k = 0}^{n} \frac{λ^{k} e^{- λ}}{k!} = Q (n + 1, λ) .

Here $Q (n, λ)$ is the regularized gamma function; e.g., evaluated in Excel as

Eq. (4)

Q (n, λ) = 1 - GAMMADIST (λ, n, 1, true)
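As a minimal sketch of Eqs. (1) to (3), the Poisson probabilities and the cumulative distribution can be evaluated with nothing more than a standard math library. The function names `poisson_pmf` and `poisson_cdf` and the value `lam = 0.05` are illustrative choices, not part of the paper. The direct sum below is numerically the same quantity that Eq. (3) writes as $Q (n + 1, λ)$.

```python
import math

def poisson_pmf(k, lam):
    """P(X = k) for a Poisson variable with mean lam > 0, Eq. (1)."""
    # Work in logs so that large k does not overflow the factorial.
    return math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))

def poisson_cdf(n, lam):
    """P(X <= n), which Eq. (3) identifies with Q(n + 1, lam)."""
    return sum(poisson_pmf(k, lam) for k in range(n + 1))

lam = 0.05                      # illustrative counts per exposure
p_none = poisson_cdf(0, lam)    # no events, equals exp(-lam) as in Eq. (2)
p_some = 1.0 - p_none           # one or more events
print(p_none, p_some)
```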

5. Photon Counting

Consider an ideal photon-counting detector with infinite gain. If any photons at all are detected, then the detector returns 1, otherwise 0. Let the random variable $X_{W}$ be the number of electrons in the well of a pixel. Using Eq. (2), the expected value (and second and all higher moments) of this process is then

Eq. (5)

E (Ideal PC) = \sum_{w = 1}^{\infty} (1) P (X_{W} = w) = P (X_{W} > 0) = 1 - e^{- λ} .

In practice, a single exposure is not useful. Rather, $λ$ is estimated by taking a sequence of $N$ exposures, each in PC mode. Define $ε$ as the probability of detection

Eq. (6)

ε = 1 - e^{- λ} .

The number of exposures with a signal detection is then given by the binomial distribution

Eq. (7)

P ({PC}_{N} = c | ε) = \binom{N}{c} ε^{c} {(1 - ε)}^{N - c} .

Here $c$ is the total number of counts for the $N$ exposures. If $ε$ is ½, then this is the distribution of the number of heads after $N$ flips of a fair coin. What we want to know is $ε$ , which is on the wrong side of the equation. Using Bayes’ theorem,

Eq. (8)

P (ε | {PC}_{N} = c) = \frac{P ({PC}_{N} = c | ε) P (ε)}{P ({PC}_{N} = c)} .

We need a prior for $P (ε)$ and select the most commonly used prior for the binomial distribution, the beta distribution, given by

Eq. (9)

P (ε) = \frac{ε^{α - 1} {(1 - ε)}^{β - 1}}{B (α, β)}

The denominator is the beta function $B (α, β) = Γ (α) Γ (β) / Γ (α + β)$ , and the distribution is valid over ${α > 0, β > 0}$ . A common choice is the uniform prior ${α = 1, β = 1}$ . For a reader unfamiliar with Bayesian statistics, the prior can be used to bias the result using prior knowledge (thus the name). However, the actual choice of prior makes little difference with sufficient data. More useful is to reverse this last statement; if the result of an observation depends significantly on the choice of prior, then the data quantity is insufficiently robust. Using the above prior,

Eq. (10)

P (ε | {PC}_{N} = c) = \frac{ε^{c + α - 1} (1 - ε)^{N - c + β - 1}}{B (c + α, N - c + β)} .

We did not need to explicitly calculate the Bayesian denominator $P ({PC}_{N} = c)$ . The denominator is determined by noticing that the binomial distribution combined with a beta distribution is another beta distribution, so the new denominator is the normalization factor from the new beta distribution.

The moments of $λ$ follow from Eq. (6), which gives $λ = - \ln (1 - ε)$ ; the $n$ ’th moment is the integral of the $n$ ’th power of $λ$ over the posterior distribution of $ε$

Eq. (11)

M_{n} (λ) = \int_{ε = 0}^{1} {(- \ln (1 - ε))}^{n} \frac{ε^{c + α - 1} (1 - ε)^{N - c + β - 1}}{B (c + α, N - c + β)} d ε

A change of variable gives, to within a sign, the formula for the geometric moments of the beta distribution

Eq. (12)

M_{n} (λ) = \int_{x = 0}^{1} (- \ln x)^{n} \frac{x^{N - c + β - 1} (1 - x)^{c + α - 1}}{B (N - c + β, c + α)} d x .

So, the mean and variance of $λ$ are (within a sign) the geometric mean and geometric variance of a beta distribution, respectively. The expected value can be written in terms of the digamma function $ψ$

Eq. (13)

E (λ) = M_{1} (λ) = ψ (N + α + β) - ψ (N - c + β),

and the variance in terms of the trigamma function $ψ_{1}$

Eq. (14)

V (λ) = M_{2} (λ) - M_{1}^{2} (λ) = ψ_{1} (N - c + β) - ψ_{1} (N + α + β) .

The polygamma function $ψ_{n}$ is a transcendental function available in language math libraries or evaluated with a series approximation.
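The following sketch evaluates the posterior mean and variance of $λ$ from the digamma and trigamma expressions in Eqs. (13) and (14). It assumes no external library: `digamma` and `trigamma` are hand-written here using the standard upward recurrence followed by the asymptotic series, which is one possible "series approximation" and not necessarily the one the author used. The helper name `lambda_posterior` and the example values $N = 1000$, $c = 50$ are hypothetical.

```python
import math

def digamma(x):
    """psi(x) for x > 0: shift x up to >= 6, then use the asymptotic series."""
    acc = 0.0
    while x < 6.0:
        acc -= 1.0 / x          # psi(x) = psi(x + 1) - 1/x
        x += 1.0
    i2 = 1.0 / (x * x)
    series = i2 * (1/12 - i2 * (1/120 - i2 * (1/252 - i2 * (1/240 - i2 / 132))))
    return acc + math.log(x) - 0.5 / x - series

def trigamma(x):
    """psi_1(x) for x > 0: shift x up to >= 6, then use the asymptotic series."""
    acc = 0.0
    while x < 6.0:
        acc += 1.0 / (x * x)    # psi_1(x) = psi_1(x + 1) + 1/x^2
        x += 1.0
    i2 = 1.0 / (x * x)
    series = (1/6 - i2 * (1/30 - i2 * (1/42 - i2 / 30))) * i2 / x
    return acc + 1.0 / x + 0.5 * i2 + series

def lambda_posterior(N, c, alpha=1.0, beta=1.0):
    """Posterior mean and variance of lambda given c detections in N exposures."""
    empty = N - c + beta            # beta-distribution parameter for 1 - epsilon
    total = N + alpha + beta
    mean = digamma(total) - digamma(empty)
    var = trigamma(empty) - trigamma(total)
    return mean, var

mean, var = lambda_posterior(N=1000, c=50)   # uniform prior, alpha = beta = 1
print(mean, var, mean / math.sqrt(var))      # last value is the SNR of lambda
```

For integer arguments the two differences reduce to finite sums of $1 / k$ and $1 / k^{2}$ , which gives a simple independent check of the series code.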

6. Optimal Exposure Time

$λ$ , the expected number of counts in an exposure, depends on the exposure time, while the relevant quantities for an observation are the rate of photons per unit time and the total exposure time available. The obvious choice is to have a very large number of exposures and make the exposure time arbitrarily small. However, in practice, an EMCCD has a noise source called CIC that is added to every exposure,³,¹² where an extra charge is added during clocking of the charge-coupled pixels. CIC has been measured as low as $7 \times 10^{- 4} e^{-} / pixel / readout$ ,¹³ while achieving $5 \times 10^{- 3} e^{-} / pixel / readout$ is relatively easy.

CIC is not constant across the detector. For a $1 k \times 1 k$ -pixel frame transfer device, the number of pixel-to-pixel transfers can vary from $1 k$ to $3 k$ (frame transfer plus row number plus column number) before arriving at the readout and amplification chain (see Fig. 1).

For this section, we define the following variables:

• $T$ - the total exposure time in seconds for a series of exposures
• $λ_{CIC}$ - the CIC count rate in counts per exposure
• $η$ - the photon count rate from the science signal source in counts per second

The variable $η$ is the fundamental astronomical parameter being measured, but includes dark current. Our expected counts per exposure $λ$ is then

Eq. (15)

λ = λ_{CIC} + \frac{η T}{N} .

The expected photon-counting event detection rate is

Eq. (16)

\bar{ε} = 1 - e^{- λ_{CIC} - η T / N} .

To avoid the difficult polygamma function, we will find the optimal number of exposures $N$ by maximizing the SNR with respect to $ε$ rather than $λ$ . This should be acceptable, as CIC is only critical for small $λ$ . Choose the number of exposures $N$ to be $N ≫ α$ and $N ≫ β$ (or choose $α = β = 1$ ). Since $ε$ is beta distributed, we know the mean and variance, and thus the SNR

Eq. (17)

E (\bar{ε}) = c / N, V (\bar{ε}) = \frac{(c / N) (1 - c / N)}{N}, SNR (\bar{ε}) = {(\frac{c}{1 - c / N})}^{1 / 2} .

The SNR evaluated at the expected $\bar{ε}$ is then

Eq. (18)

SNR (\bar{ε}) = N^{1 / 2} {(e^{λ_{CIC} + η T / N} - 1)}^{1 / 2} .

Setting the derivative with respect to $N$ to zero gives the transcendental equation

Eq. (19)

(1 + λ_{CIC} - λ_{opt}) e^{λ_{opt}} = 1 .

Over the typical region of $10^{- 3} < λ_{CIC} < 5 \times 10^{- 3}$ , $λ_{opt}$ is well approximated by the fit

Eq. (20)

λ_{opt} \approx \frac{3}{2} λ_{CIC}^{1 / 2} .

So, for $CIC \sim 0.001$ , the optimal number of well counts per exposure is $λ_{opt} \sim 0.045$ counts for best SNR. Given that CIC for current state-of-the-art EMCCD controllers is in the range of 0.001 to 0.005, $λ$ for optimal SNR lies in the range 0.045 to 0.10. Since CIC varies across the detector, $λ_{opt}$ can never be more than a compromise. Other considerations might lead to using shorter exposure times than for optimal SNR. For example, bright stars in the field can cause “blooming,” where the wells of the amplification staircase elements overfill and spill charge into adjacent elements. The exposure time may also be limited by the spacecraft stability for applications where spatial resolution is important.
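The transcendental Eq. (19) has no closed-form solution, but it is easy to solve numerically. The sketch below uses plain bisection (my choice of method, not one stated in the paper) and prints the root next to the fit of Eq. (20) for comparison. The function name `lambda_opt` and the three sample CIC values are illustrative.

```python
import math

def lambda_opt(lam_cic, tol=1e-12):
    """Root of (1 + lam_cic - x) * exp(x) = 1 with x > lam_cic, Eq. (19)."""
    def g(x):
        return (1.0 + lam_cic - x) * math.exp(x) - 1.0

    # For lam_cic > 0, g(lam_cic) = exp(lam_cic) - 1 > 0 and g(1 + lam_cic) = -1,
    # and g is decreasing between them, so the bracket holds exactly one root.
    lo, hi = lam_cic, 1.0 + lam_cic
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for cic in (0.001, 0.002, 0.005):
    exact = lambda_opt(cic)
    fit = 1.5 * math.sqrt(cic)          # Eq. (20)
    print(f"CIC={cic:.3f}  Eq.(19) root={exact:.4f}  Eq.(20) fit={fit:.4f}")
```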

7. Comparison to Standard Mode

For bright sources, the detector frame rate may not be high enough to achieve the optimal count rate $λ$ per exposure. In this case, observing with standard mode (SM) might provide better SNR. The SNR in SM is

Eq. (21)

SNR (SM) = \frac{η T}{{(η T + \hat{μ})}^{1 / 2}} .

Here $\hat{μ}$ is the readout noise for standard mode, which may be different than the readout noise $μ$ for PC mode if the EMCCD device has separate readout amplifiers (see Fig. 1). Given the total number of expected events $η T$ for an observation and a known $λ_{CIC}$ , the number of PC-mode exposures is

Eq. (22)

N = η T / (λ - λ_{CIC}) .

The expected number of PC counts is

Eq. (23)

E (c) = N - N e^{- λ} .

Assuming $N$ is large enough that the prior is irrelevant and replacing $c$ with the expected counts, the PC-mode SNR is

Eq. (24)

SNR (PC) = \frac{ψ (N) - ψ (N e^{- λ})}{{(ψ_{1} (N e^{- λ}) - ψ_{1} (N))}^{1 / 2}} .

Equations (21) and (24) can be compared to choose which mode is best. A worked example is shown in Fig. 2. In practice, the standard-mode exposure time $T$ is limited by cosmic rays, whereas for PC mode the exposure time $T / N$ is limited by readout speed.
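A comparison of the two modes can be sketched directly from Eqs. (21), (22), and (24). The code below is an illustration under stated assumptions: the polygamma functions are hand-written from their asymptotic series, $N$ is rounded up to an integer, and the inputs ( $η T = 100$ , $λ = 0.06$ , $λ_{CIC} = 0.002$ , $\hat{μ} = 6$ ) are example values rather than results from the paper. The names `snr_sm` and `snr_pc` are hypothetical.

```python
import math

def digamma(x):
    """psi(x), x > 0: shift x upward, then use the asymptotic series."""
    acc = 0.0
    while x < 6.0:
        acc -= 1.0 / x
        x += 1.0
    i2 = 1.0 / (x * x)
    return acc + math.log(x) - 0.5 / x - i2 * (1/12 - i2 * (1/120 - i2 / 252))

def trigamma(x):
    """psi_1(x), x > 0: shift x upward, then use the asymptotic series."""
    acc = 0.0
    while x < 6.0:
        acc += 1.0 / (x * x)
        x += 1.0
    i2 = 1.0 / (x * x)
    return acc + 1.0 / x + 0.5 * i2 + (1/6 - i2 * (1/30 - i2 / 42)) * i2 / x

def snr_sm(eta_t, mu_hat):
    """Standard-mode SNR, Eq. (21)."""
    return eta_t / math.sqrt(eta_t + mu_hat)

def snr_pc(eta_t, lam_target, lam_cic):
    """Photon-counting SNR, Eq. (24), with N from Eq. (22) rounded up."""
    n = math.ceil(eta_t / (lam_target - lam_cic))
    lam = lam_cic + eta_t / n            # Eq. (15) for the integer N
    empty = n * math.exp(-lam)           # expected number of empty exposures
    signal = digamma(n) - digamma(empty)
    noise = math.sqrt(trigamma(empty) - trigamma(n))
    return signal / noise

eta_t = 100.0                            # expected signal events in the observation
print("SM:", snr_sm(eta_t, mu_hat=6.0))
print("PC:", snr_pc(eta_t, lam_target=0.06, lam_cic=0.002))
```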

Fig. 2

Assuming $λ_{CIC} = 0.002 e^{-}$ and $μ = \hat{μ} = 6 e^{-}$ for this plot, PC mode always has better SNR than standard mode. The staircase effect for lower counts in PC mode is real and due to rounding $N$ up to the nearest integer number of exposures.

8. Amplification

The EMCCD amplification process is similar to the amplification process in an APD.¹¹,¹⁴⁻¹⁸ The amplification of a single electron from Matsuo 1985¹¹ (hereafter “Matsuo”) is given by a recurrence relation for each stage $m$ by

Eq. (25)

P_{m} (n) = (1 - Q) P_{m - 1} (n) + Q \sum_{k = 0}^{n} P_{m - 1} (n - k) P_{m - 1} (k), P_{0} (n) = δ_{1, n} .

$Q$ is the probability of an electron generating a secondary. The distribution of counts for each stage $P_{m} (n)$ , where $n$ is the number of counts, depends on the distribution of counts at the previous stage. What happens if the initial distribution of counts is not restricted to a single count, or $P_{0} (n) \neq δ_{1, n}$ ? Then the recurrence relation spectacularly fails. However, the most important issue with this recurrence relation is that it physically makes no sense; the relation connects the opposite tails of the distribution to generate the next stage. So, although the relation may hold true, it is more of a mathematical curiosity than anything else.

The APD staircase effect can be described as a process where, at each step, there is a chance when transferring the charge that an electron generates a second electron. Now make two assumptions. First, the electrons are not aware of each other, equivalent to the process at each step being linear in the number of electrons generated on average. Second, assume that an original electron can never generate two or more additional electrons, equivalent to assuming that the energy of an electron, once spent on generating a second electron, is gone. With these two assumptions, we can write down a recurrence relation.

Consider, at some step $m$ in the APD staircase, the distribution of electrons $P_{m} (n)$ . How was this distribution generated if the previous step had $k$ electrons? Clearly, $n - k$ electrons each generated a single electron (second assumption), while $k - (n - k) = 2 k - n$ electrons did not. Since the electrons are independent (first assumption), the applicable distribution is the binomial distribution

Eq. (26)

P_{m} (n) = \sum_{k = ⌈ n / 2 ⌉}^{n} \binom{k}{n - k} {(1 - Q)}^{2 k - n} Q^{n - k} P_{m - 1} (k) .

This expression looks unwieldy; however, the right-hand side is nothing more than a fixed matrix multiplication to get from one stage of the APD to the next. The matrix is

Eq. (27)

B_{n k} = \binom{k}{n - k} {(1 - Q)}^{2 k - n} Q^{n - k}, k \in [⌈ \frac{n}{2} ⌉, n] .

Once the matrix $B_{n k}$ is formed with a size corresponding to the largest electron counts of interest, the recurrence relation becomes

Eq. (28)

P_{m} (n) = B_{n k} P_{m - 1} (k) .

The final amplified distribution is then

Eq. (29)

P_{m} (n) = {(B_{n k})}^{m} P_{0} (k) .

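Computing $P_{m} (n)$ takes only a few lines of MATLAB code:

% Create the transform matrix for a single stage of a staircase APD
% Inputs: gain (total gain G), stages (number of stages m),
%         nmax (largest count tracked), Po (initial distribution, row vector)
Q = gain^(1/stages) - 1;   % per-stage probability, since G = (1+Q)^stages
B = zeros(nmax+1);         % B(k+1,n+1): probability that k counts become n counts
for n = 0:nmax
    for k = 0:nmax
        if (k <= n) && (n-k <= k)
            B(k+1,n+1) = exp(gammaln(k+1) - gammaln(n-k+1) - gammaln(2*k-n+1) ...
                + (2*k-n)*log(1-Q) + (n-k)*log(Q));
        end
    end % k
end % n

% Transform initial distribution to the final distribution
P = Po*B^stages; % Po and P are row vectors here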

This recurrence relation gives identical results to Matsuo for an initial distribution of a single electron $P_{0} (n) = δ_{1, n}$ , while also, unlike Matsuo, working for any initial distribution.
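The agreement between the two recurrences for a single initial electron can be checked numerically. The sketch below applies Eq. (25) and Eq. (26) stage by stage and compares the results; the function names and the small example (gain 4, 8 stages, counts tracked up to 40) are illustrative assumptions chosen to keep the run time short. Both recurrences only use counts at or below $n$ to compute $P_{m} (n)$ , so truncating at `nmax` does not affect the compared entries.

```python
import math

def binomial_stage(p_prev, q):
    """One stage of Eq. (26): each of k electrons adds one more with probability q."""
    nmax = len(p_prev) - 1
    p_next = [0.0] * (nmax + 1)
    for k, pk in enumerate(p_prev):
        if pk == 0.0:
            continue
        for extra in range(min(k, nmax - k) + 1):      # n = k + extra
            weight = math.comb(k, extra) * q**extra * (1.0 - q)**(k - extra)
            p_next[k + extra] += weight * pk
    return p_next

def matsuo_stage(p_prev, q):
    """One stage of Eq. (25): no split, or a split into two independent chains."""
    nmax = len(p_prev) - 1
    return [
        (1.0 - q) * p_prev[n]
        + q * sum(p_prev[n - k] * p_prev[k] for k in range(n + 1))
        for n in range(nmax + 1)
    ]

gain, stages, nmax = 4.0, 8, 40          # illustrative values
q = gain ** (1.0 / stages) - 1.0         # per-stage probability, G = (1 + Q)^m

p0 = [0.0] * (nmax + 1)
p0[1] = 1.0                              # single initial electron
p_binomial, p_matsuo = p0[:], p0[:]
for _ in range(stages):
    p_binomial = binomial_stage(p_binomial, q)
    p_matsuo = matsuo_stage(p_matsuo, q)

print(max(abs(a - b) for a, b in zip(p_binomial, p_matsuo)))
```

Changing `p0` to any other starting distribution remains valid for `binomial_stage`, whereas `matsuo_stage` is only meaningful for the single-electron start.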

9. Comparison to Basden

Basden 2003¹⁹ (hereafter “Basden”) gives the formula for the distribution of final amplified counts $X_{A}$ as [Basden, Eq. (1)]

Eq. (30)

\tilde{P} (X_{A} = a | X_{W} = w) = \frac{a^{w - 1} e^{- a / G}}{G^{w} (w - 1)!}, \quad a \geq w, w > 0; \qquad \tilde{P} (X_{A} = 0 | X_{W} = 0) = 1 .

$G$ here is the gain of the entire APD staircase, so the gain at stage $m$ would be $G_{m} = (1 + Q)^{m}$ .

Basden’s derivation is based on assuming the first assumption above (independent electrons) and a few approximations. We write $\tilde{P}$ because the formula is not a probability distribution; the sum over all values is not equal to unity. The formula has the functional form of an Erlang distribution, but the Erlang distribution is a continuous distribution in $a$ . The Erlang distribution is the distribution of the sum of $w$ independent identically distributed (IID) exponentially distributed random variables. This is a hint that the Basden equation could be derived as a sum of IID geometrically distributed random variables. In fact, a plot of the Matsuo distribution for a single initial electron does look very similar to an exponential distribution.

The approximation of Eq. (30) deviates from the recurrence relation of Eq. (26), mostly for larger counts (Fig. 3). Basden explicitly states that their approximation should not be used in this region.

Fig. 3

The Basden approximation is close to the binomial model for moderate counts but fits poorly for the less likely counts. For this plot, the gain is 100, with 50 stages. “Matsuo” is Eq. (25), “Basden” is Eq. (30), and “Binomial” is Eq. (26).

In the limit as the number of stages $m$ becomes large and the amplification $Q$ becomes small for fixed $G$ , a closed-form solution in terms of $G$ based on Eq. (26) should exist. At a minimum, the recurrence relation can be written in terms of the eigenvectors and eigenvalues

Eq. (31)

B_{n k}^{m} = V Λ^{m} V^{- 1} .

Since the matrix $B_{n k}$ is triangular, the diagonal elements ${(1 - Q)}^{n}$ are the eigenvalues. Taking the limit as $m$ goes to infinity for fixed $G$ gives the eigenvalues of $B_{n k}^{m}$ as

Eq. (32)

Λ_{i} = G^{- i} .

In the limit, the eigenvectors $V$ are independent of $G$ .

10. Combining Signal with Read Noise

In practice, we add the amplified well counts to the read noise $X_{R}$ . Read noise is also Poisson distributed, so

Eq. (33)

P (X_{R} = y) = \frac{μ^{y} e^{- μ}}{y!} .

The expected value of the read noise $μ$ is trivially measured by taking a zero-exposure-time frame or alternatively running the analog-to-digital converter without advancing the CCD charge. Since the read noise and amplified counts are independent, the distribution of the sum read out by the detector is

Eq. (34)

P (X_{A} + X_{R} = c) = \sum_{a = 0}^{c} P (X_{A} = a) P (X_{R} = c - a) .

Inserting the expressions from Eqs. (1), (29), and (33) gives

Eq. (35)

P (X_{A} + X_{R} = c) = \sum_{a = 0}^{c} \frac{μ^{c - a} e^{- μ}}{(c - a)!} B_{a k}^{m} \frac{λ^{k} e^{- λ}}{k!} .

11. PC Cutoff Setting

The algorithm used for PC is that $c$ is considered a single count if higher than some threshold $τ$ . The expected value (and second and all higher moments) of this process is then

Eq. (36)

E_{τ} (PC) = \sum_{c = τ + 1}^{\infty} (1) P (X_{A} + X_{R} = c) .

Since the argument of the summation is a probability distribution over $c$ , this is equal to

Eq. (37)

E_{τ} (PC) = 1 - \sum_{c = 0}^{τ} P (X_{A} + X_{R} = c) .

Substituting in Eq. (35) and swapping the summations gives

Eq. (38)

E_{τ} (PC) = 1 - e^{- λ} \sum_{a = 0}^{τ} \sum_{y = 0}^{τ - a} \frac{μ^{y} e^{- μ}}{y!} B_{a k}^{m} \frac{λ^{k}}{k!} .

The sum over $y$ is again the regularized gamma function from Eq. (3)

Eq. (39)

E_{τ} (PC) = 1 - e^{- λ} \sum_{a = 0}^{τ} Q (τ - a + 1, μ) B_{a k}^{m} \frac{λ^{k}}{k!} .

The optimal choice for the cutoff parameter $τ$ is such that the read noise does not bias the PC process. This is equivalent to setting $τ$ such that false positives equal false negatives. With this choice, lost counts from low-amplification signal counts balance added counts from high noise. The PC method is then an “ideal” photon counter, as in Sec. 5. One can prove that setting false positives equal to false negatives is equivalent to the read noise having no effect, and so the expected value of $ε$ with noise is equal to $ε$ without noise, or

Eq. (40)

E_{τ} (PC) ≅ ε .

Let $\hat{τ}$ be the optimal $τ$ that satisfies Eq. (40). Then, from Eq. (39),

Eq. (41)

\sum_{a = 0}^{\hat{τ}} Q (\hat{τ} - a + 1, μ) B_{a k}^{m} \frac{λ^{k}}{k!} = 1 .
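Because $τ$ is an integer, the condition above can only be met approximately. One way to evaluate it, sketched below under stated assumptions, is to compute $E_{τ} (PC)$ from Eq. (39) for increasing $τ$ and take the first value at which it drops to $ε$ or below. The stopping rule, the function names, and the example inputs ( $λ = 0.05$ , $μ = 6$ , $G = 100$ , 604 stages) are my illustrative choices. Only amplified counts up to the largest cutoff considered are needed, since Eq. (26) never uses counts above $n$ to compute $P_{m} (n)$ .

```python
import math

def poisson_pmf(k, mean):
    """Poisson probability of k counts for mean > 0."""
    return math.exp(k * math.log(mean) - mean - math.lgamma(k + 1))

def amplified_distribution(lam, gain, stages, nmax):
    """P(X_A = a), a = 0..nmax: Poisson(lam) well counts pushed through Eq. (26)."""
    q = gain ** (1.0 / stages) - 1.0
    p = [poisson_pmf(k, lam) for k in range(nmax + 1)]
    for _ in range(stages):
        nxt = [0.0] * (nmax + 1)
        for k, pk in enumerate(p):
            for extra in range(min(k, nmax - k) + 1):
                weight = math.comb(k, extra) * q**extra * (1.0 - q)**(k - extra)
                nxt[k + extra] += weight * pk
        p = nxt
    return p

def pc_expectation(tau, p_amp, mu):
    """E_tau(PC) = 1 - sum_{a <= tau} P(X_R <= tau - a) P(X_A = a), as in Eq. (39)."""
    read_cdf, running = [], 0.0
    for y in range(tau + 1):
        running += poisson_pmf(y, mu)
        read_cdf.append(running)          # read_cdf[y] equals Q(y + 1, mu)
    return 1.0 - sum(read_cdf[tau - a] * p_amp[a] for a in range(tau + 1))

def optimal_cutoff(lam, mu, gain, stages, tau_max=40):
    """Smallest tau with E_tau(PC) <= epsilon; None if not reached by tau_max."""
    p_amp = amplified_distribution(lam, gain, stages, tau_max)
    eps = 1.0 - math.exp(-lam)
    for tau in range(tau_max + 1):
        if pc_expectation(tau, p_amp, mu) <= eps:
            return tau
    return None

print(optimal_cutoff(lam=0.05, mu=6.0, gain=100.0, stages=604))
```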

Over the range of typical values for $λ$ , $μ$ , and $G$ , the optimal $\hat{τ}$ is well fit to within 5% by

Eq. (42)

\hat{τ} \approx 1.72 μ^{0.88} (G / λ)^{0.03} .

$\hat{τ}$ only has a weak dependence on $λ$ and $G$ . The residuals of the fit are due to the $μ$ dependence not being sufficiently well-fit with a power law. The residuals are shown in Fig. 4.

Fig. 4

The residuals of the Eq. (42) fit to the cutoff $τ$ in Eq. (41) are within 2%. The scatter is due to variations in $λ$ over 0.04 to 0.1 and $G$ over 10 to 1000.

12. Compression of Photon-Counting Mode Data

For a typical photon-counting exposure, the expected image data are a list of binary values, the vast majority of which are zeros. Since CIC may be a significant contributor to the signal and is unstructured random noise, compression algorithms based on entropy (structure in the image) are likely not optimal. On the other hand, an algorithm that depends on most of the data being zero might be.

If the events in each pixel are independent, then the statistics are Poisson distributed with rate $λ$ per pixel, so the expected number of pixels between events is about $1 / λ$ . For a detector where the noise is CIC-dominated and CIC is $\sim 0.001 e^{-} / pix / frame$ , the number of pixels between each photon detection is about $1 / (0.001) = 1000$ .

The simplest compression method for sparse binary data is to count zeros between events. This nominally gives a compression ratio of approximately the number of pixels between events divided by the number of bits needed to encode that number, or $\sim (1 / λ) / \log_{2} (1 / λ)$ . For the CIC value above, $1000 / \log_{2} (1000) \approx 100$ , so a compression ratio on the order of 100:1 should be possible for CIC alone.

This algorithm is a variation of run-length encoding (RLE), which is to count the number of bits before a change. The heritage of the RLE algorithm stretches back to at least the early 1960s.²⁰ For mostly empty data, the appropriate standard compression algorithm is the run-length limited (RLL) variation, which has been common for encoding hard disks and optical storage since the 1980s, and only counts zeros. We use $(0, b)$ RLL.

The encoding algorithm for $RLL (0, b)$ is as follows:

1. Count number of zeros until a nonzero.
2. For a $b$ -bit encoding, do not count more than $2^{b} - 2$ zeros.
3. If no nonzero encountered, then return $2^{b} - 1$ .
4. Else, return number of zeros.
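The steps above can be sketched as a short encoder and decoder. Two details are not fixed by the list and are assumptions of this sketch: the escape code $2^{b} - 1$ is taken to stand for exactly $2^{b} - 1$ zeros with no event, and trailing zeros at the end of the data are written as an ordinary code whose implied final event is discarded by the decoder, which is given the pixel count. The function names are hypothetical.

```python
def rll_encode(bits, b):
    """Encode a 0/1 sequence into a list of b-bit code values."""
    escape = (1 << b) - 1               # code 2^b - 1: run of zeros with no event
    codes, zeros = [], 0
    for bit in bits:
        if bit:
            codes.append(zeros)         # 'zeros' zeros followed by one event
            zeros = 0
        else:
            zeros += 1
            if zeros == escape:         # run too long for an ordinary code
                codes.append(escape)
                zeros = 0
    if zeros:
        codes.append(zeros)             # trailing zeros; decoder drops the extra event
    return codes

def rll_decode(codes, b, length):
    """Invert rll_encode; 'length' is the known number of pixels."""
    escape = (1 << b) - 1
    bits = []
    for code in codes:
        if code == escape:
            bits.extend([0] * escape)
        else:
            bits.extend([0] * code + [1])
    return bits[:length]

pixels = [0] * 10 + [1] + [0] * 3 + [1, 1] + [0] * 20
codes = rll_encode(pixels, b=3)
print(codes)
print(rll_decode(codes, b=3, length=len(pixels)) == pixels)
```

Each code value fits in $b$ bits, so the compressed size is `len(codes) * b` bits against `len(pixels)` input bits.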

For the data generated by the photon-counting process, we can compute the expected compression ratio. Equation (2) gives the probability for an event (1) and a nonevent (0) for an image pixel. Assuming independent events, the probability of $n$ zeros followed by a nonzero is

Eq. (43)

P (X_{\leq n} = 0, X_{n + 1} = 1) = e^{- n λ} (1 - e^{- λ}) .

The expected number of input bits consumed by one $b$ -bit code is then the sum over two cases: all zeros (consuming one encoding value), or some run of zeros followed by a nonzero

Eq. (44)

E (bits consumed) = 2^{b} P (X_{\leq 2^{b}} = 0) + \sum_{n = 0}^{2^{b} - 2} (n + 1) P (X_{\leq n} = 0, X_{n + 1} = 1) .

Simplifying,

Eq. (45)

E (bits consumed) = 2^{b} e^{- 2^{b} λ} + (1 - e^{- λ}) \sum_{n = 0}^{2^{b} - 2} (n + 1) e^{- n λ} .

Using the derivative of the formula for the sum of a geometric series, this simplifies to

Eq. (46)

E (bits consumed) = 1 + 2^{b} e^{- 2^{b} λ} - 2^{b} e^{- (2^{b} - 1) λ} + \frac{e^{- λ} - e^{- 2^{b} λ}}{(1 - e^{- λ})} .

The final compression ratio is then $E (bits consumed) / b$ . The boundaries in $λ$ for the ranges of optimal compression are found by looking for the $λ$ where compression $(λ, b)$ = compression $(λ, b + 1)$ . The results are shown in Table 1.

Table 1

Optimal code length for given counts per exposure λ.

Code length	λlo	λhi
2	0.2272	0.4114
3	0.1636	0.2272
4	0.0955	0.1636
5	0.0531	0.0955
6	0.0288	0.0531
7	0.0154	0.0288
8	0.0082	0.0154
9	0.0043	0.0082
10	0.0023	0.0043
11	0.0012	0.0023
12	0.0006	0.0012
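The code-length selection in Table 1 can be reproduced approximately with a few lines. The sketch below evaluates the closed form of Eq. (46), checks it against a direct sum (an escape term $2^{b} e^{- 2^{b} λ}$ plus runs of 0 to $2^{b} - 2$ zeros ended by an event), and picks the code length with the largest ratio. The function names and the search range of 2 to 12 bits are illustrative assumptions.

```python
import math

def bits_consumed(lam, b):
    """Expected input bits per b-bit code, closed form of Eq. (46)."""
    m = 2 ** b
    x = math.exp(-lam)                   # probability that a pixel is empty
    return 1.0 + m * x**m - m * x**(m - 1) + (x - x**m) / (1.0 - x)

def bits_consumed_sum(lam, b):
    """Same quantity as a direct sum over the possible codes."""
    m = 2 ** b
    x = math.exp(-lam)
    runs = sum((n + 1) * x**n for n in range(m - 1))   # n = 0 .. 2^b - 2
    return m * x**m + (1.0 - x) * runs

def compression_ratio(lam, b):
    """Input bits consumed per output bit."""
    return bits_consumed(lam, b) / b

def best_code_length(lam, lengths=range(2, 13)):
    """Code length with the highest expected compression ratio."""
    return max(lengths, key=lambda b: compression_ratio(lam, b))

for lam in (0.005, 0.03, 0.07):
    b = best_code_length(lam)
    print(f"lambda={lam}: b={b}, ratio={compression_ratio(lam, b):.2f}")
```

The boundaries $λ_{lo}$ and $λ_{hi}$ are the values of $λ$ at which two neighboring code lengths give the same ratio, which can be located by bisection on `compression_ratio(lam, b) - compression_ratio(lam, b + 1)`.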

13. Conclusions

The following steps specify the operating mode (PC or SM), expected SNR, optimal exposure time, and data volume. For a space-based astronomical observatory, knowing these parameters is critical to mission design. The steps to set up an EMCCD for an observation are

1. Measure the CIC $λ_{CIC}$ by amplifying short dark frames.
2. Use Eqs. (19) or (20) to find the target expected counts per exposure $λ$ .
3. Choose a target SNR from science requirements.
4. Use Eqs. (22) and (24) to find the number of exposures $N$ required.
5. Estimate the photon rate $η$ (from radiometry).
6. Check if dynamic range (e.g., bright stars, stability) requires adjusting $N$ and thus $λ$ .
7. Use Eq. (15) to find the total observation time $T$ .
8. Use Eq. (21) to check if SM has superior SNR.
9. Adjust $T$ depending on maximum frame rate (PC) and cosmic ray tolerance (SM).
10. Choose amplification gain $G$ based on device operation considerations.
11. Measure the readout noise $μ$ by reading out unamplified short dark frames.
12. Use Eq. (41) to find the optimal cutoff $\hat{τ}$ .
13. Use Table 1 to find the compression code length and Fig. 5 for the compression ratio.

Fig. 5

An encoding with 5-bit codes has the highest compression ratio for $0.053 < λ < 0.095$ , which encompasses most of the range of $λ$ for optimal SNR from Sec. 6. The most likely lossless compression assuming optimal SNR is thus from 2:1 to 3:1. The optimal compression ratio for $λ < 0.01$ is well fit by the approximation $0.264 \times λ^{- 0.825}$ .

For a given observation of time $T$ , the number of exposures $N$ and exposure time $T / N$ must be chosen to ensure that the required SNR is met for all image pixels, taking into account variations in CIC, dark current, and signal brightness across the detector. The optimal photon-counting cutoff $\hat{τ}$ can be chosen per pixel, either in postprocessing or adaptively as exposures are taken. Using these equations, even bright (but unsaturated) stars will have correctly computed expected values.

Acknowledgments

This research was carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration (Grant No. 80NM0018D0004). Government sponsorship acknowledged.

References

1. A. D. Jewell et al., “Detector performance for the FIREBall-2 UV experiment,” Proc. SPIE, 9601, 96010N (2015). https://doi.org/10.1117/12.2190167

2. E. Hamden et al., “FIREBall-2: the faint intergalactic medium redshifted emission balloon telescope,” Astrophys. J., 898 (2), 170 (2020). https://doi.org/10.3847/1538-4357/aba1e0

3. G. Kyne et al., “Delta-doped electron-multiplying CCDs for FIREBall-2,” J. Astron. Telesc. Instrum. Syst., 6 (1), 011007 (2020). https://doi.org/10.1117/1.JATIS.6.1.011007

4. L. K. Harding et al., “Technology advancement of the CCD201-20 EMCCD for the WFIRST coronagraph instrument: sensor characterization and radiation damage,” J. Astron. Telesc. Instrum. Syst., 2 (1), 011007 (2015). https://doi.org/10.1117/1.JATIS.2.1.011007

5. A. D. Jewell et al., “Ultraviolet detectors for astrophysics missions: a case study with the star-planet activity research cubesat (SPARC),” Proc. SPIE, 10709, 107090C (2018). https://doi.org/10.1117/12.2312972

6. D. R. Ardila et al., “The Star-Planet Activity Research CubeSat (SPARCS): a mission to understand the impact of stars in exoplanets,” (2018).

7. P. A. Scowen et al., “Monitoring the high-energy radiation environment of exoplanets around low-mass stars with SPARCS (Star-Planet Activity Research CubeSat),” Proc. SPIE, 10699, 106990F (2018). https://doi.org/10.1117/12.2315543

8. N. Turner et al., “PoZoLE: a tiny space telescope to snap our solar system’s ultraviolet selfie,” in Amer. Astron. Soc. Meeting Abstracts (2022).

9. Teledyne e2v, “CCD201-20 datasheet electron multiplying CCD sensor,” (2019).

10. P. Jerram et al., “The LLCCD: low-light imaging without the need for an intensifier,” Proc. SPIE, 4306, 178–186 (2001). https://doi.org/10.1117/12.426953

11. K. Matsuo, M. Teich and B. Saleh, “Noise properties and time response of the staircase avalanche photodiode,” J. Lightwave Technol., 3 (6), 1223–1231 (1985). https://doi.org/10.1109/JLT.1985.1074334

12. O. Daigle et al., “CCCP: a CCD controller for counting photons,” Proc. SPIE, 7014, 70146L (2008). https://doi.org/10.1117/12.788929

13. N. Bush et al., “Measurement and optimization of clock-induced charge in electron multiplying charge-coupled devices,” J. Astron. Telesc. Instrum. Syst., 7 (1), 016002 (2021). https://doi.org/10.1117/1.JATIS.7.1.016002

14. M. S. Robbins and B. J. Hadwen, “The noise performance of electron multiplying charge-coupled devices,” IEEE Trans. Electron. Devices, 50 (5), 1227–1232 (2003). https://doi.org/10.1109/TED.2003.813462

15. O. Daigle, C. Carignan and S. Blais-Ouellette, “Faint flux performance of an EMCCD,” Proc. SPIE, 6276, 62761F (2006). https://doi.org/10.1117/12.669433

16. O. Daigle et al., “Extreme faint flux imaging with an EMCCD,” Publ. Astron. Soc. Pacific, 121 (882), 866 (2009). https://doi.org/10.1086/605449

17. O. Daigle and S. Blais-Ouellette, “Photon counting with an EMCCD,” Proc. SPIE, 7536, 753606 (2010). https://doi.org/10.1117/12.840047

18. O. Daigle et al., “Astronomical imaging with EMCCDs using long exposures,” Proc. SPIE, 9154, 91540D (2014). https://doi.org/10.1117/12.2056617

19. A. Basden, C. Haniff and C. Mackay, “Photon counting strategies with low-light-level CCDs,” Mon. Not. R. Astron. Soc., 345 (3), 985–991 (2003). https://doi.org/10.1046/j.1365-8711.2003.07020.x

20. C. Cherry et al., “An experimental study of the possible bandwidth compression of visual image signals,” Proc. IEEE, 51 (11), 1507–1517 (1963). https://doi.org/10.1109/PROC.1963.2620

Biography

Brian M. Sutin is an instrument systems engineer at Caltech/Jet Propulsion Laboratory. He specializes in the design and performance modeling of space-based sensing instruments, particularly the radiometry, end-to-end system engineering, and calibration plans required to demonstrate that an instrument concept will meet science requirements.

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 International License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.

Brian M. Sutin "Statistics of electron-multiplying charge-coupled devices," Journal of Astronomical Telescopes, Instruments, and Systems 9(2), 028001 (6 April 2023). https://doi.org/10.1117/1.JATIS.9.2.028001

Received: 28 January 2023; Accepted: 10 March 2023; Published: 6 April 2023
