Mathematical Details

Next: Some Practical Considerations Up: Optimal Extraction Previous: General Principles

7.10.2 Mathematical Details

At a given $\lambda$ on order m, there is a true signal level equivalent to $S_{m,\lambda}$ photons accumulated over n exposures that were added together. This signal is spread over a vertical extent of approximately 20 pixels. The actual signal sensed in each pixel is quantized and has the usual Poisson statistical fluctuations. On average (in the absence of statistical uncertainties) the expected numerical outcome in each pixel identified by a subscript i is

$\begin{displaymath} I_{i,\lambda}=S_{m,\lambda}b_if_{i,\lambda}+rf_{i,\lambda} +... ...}b_{10-i}f_{i,\lambda} + S_{m+1,\lambda}b_{10+i}f_{i,\lambda}~,\end{displaymath}$

(6)

where

i: is the index that describes the y location of a pixel or horizontal slice, with i=0 being set at the location of maximum intensity for order m.
b_i: is the fraction of energy deposited in a particular horizontal section i of an order (normalized such that $\sum_{i=-10}^{10}b_i = 1.00$ ).
$f_{i,\lambda}$: is the sensitivity pattern's attenuation function averaged over n images. It is normalized so that an interblob region has $f\approx 1$ , and thus $S_{m,\lambda}$ would be the signal registered if there were no blobs with reduced sensitivity.
r: is the background illumination level.

Figure 12: An illustration of how to determine the most likely spectrum intensity $\langle S_{m,\lambda}\rangle$ along an order m at wavelength $\lambda$ . The height of the surface above the horizontal x-i plane represents the recorded image intensity. Small squares show measurements of $I_{i,\lambda}$ from i=-5 to +5. The $\times$ 's show the samples taken to determine the most likely intensities $\langle S_{m-1,\lambda}\rangle$ and $\langle S_{m+1,\lambda}\rangle$ of the contaminating orders. The entire pattern is elevated above a general background r, and the holes represent depressions in the sensitivity $f_{i,\lambda}$ .

$\begin{figure} \plotone{Fig.ord_cut.eps}\end{figure}$

The $S_{m-1,\lambda}$ and $S_{m+1,\lambda}$ terms are signals in the adjacent orders that can contaminate our sampling of $S_{m,\lambda}$ . They must be evaluated and subtracted out, and because of their uncertainties, they lower the reliability of outlying samples of $S_{m,\lambda}$ -- an effect that must be recognized when the relative weights are assigned for different i. To estimate these contamination contributions, we sampled the adjacent orders at positions near their cores, but slightly offset in the direction of the order m (see Fig. 12). This offset insures, to first order, that small errors in centering on order m automatically adjust the contamination correction in the right direction.

In setting up formulae for the variances of quantities that would come from hypothetical, repeated trials of the experiment, we must express the outcome using a general value for S, called S_m as distinguished from $S_{m,\lambda}$ , because we do not want the weight factors to be influenced by the local chance fluctuations on top of the desired and contamination signals. In practice, S_m can be pictured in terms of an average of $S_{m,\lambda}$ over a range of $\lambda$ that is large enough to make such fluctuations inconsequential.

Imagine that we could perform repeated trial measurements of I_i. We should expect to find a variance,

$\begin{displaymath} {\rm Var}(I_i)=S_mb_if_{i,\lambda}+rf_{i,\lambda}+S_{m-1}b_{10-i}f_{i,\lambda} + S_{m+1}b_{10+i}f_{i,\lambda} + n\sigma_r^2~,\end{displaymath}$ (7)

where $\sigma_r$ is the rms readout plus CCD dark current noise (§) (expressed as an amplitude relative to that of a single photoevent). Numerically, $\sigma_r = 1.6$ for a 511-frame integration over 34 s. The most effective approach for reconstructing S is to evaluate at all $\lambda$ the individual estimates for the most probable values for $S_{m,\lambda}$ , which we designate as $\langle S_{m,\lambda}\rangle$ :

$\begin{displaymath} \langle S_{m,\lambda}\rangle={I_i - rf_{i,\lambda} - S_{m-1,... ... - S_{m+1,\lambda}b_{10+i}f_{i,\lambda} \over b_if_{i,\lambda}}\end{displaymath}$ (8)

In the numerator, the first term is the basic measurement that has random fluctuations governed by Eq. 7. The second term is an ultraviolet background correction term that does not vary (i.e., it is a global correction, except for a general trend that follows the blaze function of the echelle grating). The third and fourth terms are correction factors that must be applied to cancel out the contamination signals from adjacent orders. These two terms have variations of their own that corrupt the correction process, since we can not measure $S_{m-1,\lambda}$ and $S_{m+1,\lambda}$ with perfect accuracy. The magnitude of these corrections depend on exactly how we sample the adjacent orders (preferably, near enough to their centers that we do not have to worry about these orders being contaminated!)

Associated with $\langle S_{m,\lambda}\rangle$ is its variance,

where ${\rm Var} ( \langle S_{m-1}\rangle )$ and ${\rm Var} ( \langle S_{m+1}\rangle )$ are determined through Eq. 7 without the correction terms because of the deliberate, very limited sampling reasonably near these orders' centers (to avoid the complexity of second order contamination corrections upon the first order ones).

Specifically, the intensities of the adjacent orders are determined by two measurements in each case, such that

$\begin{displaymath} \langle S_{m-1,\lambda}\rangle={I_{7,\lambda} + I_{8,\lambda... ...bda} + f_{8,\lambda})\over b_3f_{7,\lambda} + b_2f_{8,\lambda}}\end{displaymath}$ (9)

and

$\begin{displaymath} \langle S_{m+1,\lambda}\rangle={I_{-7,\lambda} + I_{-8,\lamb... ...} + f_{-8,\lambda})\over b_3f_{-7,\lambda} + b_2f_{-8,\lambda}}\end{displaymath}$ (10)

The choice of using i = 7 and 8 is a matter of judgement. Measurements closer to the centers of the other orders will be more accurate. However, we then lose the automatic compensation for centering errors. The simple sum without weight factors reduces the complexity of the equation and is justified on the basis that b₂ is not very different from b₃. As before, the expected variance in the estimate is given by

$\begin{displaymath} {\rm Var}(\langle S_{m-1,\lambda}\rangle)={S_{m-1}(b_3f_{7,\... ... + 2n\sigma_r^2\over (b_3f_{7,\lambda} + b_2f_{8,\lambda})^2}~,\end{displaymath}$ (11)

and likewise for $S_{m+1,\lambda}$ .

Now that we have derived formulae for the best estimate of $S_{m,\lambda}$ and its reliability for a given value of i, we must combine the measurements at different i in an optimum manner. We also need to have a measure of the uncertainty in the outcome, in case we wish to combine the extraction with other ones.

For measurements with different uncertainties, the standard way to combine them is by evaluating a weighted average, with weights that are inversely proportional to the variances:

$\begin{displaymath} \langle S_{m,\lambda}\rangle = {\sum_i\bigl[\langle S_{m,\la... ...i\bigl[ 1 /{\rm Var} (\langle S_{m,\lambda}\rangle )_i\bigr]}~.\end{displaymath}$ (12)

The error in the result is given by

$\begin{displaymath} \sigma(\langle S_{m,\lambda}\rangle) = \Bigl\{ \sum_i\,\bigl... ...{\rm Var} (\langle S_{m,\lambda}\rangle )_i\bigr]\Bigr\}^{-1/2}\end{displaymath}$ (13)

The combination shown in Eq. 13 is not strictly ideal, because the correction terms for interference from adjacent orders will have correlated errors. This effect is probably rather small, since errors are strongly dominated by the background fluctuations (in the r terms) in cases where the correction amounts to much.

Next: Some Practical Considerations Up: Optimal Extraction Previous: General Principles

Karen Levay
12/15/1998