-
Notifications
You must be signed in to change notification settings - Fork 4
/
132.latex
442 lines (390 loc) · 20.7 KB
/
132.latex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
\documentstyle[12pt]{article}
\begin{document}
\newcommand{\x}{{\bf{x}}}
\newcommand{\xp}{{{\bf x}_p}}
\newcommand{\xpi}{{{\bf x}_p^i}}
\newcommand{\bu}{{\bf{u}}}
\newcommand{\arcsper}{\ifmmode \rlap.{'' }\else $\rlap{.}'' $\fi}
\newcommand{\arcs}{\ifmmode {'' }\else $'' $\fi}
\newcommand{\arcm}{\ifmmode {' }\else $' $\fi}
\begin{center}
{\Large \bf Exotic Imaging Methods:}\\
{\bf A Guide For Masochistic AIPS++ Designers}\\
M.A. Holdaway\\
S. Bhatnagar\\
April 7 1992\\
\end{center}
This document examines mosaic imaging algorithms and
non-isoplanatic calibration and imaging algorithms
in the light of the ImagingModel and TelescopeModel objects in AIPS++.
\section{Mosaicing}
A mosaic image is produced from interferometer data taken from
several different, adjacent pointings on the celestial sphere.
A single-pointing of data can be treated as a mosaic. One
does not want to force the treatment of all single-pointing data
as mosaics, ie, one does not always want to image the entire
primary beam (PB) or correct for the PB.
The mosaic machinery also allows for combining data from telescopes
with different PB's and different imaging models, an interferometer
and a single dish, for example.
\subsection{Operations on Mosaic Datasets (Visibilities)}
One will need to perform ``bulk'' operations on
data from all pointings from one telescope (ie, calibration).
One will need to perform ``piecemeal'' operations on all data
from a single pointing from a single telescope (ie, gridding/FFTing).
For convenience of ``bulk'' operations, all the data from one
telescope should be stored together in one dataset.
Then the ``piecemeal'' operations could be performed
by selecting out subsets of data. In addition to all
the generic data selection possible with single-pointing
interferometers, one must be able to select on sky pointing position
(a single dish selector) and Telescope.
Something like the YegSet Iterator which would step through
all pointings one at a time would be nice.
It is unclear whether the data from multiple telescopes should
be stored in one dataset or in one dataset for each telescope.
The number of different telescopes used will usually be one or two,
so association of mosaic datasets from different telescopes
{\it by hand} will not be too cumbersome.
\subsection{Images}
The final product of the MOSAIC imaging algorithm is a plain old
image. The main difference between a MOSAIC image and an image
produced by CLEAN, say, is that it is inappropriate to apply
a PB to the final MOSAIC image: the parameters OBSRA, OBSDEC, and TELESCOP
are not uniquely specified. While it is unclear how they would be used,
if these parameters are present at all, they should be in arrays attached to the
image indicating ALL pointings of ALL telescopes.
There may or may not be an associated SENSITIVITY image, which is
essentially a generalized primary beam, and can be used to determine
the theoretical thermal noise in each pixel.
The fundamental equation at work in mosaicing is derived by making
a pixel by pixel least square fit to the true sky brightness from a number of
overlapping images weighted by the PB:
\begin{equation}
I_{mosaic}(\x) = \frac {\sum_{i=1}^{Npointings} I_{i} A_{i}(\x - \x_{i})}
{\sum_{i=1}^{Npointings} (A_{i}(\x - \x_{i}))^{2}} \label{EQN:MOSAIC}
\end{equation}
where $I_{i}(\x)$ is an image made at pointing $i$, and $A_{i}(\x - \x_{i})$
is the primary beam of the telescope used for pointing $i$ with
pointing center $\x_{i}$. The square root of the denominator is
the sensitivity image.
It has not been specified what kind of
images the $I_{i}(\x)$ are. In some cases, $I_{i}$ will be independently
deconvolved images. In this case, they may have some permanence
outside the mosaicing method. More common, at least when the MMA
comes on line, will be the case where $I_{i}$ are dirty images
produced for the sole purpose of making a mosaiced image. The
mosaiced dirty image may then be deconvolved with a single effective
point spread function. In this case, the $I_{i}$ have no permanence.
And finally, the most complicated case:
\begin{equation}
I_{i}(\x) = I^{D}_{i}(\x) - PSF_{i} \ast (A_{i}(\x - \x_{i})\hat{I}(\x))
\end{equation}
where $I^{D}_{i}(\x)$ is the dirty image made from data from pointing $i$,
$PSF_{i}$ is the point spread function for pointing $i$,
and $\hat{I}(\x)$ is a model image of the entire mosaic field.
An unnormalized mosaic (ie, not divided by the denominator
in Equation~\ref{EQN:MOSAIC})
of this quantity is proportional to the gradient of $\chi^{2}$
with respect to the model image, and indicates how the model image
must be altered in the next iteration of a maximum entropy mosaicing scheme.
In this case, the $I^{D}_{i}(\x)$ and the $PSF_{i}$ (or more conveniently, the
Fourier transform of the $PSF_{i}$) should persist from iteration to iteration
for efficiency sake, though they could be recreated on each iteration if
memory or disk space were in short supply. The intermediate images
$I_{i}(\x)$ can disappear as soon as they are accumulated into the
gradient mosaic.
The $I^{D}_{i}$ and $PSF_{i}$ do not need to persist after the final mosaic
image is formed. While it may be convenient to place each of the $I^{D}_{i}$
and $PSF_{i}$ into {\it collections} of images, they are not really {\it images}
of images. The details of how each of the $I_{i}$ are combined should be
implicit in the coordinate headers of the $I_{i}$ and the coordinate
header of some template image (ie, the final mosaic image or the gradient
image).
\subsection{Pathological Cases}
The following two ImagingModels require knowledge of some parameters
which seem to reside in the TelModel. The extent of the dependence
of the ImagingModel on the TelModel is analyzed for the case of
varying antenna voltage patterns and varying antenna pointing position.
\subsubsection{Different voltage patterns for each antenna}
If the primary beams are different for each antenna,
then a single visibility is related to the sky brightness
distribution via a Fourier sum:
\begin{equation}
V_{j,k}(\bu) = \int I(\x)E_{j}(\x^{\prime}-\x_{p})
E^{\ast}_{k}(\x^{\prime}-\x_{p})
e^{-i2\pi \x \cdot \bu} d \x \label{EQN:VPCALC}
\end{equation}
where $E_{j}(\x)$ is the complex voltage pattern
for the $j$ antenna ($PB(x) = |E(x)|^{2}$), and $\x^{\prime}$ is related
to $\x$ by rotation for azimuth-elevation mount antennas.
For the general case of $E_{j} \ne E_{k}$ for $j \ne k$, it is more
efficient to use a direct Fourier sum than an FFT to calculate a model
visibility from a model sky brightness distribution since the calculation
must be performed one visibility point at a time.
Correction for different voltage
patterns may be important for high dynamic range MMA mosaic
observations. The voltage patterns would probably be determined
holographically. An imaging algorithm would then adjust the image
iteratively using a gradient of $\chi^{2}$ with respect to $I(\x)$
which looks something like
\begin{equation}
\Delta I(\x) = 2\sum_{i=1}^{Npointings} \sum_{j,k;j<k}
FT(V_{i;j,k} - \hat{V}_{i;j,k})
E_{j}(\x - \x_{i})E_{k}(\x - \x_{i}) \label{EQN:VPDELTA}
\end{equation}
where $\hat{V}_{i;j,k}$ is the model visibility calculated from
the current model of the sky brightness distribution as per
Equation~\ref{EQN:VPCALC}.
The basic requirement from Equations~\ref{EQN:VPCALC} and \ref{EQN:VPDELTA}
is that the ImagingModel must have access to the complex voltage patterns
for each antenna. Actually, ``VoltagePattern.Apply'' is
the only method required external to ImagingModel.
Under the current understanding, the
voltage patterns would reside in the TelModel. If this is so,
then the VPMosImagingModel requires very little of the TelModel.
\subsubsection{Different pointing centers for each antenna}
Pointing errors are thematically similar to different voltage patterns.
A single visibility is related to the true sky brightness distribution
as
\begin{equation}
V_{j,k}(\bu) = \int I(\x)E(\x^{\prime}-\x_{p_{j}})
E^{\ast}(\x^{\prime}-\x_{p_{k}})
e^{-i2\pi \x \cdot \bu} d \x \label{EQN:PECALC},
\end{equation}
where the differences between Equation~\ref{EQN:PECALC} and
Equation~\ref{EQN:VPCALC}
are the inclusion of antenna-dependent pointing positions $\x_{p_{j}}$
and the omission of antenna dependent voltage patterns. Like the voltage
pattern case, a direct Fourier sum must be used. The equation for the
gradient image will be omitted to save on indices, but the basic idea
is the same as for the antenna-dependent voltage pattern case.
The PEMosImagingModel must have access to a method which supplies
either a pointing correction or an absolute pointing position for
a given antenna and at a given time. Again, this seems to require
that the PEMosImagingModel have knowledge of the TelModel.
It is unclear whether these imaging methods for these two
pathological cases will actually be implemented. However,
it is known that {\it simulations} of pointing errors and
voltage pattern errors (as caused by surface errors, for example)
is required for the design of the MMA and other high frequency
instruments.
\newpage
\section{Non-isoplanatic Imaging}
At low frequencies, the imaging problems related to the ionosphere are
aggravated. The field of view is large and the ionospheric phases
cannot be considered to be a function of time alone - they become a
function of direction in the sky as well. The phase delays goes as
$\lambda^2$ at low frequencies and that makes it essential to correct
the raw phases as a function of direction in the sky to get moderate
dynamic range maps. Also the 2D approximation in inverting the
visibilities to get the image breaks down. A 2D Fourier Transform is
no longer valid.
The problem of 3D inversion is well understood and implemented (in SDE
etc.). Here we attempt to describe the proposed algorithm for doing
SelfCal for the non-isoplanatic case. Implementing these algorithms
in AIPS++ will be a good test of the flexibility of the system and also
a requirement.
\subsection{The problem}
In ordinary SelfCal, the gain corrections applied to the visibilities,
are assumed to be antenna based and constant for the entire field of
view of the individual antennas for the given time range (the SelfCal
integration time interval). This means that a single gain is
associated with a given antenna, at a given time. In other words,
the properties of the ionosphere do not vary appreciably over
each antenna's PB.
However, when the field of view is large, the gain associated with the
individual antennas is a function of the direction in the sky.
Therefore, the gain solution needs to be a function of time as well as
the direction in the sky.
One of the solution suggested (Schwab), divides the PB of each antenna
into smaller patches, each of which can be assumed to be isoplanatic
and then solve for the gains associated with each of these patches,
for each antenna. Thus for an array with N antennas, and each PB divided
into M patches, the number of unknowns to be solved for will be $(M*N)-1$
(phases for all the patches in the reference antenna cannot be set to zero).
The number of patches that one needs to have in the PB will depend on the
size of the iso-planatic patches and that in turn will depend on the
stability of the ionosphere, observing frequency, and PB size.
For the iso-planatic SeflCal, for an array with N antennas, the
problem is overdetermined by a factor of N/2. If the source model is
inaccurate or the S/N ratio is poor, for SelfCal to suceed, this
overdeterminacy is essential. For non-isoplanatic case, the limit on
the maximum number of patches is set by this requirement of
overdeterminacy and overdeterminacy decreases as the number of patches
increase. For large M (M will be large when the ionosphere is bad and
the iosoplanatic patches are small), accurate initial model and high
S/N ratio will be required.
For arrays like GMRT and VLA, the primary beams of most of the
antennas will overlap at the typical ionospheric height (300 Km) for
low frequency ($<$300~MHz) observations. This means that in the above
scheme, the gains associated with the patches where the PBs overlap,
will be solved for more than once (for each of the PBs in the
overlapping region). Since the total number of unknows should be less
than the number of baselines, this restricts the total number of
patches in the PB to less than $N(N-1)+2/2N$. This also adds the
extra computation of solving for redundant unknowns in the overlapping
regions.
In the scheme proposed by Subrahmanya, the overlap of PBs at the
ionospheric height, is taken into account by dividing the ionosphere,
intercepted by the entire array, into isoplanatic patches and solving
for the associated gains of these patches. Thus, for an array with N
antennas with the ionosphere divided into M isoplanatic patches, the
total number of unknowns would be N+M-2. This will relax the limit on
the maximum possible number of isoplanatic patches to $N(N-1)/2 + +2
-N$. However, in general, the ionosphere cannot be regarded as a thin
layer and the ionospheric structure in vertical direction has to be
considered.
\subsection{Formulation of the problem}
For the case of isoplanatic ionosphere, the measured visibility can be
written as
\begin{equation}
V_{pq}^{iso} = V^{mod} e^{i(\phi_q(t) - \phi_p(t))}
\end{equation}
where $\phi_q $ is the antenna based phase.
Since the the antenna based phases are a function of time alone, the
assumption of iso-planaticity is implicit.
Let the model sky image be represented by
\begin{equation}
B(x,y) = \sum_k A_k \delta (x-x_k, y-y_k)
\end{equation}
and the model visibility given by
\begin{equation}
V^{mod} = \sum_k \xi_k
\end{equation}
\begin{equation}
\xi_k = A_k e^{i(ux_k + vy_k + w\surd{1-x_k^2-y_k^2})}.
\end{equation}
In the non-isoplanatic case, the observed visibilities can be
expressed as
\begin{equation}
V_{pq}^{non} = e^{i(\phi_q(t) - \phi_p(t))} \sum_k \xi_k e^{i(\psi_k(p)
- \psi_k(q))} \label{EQN:NIVIS}
\end{equation}
where $\psi_k(p)$ is the total antenna based phase (apart from the
phase due to the source) in the PB of antenna p in the direction
$(x_k, y_k)$ (the isoplanatic case is a special case of this equation
where $\psi_k(p) = \psi_k(q)$). If $\theta_l(p)$ represent the
phase associated with the $l^{th}$ cell, the total phase $\psi_k(p)$
can be written as
\begin{equation}
\psi_k(p) = \sum_l w_l(p;x_k, y_k) \theta_l
\end{equation}
where $w_l(p;x_k, y_k) \theta_l$ are weights associated with some
interpolation scheme used for interpolation.
If the model visibility is represented as
\begin{equation}
V^{mod}_{pq} = e^{i(\phi_q - \phi_p)} \sum_k \xi e^{i(w_l^{qk} -
w_l^{pk})\theta_l}
\end{equation}
the unknowns are the $\phi$'s and $\theta$'s and can be solved for
by minimizing the equation
\begin{equation}
S = \sum_p \sum_q |V^{non}_{pq} - V^{mod}_{pq}|^2. \label{EQN:NICAL}
\end{equation}
\subsection{The operations required}
\subsubsection{Non-Isoplanatic Selfcalibration}
In the case of iso-planatic SelfCal, the algorithm walks through the
time slices and generates the antenna gains as a function of time by
solving for different time slices.
For the non-isoplanatic case, the algorithm has to not only walk
through the time slices, but also the space slices - the iso-planatic
patches and generate antenna gain solutions as a function of direction
in the sky for each time slice.
The most basic additional requirement for handling non-isoplanatic
selfcalibration is to be able to get the estimate of the visibilities as a
function of the direction in the sky. This requirement is explicit in
Equation~\ref{EQN:NIVIS}.
To start the non-isoplanatic imaging cycle, a model image is needed.
At very low frequencies, the phases will be stable for less than
a minute. With many isoplanatic patches across the PB and source
structure which is often somewhat homogeneous (ie, no dominant
point source), getting a model image to begin the selfcalibration
cycle is non-trivial. A point-source model is usually inappropriate.
A reasonable solution would be to observe the region of interest
at higher frequencies where phase stability is not a problem
and use the higher frequency image as a starting model.
Once a model image exists and each patch of sky's contribution to
the model visibilities can be calculated,
a least-square minimization can be done on Equation~\ref{EQN:NICAL}
to get solutions of $\theta_l$ and $\phi_p$. Since the visibilities are
required as a function of the direction in the sky, Solve requires
the ImagingModel's PredictYegs method and therefore couples the
TelModel with the ImagingModel explicitly.
One requirement on TelModel is that the gains must always be iteratively
adjusted; you cannot apply the gains to a visibility set and then
throw the gains away since the visibility set will then be
phase stable for only one patch of sky.
\subsubsection{Non-Isoplanatic Imaging}
Once direction dependent gains exist, we can go about the work of
imaging. Two approaches seem fruitful: a direct method, which
applies the ``local'' gains for each patch, inverts the visibilities,
and cleans; and an inverse approach, which tries to find the image
which reproduces the data (raw) visibilities after unapplying the
gains.
The direct approach is similar in spirit to MX and the SDE task
FLY. It will likely be built onto FLY since FLY correctly
deals with the non-coplanar baseline problem already. FLY
generates a wide-field image from a collection of overlapping
2-D facets, each tangent to the celectial sphere, reducing the
computation required to solve the full 3-D problem. In general,
the facet size will be much smaller than the isoplanatic patch
size, and the facet size is resolution dependent while the isoplanatic
patch size is not. So, a patch will probably cover many facets.
In an $N$ facet FLY, the visibilities are gridded onto $N$ different
Fourier planes, each conjugate to a facet plane, and Fourier
transformed. The brightest feature in all facets is found.
CLEANING in each facet proceeds independently until the maximum
residual in each facet reaches the global maximum times the
synthesized beams peak sidelobe level. At this point, the
clean components which have been found in each facet are
Fourier transformed and subtracted from the data visibilities.
This ends one round of the major cycle. The major cycle is repeated
until the desired image residual is achieved.
FLY can easily be modified to treat nonisoplanatic imaging.
I assume that the gains for the isoplanatic patches have been
determined from selfcal. As noted above, an isoplanatic patch
probably spans severl FLY facets.
However, the gains from adjacent patches may be different enough
to warrent some interpolation of the gains in sky position.
Some form of error analysis must be carried out to find out
how small the image sections must be to produce the desired
image fidelity. Smaller chunks will lead to less efficiency.
We will ignore the interpolation problem in this discussion.
To image, apply the gains for the $i^{th}$ isoplanatic
patch to the visibilities. Grid them onto the Fourier planes and
transform to produce the facets within the $i^{th}$ isoplanatic
patch. Loop over all isoplanatic patches, applying the appropriate
gains. Again, conservative CLEANING proceeds, stopping in each
facet when the maximum residual gets close to the global image
peak before CLEANING started times the maximum sidelobe level.
At this point, the clean components from each facet are
Fourier transformed back to their respective visibility planes,
the gains from the appropriate isoplanatic patch are {\em un-applied}
to the component visibilities just generated, and the newly corrupted
component visibilities are subtracted from the raw, uncalibrated
visibilities. This is the end of the major cycle, which is repeated
until the desired image residual is achieved.
Both the Gainapply and Gain-unapply methods of TelModel are
required by ImagingModel,
The inverse nonisoplanatic imaging method will not be developed in detail
since it is not clear to us how to solve the noncoplanar baseline problem
within its single-image context. It may be useful for VLBI or linear arrays.
The inverse method requires the same services of TelModel or ImagingModel
which the calibration and the direct non-isoplanatic imaging algorithm require.
\subsubsection{Summary of Required Operations}
\begin{itemize}
\item For each time interval, there is a mapping for each antenna
of the ionosphere's isoplanatic patches onto the celestial
sphere. Both ImagingModel and TelModel must know about this
mapping.
\item TelModel's Solve method must use the method PredictYegs for
each patch on the source.
\item There can be no ``split'' to apply all the gains to the
raw visibilities; rather, the gains must be updated and
the raw data visibilities must be kept.
\item ImagingModel must use the TelModel.Gain-unapply and
TelModel.Gainapply. (The inverse method only requires
Gain-unapply.)
\end{itemize}
\end{document}