1 / 39

Interferometric Imaging in CASA An Introduction for ALMA reduction

Interferometric Imaging in CASA An Introduction for ALMA reduction. Steven T. Myers. National Radio Astronomy Observatory Socorro, NM. Radio Interferometric Imaging in theory…. Classic Radio Interferometer. e.g. The EVLA or ALMA a bunch (27 to 50) antennas connected together

haile
Download Presentation

Interferometric Imaging in CASA An Introduction for ALMA reduction

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Interferometric Imaging in CASAAn Introduction for ALMA reduction Steven T. Myers National Radio Astronomy Observatory Socorro, NM

  2. Radio Interferometric Imaging in theory…

  3. Classic Radio Interferometer • e.g. The EVLA or ALMA • a bunch (27 to 50) antennas connected together • independent elements  Earth rotation synthesis

  4. baseline vector B Interferometer Baselines • Baseline vector B in “aperture plane” • coherent signal applied to interferometer would produce plane-wave interference “fringe” on sky with angular period l/B q = λ /B interferometer naturally decomposes sky into plane waves!

  5. each point on aperture gets correlated with each point on other aperture autocorrelations measure uv spacings inside D/l The Aperture Plane • Correlate wavefronts in plane of apertures (Fourier transform of sky) • dish optics sum aperture plane at focus • visibility is cross-correlation of wavefronts of the 2 apertures visibility contains range of baselines from closest to furthest parts of apertures interferometer cannot measure “zero-spacing” w/o autocorrelations

  6. aperture xcor width 2D/λ From sky to uv-plane The uv-plane is the Fourier Transform of the tangent plane to the sky F-1 2.5o baseline u = B/λ ℓ = 2p|u|= 2p|B|/λ F Sky Plane x = (x,y) Fourier Plane u = (u,v)

  7. Example: VLA observes Jupiter • A 6cm VLA observation of Jupiter: Fourier transform of nearly symmetric planetary disk bad data

  8. Reconstruction of the Sky • Visibilities and the Sky v = AF-1 s + n • A known instrumental response, but is not invertible • instrumental noise n is a random variable • The issues: • unknown random noise n • convolution due to size of A in uv domain • incomplete sampling of uv-plane by visibilities • Maximum Likelihood - Optimal Map mMLE = ( ATN-1A)-1ATN-1v = R-1dd = Hv No inversion R singular (at best ill-conditioned) for fully sampled s

  9. The Dirty Map • Grid onto sampled uv-plane d = Hv = Hs + nd • H should be close to HMLE, e.g. H = ATN-1 : A ~ A • AT should sample onto suitable grid in uv-plane • reminder: need only be approximate for gridding • Invert onto sky  “dirty image” d = Fd = R s + ndR = FRF-1 • image is “dirty” as it contains artifacts • convolution by “point spread function” (columns of R) • multiplication by response function (diagonal of R) • noise

  10. VLA point-spread function (PSF) • The VLA is an example of a sparsely filled array • uv-plane gaps are treated as zeroes, cause “sidelobes” in PSF • many solutions for sky that fit data, “dirty image” is principal solution • must use “deconvolution” techniques to “clean” image snapshot uv-coverage PSF “dirty beam” “dirty” image “clean” image Example: VLA 30s snapshot discovery data for gravitational lens CLASS B1608+656 (Myers et al. 1995, ApJL, 447, L5-L8)

  11. Image, uv, and Data Spaces • image plane  uv-plane  visibilities • operators F , H , A handle these transformations • not all operators have inverses (H and A do not) • example: model image m • first transform sky model to uv-plane m = F-1 m • then project onto the visibilities (data space) vm = Am = A F-1 m • form residual dvm = v-vm = A ( s - m ) + n • finding “best model” will involve minimizing this residual

  12. Classic Deconvolution • uv-plane CLEAN algorithm (“major cycles”) • iterate on residual images removing point models • initial residual data, and model: dv0 = v m0 = 0 • form dirty image: d0 = FHdv0 • locate peak and residual and put fraction f into model dm1 = f M d0 mask M : 1 at max, else 0 • increment model: m1 = m0 + dm1 • form cumulative visibilities and residual v1 = Am1 = AF-1 m1dv1 = v – v1 • form new dirty residual image: d1 = FHdv1 • and repeat until final residual image dfis noise-like

  13. image-plane CLEAN • CLEAN in the image plane (minor cycles) • initial model: m0 = 0 • form dirty image: d0 = FHdv0 • locate peak and residual and put fraction f into model dm1 = f M d0 mask M : 1 at max, else 0 • increment model: m1 = m0 + dm1 • subtract from dirty image to make dirty residual image d1 = d0 – Rdm1 R ~ R (the PSF) • and repeat until final residual image dfis noise-like • size of kernel R (Hogbom=full-size, Clark=quarter size) • specified by psfmode in CASA

  14. CLEAN variations • Cotton-Schwab (CS) CLEAN • break into major (uv-plane) & minor (image-plane) cycles • in CASA initiated by imagermode=‘csclean’ • some options ( ‘mosaic’, ‘mfs’ nterms>1) hardwired cs • clean in minor cycles to a threshold where max residual is some fraction of starting max residual • in CASA will be cyclefactor x max psf sidelobe • more major cycles = more accurate cleaning but slow • poor PSF, simple image structure = lower cyclefactor (<1) • good PSF, complex image structure = raise cyclefactor (>1) • purpose: correct errors from gridding & minor cycles • the transform back from model visibilities is as accurate as we wish to make it

  15. CLEAN Example • Jupiter 6cm – interactive cleaning in CASA as imaging proceeds, include regions containing emission in mask breaks between interactive cycles are major cycles

  16. Gridding kernels in CLEAN • This is a choice: d = Hv • Default kernel: H0 = N-1 • includes only noise (inverse variance) weighting • Optimal (mosaic) imaging: H = ATN-1 • uses aperture function (A ~ A) • will grid mosiacs onto same uv-plane • can correct for known pointing errors • Frequency synthesis kernel (MFS) • add kernels corresponding to Taylor expansion terms • G0k= 1 G1k= ln(nk/n0) for each channel k

  17. Mosaicing in the sky and uv planes offset & add phase gradients

  18. Mosaicing kernel • the usual equation (aperture and offset term) • note that we assume “phased up” at each pointing xk ! • kernel B and its transform (Fourier shift theorem) • B = F-1BF is the “mosaicing” kernel (A-projection) • if offset xk is unknown, then this is a “pointing error” • offset for R and L polarizations is the “squint” term

  19. Mosaicing considerations • In CASA, imagermode=‘mosaic’ • ftmachine=‘mosaic’ uses A-projection kernel • grids to single uv-plane, optimally weights fields, most efficient • can be some aliasing, keep mosaic to inner quarter of imsize • ftmachine=‘ft’ does standard image-plane shift+add • takes much longer, not recommended • mosaic uses csclean, watch cyclefactor • uses approximate single PSF for entire mosaic • will “correct” in major cycles • uses POINTING table when present (FIELD for phases) • Outputs • standard: .image, .model, .psf • .flux (PB plus weights plus extra PB from A-convolution) • .flux.pbcoverage (just the PB effects) • e.g. to correct to flux on-sky use .image/.flux

  20. MEM and CLEAN • CLEAN • algorithm: find peak in residual image; add fraction to model; form new residual data & residual image; iterate • performance: good on compact emission, difficult for extended • Maximum Entropy Method (MEM) • algorithm: for pixel values p : maximize entropy -S p ln p ; minimize c2(p) to encourage “smooth” extended emission • performance: complicated, suppresses spiky emission, but fast • CLEAN and MEM use point (pixel) basis • complete basis – unique representation of image

  21. Sparse Approximation Imaging • Problem: find a model to represent the sky as efficiently as possible, subject to the data constraints and within the noise uncertainty, possibly also subject to prior constraints. • some problems (like ours) cannot be efficiently reconstructed using orthonormal bases (like pixels or Fourier modes) • extensive literature on this! • use non-orthogonal bases: multiscale (e.g. Gaussians) • choose dictionary of model elements (atoms) • efficiency: find a representation that uses the fewest number of atoms

  22. Example: MEM versus MS-CLEAN Restored Residual Error Maximum Entropy MS Clean

  23. The Future of Multiscale Methods • Algorithms • mostly iterative, starting from a blank model • “greedy” methods make locally optimal choices at each step • MS-CLEAN is a greedy algorithm in this class! • dictionary of points and Gaussians on different scales • is essentially a “Matching Pursuit” (MP) algorithm (e.g. Tropp 2004) • Key Research Area in CASA • new arrays are designed for high dynamic range & fidelity • we need efficient, robust, and accurate multiscale methods • integrate multiscale (MS) with multispectral (MFS)

  24. Calibration and Imaging • Some effects corrupt the visibilities • most are on a per-antenna basis, other per-baseline • antenna based effects can be “self-calibrated” out • The Measurement Equation (ME) • the Jones matrices J contain the corruptions to V • there are different corruption terms to the J • gain G, pol leakage D, ionosphere F, parallactic angle P • can be direction dependent, additive RFI & correlator errors

  25. Calibration in Image Plane • Calibration errors show up as artifacts in image plane: • given an approximate model for the image we can solve for the errors  “self-calibration” through iteration Before Correction After Correction

  26. Pointing Corrections • Example of direction-dependent errors: • VLA antennas have ~10’’ pointing residual • affects high-dynamic range imaging (>104) • also “squint” between R and L beams • Work by Sanjay Bhatnagar (NRAO) • Simulation of 1.4GHz EVLA observations • Residual images • Before correction: Peak 250Jy, RMS 15Jy • After correction: Peak 5Jy, RMS 1Jy • Can incorporate into standard self-cal • Computational cost ok for now • See EVLA Memos 100 & 84 • Implementing in CASA, testing underway

  27. Polarization: Vis to Stokes in XY basis • for antennas with parallactic angles fi and fj • and for identical parallactic angles f between antennas: Linear Feeds: linear polarization in all hands, circular only in cross-hands

  28. Polarization Leakage • Primary on-axis effect is “leakage” of one polarization into the measurement of the other (e.g. X  Y) : X  X + diX Y • but, direction dependence due to polarization beam! • Calibrate out on-axis leakage and put direction dependence in “beam” • example: expand XY basis with on-axis leakage • remember XX,YY contain IQU and XY,YX contain QUV • ideally the “d”s are ~1-2% but can be worse, should be stable

  29. Primary Beam: full field polarization • VLA primary beams • Beam squint due to off-axis system • Instrumental polarization off-axis • AZ-EL telescopes • Instrumental polarization patterns rotate on sky with parallactic angle • Limits polarization imaging • Limits Stokes I dynamic range (via second order terms) • must implement during imaging Green contours: Stokes I 3dB, 6dB, black contours: fractional polarization 1% and up, vectors: polarization position angle, raster: Stokes V

  30. I V Q U Simulations on a complex model • VLA simulation of ~ 1 Jy point sources + large source with complex polarization (“Hydra A”) • Long integration with full range of parallactic angles • equivalent to weak 1.4GHz source observed with EVLA • Antenna primary beam model by W. Brisken • See EVLA memo 62

  31. I V Q U 1-D Symmetric Beam • dynamic range limited by errors from incorrect approximate primary beam Dynamic Range ~200 using symmetric beam model

  32. I V Q U 2-D Polarized Beam • need to use accurate polarized beam to reach high fidelity and dynamic range Dynamic Range ~104 using 2-D beam model

  33. CASA CLEAN Imaging Details • MFS (mode mfs) • nterm=2 compute spectral index, 3 for curvature etc. • needed for bandwidths ~5% or more (S/N dependent) • tt0 average intensity, tt1 alpha*tt0, alpha images output • takes at least nterms longer (image size dependent) • Multiscale (set non-zero multiscale list) • scales are in units of pixels • usually set to be multiples of synth. beam size • e.g. for 3x oversampling of beam: • multiscale = [0,3,6,12,24] • takes at least nscales longer (x nterms?) • can be tricky to get to work right

  34. CASA CLEAN logistics • clean can be restarted from current state • if imagename used before and files exist and same size • will first recompute residuals from model • user can input some starting conditions • previous mask (e.g. from previous clean) • previous modelimage • key toggles • mode: mfs, channel, velocity, frequency • imagermode: ‘’, csclean, mosaic • psfmode: clark, hogbom • gridmode: advanced stuff, eventually a-projection

  35. Spectral Cube Considerations • mode: channel, velocity, frequency • sets the grid of planes in output image cube • will apply doppler corrections on the fly • can set frame info, restfreq, etc. • data: taken in sky frequency (terrestrial) frame • velocity: include doppler shifts from: • Earth rotation: few km/s (diurnal) • Earth orbit: 30 km/s (annual) • Earth/Sun motion w.r.t. LSR (e.g. LSRK, LSRD) • maybe galactic rotation to extragalactic frames • these shifts are time dependent • “Doppler Tracking/Setting” adjust sky freq while observing • can shift and regrid data before imaging : cvel

  36. Continuum Subtraction • If you have a viable continuum model • e.g. via using mode mfs • can use uvsub to remove from data before imaging • BUT: must have really good accurate model (not likely) • In practice, subtract “continuum” in data • use task uvcontsub or uvcontsub2 • particularly if strong short-baseline emission • these fit polynomial to each vis spectrum and subtract • uvcontsub2 can cross spw boundaries • need to know line-free channel ranges • if uvcontsub need to fit in each spw • “continuum” not generally imageable • e.g. no closure !!!

  37. Odds and Ends • Oversampling PSF • at least 2.5x, I like 3-4x (whichever rounds well), some use 5x, no strong impact (but can make imaging longer) • Boxing • helps guide clean where to pick out real emission, important when psf sidelobes high/complex and/or when image has complicated structure • if initial calibration poor, shallow careful clean with boxing will get better model for selfcal (don’t burn in artifacts) • some cases using csclean (w/cyclefactor) you can “turn it loose”, but be careful before doing this • we are working on autoclean / autoboxing

  38. Flotsam and Jetsam • What, no clean components? • CASA clean increments the .model image • does not keep track of each iteration component • does not report total cleaned flux until end • does not keep separate multiscale components • might later store cc’s as component lists, but not now • you can mask/edit .model image • you can supply initial “model” using modelimage • this can come from anywhere! e.g. single dish • Stopping clean • when you get residual that are noiselike :) or a mess :( • ideally you should get within 2x thermal • if not, probably calibration errors, bad data, or dyn. range issues

  39. Fin • Final words • the proof of the quality of the data is in the final imaging, you generally cannot guess this from metrics on the calibrators (particularly when selfcal is possible) • Anything else? • Documentation • inline help <task> • online cookbook (massive but in depth) • casaguides

More Related