Abstract

This paper reports a method for three-dimensional (3D) analysis of shift-invariant pattern recognition and applies to holographic images digitally reconstructed from holographic microscopes. It is shown that the sequential application of a 2D filter to the plane-by-plane reconstruction of an optical field is exactly equivalent to the application of a more general filter with a 3D impulse response. We show that any 3D filters with arbitrary impulse response can be implemented in this way. This type of processing is applied to the two-class problem of distinguishing different types of bacteria. It is shown that the proposed technique can be easily implemented using a modified microscope to develop a powerful and cost-effective system with great potential for biological screening.

1. Introduction

In the past, high-resolution imaging of three-dimensional (3D) objects, or matter suspended in a volume of fluid, has mainly been accomplished using confocal microscopes [1]. In recent years, however, attention has returned to wide-field optical microscopy using coherent illumination and holographic recording techniques that exploit advances in digital imaging and image processing to compute 3D images. In contrast, with confocal imaging, coherent microscopy provides 3D information from a single recording that can be processed to obtain imaging modes analogous to dark field, phase or interference contrast as required [27]. In comparison with incoherent microscopes, a coherent instrument provides an image that can be focused at a later stage and can be considered as a microscope with an extended depth of field. For screening purposes, the increased depth of field is significant, particularly at high magnifications and high numerical aperture. For example a conventional, high magnification microscope has a depth of field of only a few microns whereas a comparable coherent instrument can have a depth of field of a few millimetres or so. This means that around 1000 times the volume of fluid can be screened from the information contained in a single digital recording [8].

The potential of coherent microscopes for automated biological screening is clearly dependent on the development of robust image or pattern recognition algorithms [9]. In essence, the application of pattern recognition techniques to coherent images is similar to that applied to their incoherent counterpart. The task can be defined as that of highlighting objects of interest (e.g., harmful bacteria) from other clutter (e.g., cell tissue and benign bacteria). This process should be accomplished regardless of position and orientation of the objects of interest within the image. It can be accomplished using variations on correlation processing. Linear correlation processing has been criticized in the past for its lack of rotation invariance and its inability to generalize in the manner of neural network classifiers; however, a cascade of correlators, separated by nonlinear (decision) layers, has considerably enhanced performance [5, 10]. Furthermore, we have shown that this is the architecture a neural network classifier assumes if it is trained to provide a shift-invariant output [11, 12].

The application of linear correlation processing to the complex images recorded by a digital phase shifting interferometer has recently been demonstrated by Javidi and Tajahuerce [13]. Pattern recognition techniques implemented using a holographic microscope for the detection of microscale objects has also been considered by Dubois et al. [5, 14] In these works, the 3D sample field was reconstructed plane by plane and image classification was performed by the application of a 2D correlation filter to each of the reconstructed planes. It is noted, however, that although 2D correlation can be applied independently to different image planes it does not take into account the true nature of 3D optical fields, nor that the information in any two planes of these fields is, in fact, highly correlated [15].

In this paper, we considered, from first principles, 3D shift-invariant pattern recognition applied to optical fields reconstructed from digital holographic recordings. It will be shown that the sequential application of a 2D filter to plane-by-plane reconstructions is exactly equivalent to the application of a 3D filter to the full 3D reconstruction of the optical field. However, a linear filter designed based on the plane of focus will not necessarily work for planes out of focus, and therefore a 3D nonlinear filtering scheme is introduced into the optical propagation field. The 3D nonlinear filter is a system implemented with a general impulse response and followed by a nonlinear threshold. We will prove with experiment that a 3D nonlinear filtering structure can significantly improve the classification performance in 3D pattern recognition. In the experiment, we will apply the 3D nonlinear filter to 3D images of two types of bacteria recorded from a holographic microscope, and the enhanced classification performance will be shown.

2. Theory

Firstly, we define the 3D cross-correlation of complex functions and as where is a position vector and conventionally denotes the scalar quantity . Assume that and are the Fourier transforms of and , respectively; according to the convolution theorem, can also be written: where the superscript * denotes complex conjugation. For pattern recognition purposes, (1) and (2) are equivalent ways to describe the process of correlation filtering defined in space domain and frequency domain, respectively.

It is clear from (1) and (2) that in general 3D correlation filtering requires 3D integration (in either the space or frequency domains). However, this is not the case when correlation filtering is applied to monochromatic optical fields propagating forward, typically the holographic reconstruction of optical fields by digital or optical means. In essence this is because is nonzero only within an area of a 2D surface, and consequently is highly correlated.

According to scalar diffraction theory, the complex amplitude representing a monochromatic optical field propagation in a uniform dielectric must obey the Helmholtz equation [16] such that where is a constant. Neglecting evanescent waves that occur close to boundaries and other obstructions, it is well known that the solutions to this equation are plane waves of the form: where is a complex constant. In these equations, and are the wave number and wave vector, respectively and are defined here such that , where is wavelength. In consequence, any monochromatic optical field propagating a uniform dielectric is described completely by the superposition of plane waves such that where is the spectral density, and is the Fourier transform of such that

It is noted that because consists of plane waves of single wavelength, the values of only exist on an infinitely thin, spherical shell with a radius . In consequence, if a general 3D correlation filter with transfer function is applied to a monochromatic optical field, , then in frequency domain the product is also nonzero only on the spherical shell and consequently will obey the Helmholtz equation. If we expand (5), we have

The square root in these equations represents light propagating through the plane in the positive and negative -directions, respectively. Since most holographic recordings record the flux in only one direction, we will consider only the positive root. According to (7), we can define as the 2D projection of the spectrum onto the plane, , such that If represents the optical field in the plane , we have In addition, taking the Fourier transform, we have Equation (10) allows the spectrum to be calculated from the knowledge of the optical field propagating through a single plane. Equation (9) allows the field in any parallel plane to be calculated.

If we consider the application of a general 3D filter to the reconstruction of a propagating monochromatic field, we remember that the product only exists on the surface of a sphere. Consequently, according to the derivation from (7) to (9), we have where is the 3D correlation output in the plane , and Finally, we note that in the space domain the correlation is where Equation (13) shows that a single plane of the 3D correlation of a propagating optical field, , with a general impulse response function, , can be calculated as a 2D correlation of the field in that plane, with an impulse function, , that is defined by (14).

In the recent literature, 2D correlation filtering has been applied to complex images reconstructed from a digital holographic microscope [14]. Practically, a digital holographic microscope measures the complex amplitude in the plane of focus, and the complex amplitude images in the parallel planes are calculated based on optical propagation theory. It is noted that a linear filter that is designed to perform well in one plane of focus will not necessarily perform well in another, and therefore a nonlinear filtering process is required.

When the 3D complex amplitude distribution of samples is reconstructed from the digital holographic recording, correlation filters can be applied for pattern recognition. In the field of statistical pattern recognition, it is common to describe a digitized image of any dimension by the ordered variables in a vector [17], and we adopt this notation here. In this way, the discrete form of a complex 3D image can be written in vector notation by lexicographically scanning the 3D image array. Thus, an -dimensional vector represents a 3D image with volume elements. We define a correlation operator , with a filter kernel (or impulse response), , is defined as, where the superscript “*” denotes the complex conjugate, and the subscript is taken to be modulo such that A nonlinear threshold operator can be defined in the same way to operate on the individual components of a vector such that In general, image data from a hologram is a complex-amplitude field; however, we consider only the intensity distribution and define a modulus operator that operates on the the output such that In this way, a 3D nonlinear filter, , can be expressed as where the subscript to each operator denotes the layer in which a given operator is applied.

Without loss of generality, we design the 3D nonlinear filter to generate a delta function for the objects to be recognized, and zero outputs for the patterns to be rejected. For this purpose, we define a matrix set, , of reference images such that and the corresponding output matrix, , is given by

For the optimization of the 3D nonlinear filter, a matrix with all the desired outputs intensity images is defined. In general, the desired outputs for in-class images will be a zero-valued vector with the first element set to be unit magnitude, and for an out-of-class image the desired output is zero. In order to train the filter with the desired performance, the error function below is requested to be minimized: where and represent the th pixel of the th training and output image, respectively. The first term in this expression is the variance of the actual output from the desired output. The second term represents the signal peaks (that for simplicity are defined to be the first term in the output vector) and is given extra weight to ensure that they have the desired unit magnitude. Because (21) is a nonlinear function with a large number of variables, it is not possible to find an analytical solution. Hence, an iterative method is used in the minimization process. In this case, a simulated annealing algorithm was implemented in the optimization because it is more likely to reach a global minimum [18].

In the practical implementations of the 3D nonlinear filter described in this paper, we require a filter to identify the presence of fairly small objects in a relatively large field. In these cases, a relatively small filter kernel is used, and the kernel is zero-padded to the same size as the input image. In the test of this paper, the training images are selected to be 32 × 32 × 16 elements, and we use 16 × 16 elements transfer function (2D). The filter output, the filter kernel, and the desired output images are all zero-padded to a resolution of 32 × 32 × 16 elements. In this way, edge effects in pattern recognition for large images can be avoided.

3. Experiment

The objective of the work described in this section was to demonstrate 3D rotationally invariance pattern recognition based on digital holographic microscopy for the classification of two species of live bacteria, E. coli and Pantoea.

The digital holographic microscope setup used for this study is illustrated in Figure 1. In this arrangement, a He-Ne laser (633 nm) is used as coherent light source and is divided by a beam splitter and launched into a pair of optical fibres of equal length. One fibre supplies the light that forms the object beam for the holographic recording and is collimated. The microscope is used in a transmission mode and has an objective lens with 100x magnification and an oil immersion objective with an equivalent numerical aperture of NA = 1.25. The object plane is imaged onto a CCD array placed approximately 200 mm from the objective. It is noted that because the microscope is holographic, the object of interest need not be located in the object plane.

The fibre that supplies the reference beam has an open termination that is arranged to diverge from a point in the rear focal plane of the microscope objective. In this way, the interference of the light from the reference beam and the light scattered is recorded at the CCD. Phase curvature introduced by the imaging process [19] is precisely matched by the reference curvature, and straight interference fringes are observed in the image plane in the absence of any scattering objects. From the analysis in Section 2, we can see that the interference pattern recorded by the CCD can be demodulated to give the complex amplitude describing the propagating field in the object plane. For reasons of processing, efficiency care was taken to adjust the magnification of the microscope to match the CCD resolution such that an optimally sampled (Nyquist) reconstruction is produced.

The holographic microscope is implemented with a flow cell that defines an experimental volume. The nutrient fluid with two species of living bacteria, E. coli and Pantoea, is syringed into the flow cell through a pipe. Figure 2 shows an image taken from the microscope corresponding to the absolute value of the complex amplitude in the object plane. In this image, the bacteria understood to be E. coli are highlighted with circles; some out-of-focus bacteria are invisible on this plane. Figure 3 shows a 3D image of the field in Figure 2 reconstructed using the method demonstrated in the above section.

In this study, a 3D nonlinear filter was trained to highlight live E. coli bacteria floating in the flow cell, while the Pantoea bacteria will be ignored. However, the reference set preparation is one of the most challenging problems for the identification of the living cells because each of the live bacteria varies in size and shape and appears at random orientation. To recognise the bacteria regardless of their shapes and orientations, adequate representative distortions of bacteria images must be provided for the 3D nonlinear filter as reference images.

The bacteria images registered as training set can be obtained by directly cropping the cell images from the 3D reconstructed field, or by simulating from the recorded images. For example, a selected bacteria image can be rotated to generate several orientation versions. Figure 4(a) shows eight absolute value images of a typical rod-shaped E. coli rotated in steps of 45 degrees. Pantoea bacteria have a similar rod shape, but slightly different in size from E. coli. Figure 4(b) shows one of the selected Pantoea in eight different rotated versions.

To demonstrate the performance of the 3D nonlinear filter, we train the system to detect E. coli bacteria with 42 images, including 25 E. coli and 17 Pantoea images, and the filer is tested with the complex amplitude image in Figure 2. Figure 5 shows the 3D image of the 3D filter output. Figure 6 reports the projection of the output volume onto a plane. It can be seen that most of the E. coli bacteria had been highlighted by correlation peaks and the Pantoea had been ignored. However, a small portion of the E. coli cannot be detected; this is because the training set with limited number of reference images does not represent all the distortions and orientations of the bacteria. It is expected that classification rate can be improved if more reference images are included in the training set.

4. Conclusion

This paper describes 3D pattern recognition with a 3D nonlinear filter applied to monochromatic optical fields that can be recorded and reconstructed by holographic microscopes. The 3D extension and formulation of the nonlinear filter concept has been introduced. We have shown with experimental data that the 3D nonlinear filtering system provides additional capability as a means to perform 3D pattern recognition in a shift and rotationally invariant means. We demonstrate this in practice by applying the 3D nonlinear filter to a holographic recording of the light scattered from two kinds of living bacteria suspended in water. The experimental data demonstrated that the 3D nonlinear filter has good shift and rotationally invariant property in 3D space.

Acknowledgment

Financial support from The Research Fund for the Doctoral Program of Higher Education (No. 20122302120072) to initiate this research is gratefully acknowledged.