Human-Designed Filters May Outperform Machine-Learned Filters

Paul C Mocombe

Research Article


Gengsheng L Zeng^1,2*

¹Utah Valley University, Orem, Utah, 84058, USA

²University of Utah, Salt Lake City, Utah, 84108, USA

Corresponding Author

Received Date: November 10, 2022; Published Date: November 22, 2022

Abstract

Machine-learned image processing systems in medical imaging have shown better results than those obtained by traditional human-designed techniques. The success of machine learning techniques inspires humans to design better systems. The convolutional neural network (CNN) has a multi-channel architecture, which conventional filters do not have. This paper proposes that, by borrowing the multi-channel architecture, a human-designed denoising filter can perform better than its machine-learned counterpart. We illustrate the feasibility of this idea with a toy example: a sinogram denoising task in tomography.

Keywords: Data science; Denoising; Image processing; Machine learning; Nonlinearity; Signal processing

Introduction

Machine learning is believed to have the potential to outperform conventional technologies [1], and it is one of the most exciting technologies of today. In the medical imaging industry, the FDA has approved machine learning image reconstruction techniques for clinical use, because these techniques can provide images with less noise and higher spatial resolution [2].

We believe that the superior performance of machine learning methods is due to their nonlinearity. Classical methods, on the other hand, are mostly based on linear models, such as the well-known filtered back-projection (FBP) algorithm. If the objective function is quadratic, the iterative gradient descent algorithm is also a linear algorithm. When some nonlinearity is introduced into medical image reconstruction, the reconstructed images gain desirable properties that linear methods cannot provide. For example, total variation (TV) constrained image reconstruction is a nonlinear method that is able to remove noise while maintaining sharp edges [3]. Some imaging problems can be modeled as compressed sensing problems, and their solutions rely on nonlinear L₁ optimization [4].
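The linearity of gradient descent for a quadratic objective can be made explicit. The following is a sketch under assumed notation that does not appear in the original text: a system matrix A, measured projections p, a fixed step size α, and a zero initial image.

```latex
% Assumed notation (illustrative only): A = system matrix, p = measured data,
% \alpha = fixed step size, x^{(0)} = 0.
\begin{align}
F(x) &= \tfrac{1}{2}\,\lVert A x - p \rVert_2^2, \\
x^{(k+1)} &= x^{(k)} - \alpha A^{\mathsf T}\bigl(A x^{(k)} - p\bigr)
           = \bigl(I - \alpha A^{\mathsf T} A\bigr)\,x^{(k)} + \alpha A^{\mathsf T} p, \\
x^{(k)} &= \alpha \sum_{j=0}^{k-1} \bigl(I - \alpha A^{\mathsf T} A\bigr)^{j} A^{\mathsf T} p .
\end{align}
```

Under these assumptions, every iterate is a fixed matrix applied to the data p, so the reconstruction is a linear function of the measurements regardless of the number of iterations.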

The power of deep learning relies on the nonlinear activation function in every layer of the neural network. Without the nonlinear functions, the entire neural network degenerates to a one-layer linear network. This paper is inspired by the architecture of neural networks. We present a toy example in which a human-designed filter adopts the architecture of a neural network.
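The collapse of a network without activation functions can be checked numerically. The sketch below is illustrative only: the random kernels, the image size, and the helper `conv2d_full` are assumptions, not part of the original study. It uses full (untruncated) convolution so that the equality holds exactly, including at the borders.

```python
import numpy as np

def conv2d_full(x, k):
    """Full 2D convolution: output size is input size plus kernel size minus one."""
    rows, cols = x.shape
    kh, kw = k.shape
    out = np.zeros((rows + kh - 1, cols + kw - 1))
    for i in range(kh):
        for j in range(kw):
            out[i:i + rows, j:j + cols] += k[i, j] * x
    return out

def relu(x):
    return np.maximum(x, 0.0)

rng = np.random.default_rng(0)
f = rng.normal(size=(16, 16))   # hypothetical input image
k1 = rng.normal(size=(3, 3))    # hypothetical first-layer kernel
k2 = rng.normal(size=(3, 3))    # hypothetical second-layer kernel

# Two linear layers applied in sequence (no activation function).
two_layer_linear = conv2d_full(conv2d_full(f, k1), k2)

# One layer whose 5 x 5 kernel is the convolution of the two 3 x 3 kernels.
one_layer = conv2d_full(f, conv2d_full(k1, k2))

# The same two layers with a ReLU in between.
two_layer_relu = conv2d_full(relu(conv2d_full(f, k1)), k2)

linear_gap = np.max(np.abs(two_layer_linear - one_layer))
relu_gap = np.max(np.abs(two_layer_relu - one_layer))
print(f"gap without ReLU: {linear_gap:.2e}, gap with ReLU: {relu_gap:.2e}")
```

Without the ReLU, the two layers agree with a single 5 × 5 filter up to rounding error; with the ReLU, no single linear kernel reproduces the output.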

Methods and Results

The convolutional neural network (CNN) is widely used in image processing [5], and its U-net variant is popular in segmentation applications [6]. A distinctive feature of the CNN is its use of multiple channels. The development of a neural network today is still empirical: the network architecture and hyperparameters are tuned by trial and error.

The toy example in this paper is a sinogram denoising task carried out with computer simulations. The detector had 64 bins, and 360 views were acquired over 360°. The two-dimensional (2D) image reconstruction algorithm was FBP. Zero-mean Gaussian noise was added to the sinograms. The CNN was trained on 1000 random sinograms and tested on 10 random unseen phantoms. The testing phantoms contained some small dots that were not generated in the training data. The convolution kernels in the CNN were all 3 × 3. The nonlinear activation function in the CNN was the rectified linear unit (ReLU) function. The number of training epochs was 100. Some typical random phantoms are shown in Fig. 1; they contain ellipses of various sizes, locations, and intensity values. The testing errors are reported as the mean squared error (MSE) in the sinogram domain, computed with respect to the noiseless true sinograms. In our studies, the bias term in the neurons was not effective for the denoising task and was therefore omitted from all neural networks in this paper (Figure 1).
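The noise model and the error metric described above can be sketched as follows. This is a simplified illustration, not the authors' simulation code: it uses a single off-center disk instead of random ellipses, and the noise level `noise_sigma`, the disk parameters, and the function names are hypothetical, since the original text does not state them.

```python
import numpy as np

N_VIEWS, N_BINS = 360, 64  # 360 views over 360 degrees, 64 detector bins

def disk_sinogram(cx, cy, radius, density):
    """Analytic parallel-beam sinogram of a uniform disk centered at (cx, cy)."""
    theta = np.deg2rad(np.arange(N_VIEWS))[:, None]          # view angles
    s = (np.arange(N_BINS) - (N_BINS - 1) / 2.0)[None, :]    # bin positions
    s0 = cx * np.cos(theta) + cy * np.sin(theta)             # projected center
    half_chord_sq = np.maximum(radius**2 - (s - s0)**2, 0.0)
    return 2.0 * density * np.sqrt(half_chord_sq)            # chord length x density

def mse(estimate, reference):
    """Mean squared error in the sinogram domain."""
    return float(np.mean((estimate - reference) ** 2))

rng = np.random.default_rng(0)
clean = disk_sinogram(cx=5.0, cy=-3.0, radius=10.0, density=0.02)

noise_sigma = 0.05  # hypothetical noise level, not given in the source
noisy = clean + rng.normal(0.0, noise_sigma, size=clean.shape)

noisy_mse = mse(noisy, clean)
print(f"MSE of the unfiltered noisy sinogram: {noisy_mse:.3e}")
```

Before any filtering, the MSE of the noisy sinogram is close to the noise variance, which gives a baseline against which a denoising filter can be judged.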


One-Channel Multilayer CNN

Table 1 shows some typical testing MSE values from the one-channel CNN experiments. The MSE values do not show a decreasing trend as the number of layers increases (Table 1).

As a comparison, Table 2 lists the results of the one-layer, one-channel network with various convolution kernel sizes. This network is equivalent to conventional linear filtering (Table 2).

Multi-Channel Two-Layer CNN

Table 3 shows the results of the two-layer CNN. The first layer has multiple channels; the second layer has one channel. Tables 1 and 3 show that a multi-channel shallow network performs better than a one-channel multi-layer network (Table 3).
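The forward pass of such a two-layer, multi-channel, bias-free network can be sketched as below. The weights here are random and untrained, and the helper names are hypothetical; the sketch only illustrates the architecture (3 × 3 kernels, ReLU after the first layer, no bias, no ReLU at the output), not the trained networks behind Table 3.

```python
import numpy as np

def conv2d_same(x, k):
    """Zero-padded 'same' cross-correlation with an odd-sized kernel,
    which is the operation most CNN libraries call convolution."""
    kh, kw = k.shape
    ph, pw = kh // 2, kw // 2
    xp = np.pad(x, ((ph, ph), (pw, pw)))
    rows, cols = x.shape
    out = np.zeros((rows, cols))
    for i in range(kh):
        for j in range(kw):
            out += k[i, j] * xp[i:i + rows, j:j + cols]
    return out

def two_layer_cnn(x, w1, w2):
    """First layer: one 3 x 3 kernel per channel followed by ReLU.
    Second layer: one 3 x 3 kernel per channel, summed into one output."""
    hidden = [np.maximum(conv2d_same(x, k), 0.0) for k in w1]
    return sum(conv2d_same(h, k) for h, k in zip(hidden, w2))

rng = np.random.default_rng(0)
n_channels = 2                                        # two first-layer channels
w1 = rng.normal(scale=0.3, size=(n_channels, 3, 3))   # untrained, illustrative
w2 = rng.normal(scale=0.3, size=(n_channels, 3, 3))   # untrained, illustrative

sinogram = rng.normal(size=(360, 64))                 # stand-in for a noisy sinogram
output = two_layer_cnn(sinogram, w1, w2)
print(output.shape)
```

Because there is no bias term, this network maps a zero input to a zero output and scales its output in proportion to any positive scaling of the input.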

Table 1: One-channel multi-layer network performance.

Table 2: One-layer network performance.

Table 3: Two-layer multi-channel network performance.

A Human-Designed Denoising Filter

The traditional human-designed filters have one layer and one channel. Inspired by the multi-channel CNN, we propose a two-channel filter, whose input image is f and output image is g. The input-output relationship is expressed as

g = h₁ ∗ [σ(h₁ ∗ f) − σ(h₂ ∗ f)],    (1)

where ∗ denotes 2D convolution, h₁ is a 2D low-pass filter convolution kernel, h₂ is a 2D high-pass filter convolution kernel, and σ is the ReLU function. The two terms inside the square brackets in (1) are considered as the first-layer neurons, and the convolution with h₁ at the end is the second layer. In our design, there is only one design parameter, u = 0.1, as shown in Fig. 2 (Figure 2).
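The two-channel structure described here can be sketched in a few lines. The sketch reads the description as g = h₁ ∗ [ReLU(h₁ ∗ f) − ReLU(h₂ ∗ f)]. The kernels themselves are defined in Fig. 2, which is not reproduced in this text, so the 3 × 3 kernels built by `make_kernels` below are hypothetical stand-ins parameterized by u, and the function names are likewise illustrative.

```python
import numpy as np

def conv2d_same(x, k):
    """Zero-padded 'same' filtering with an odd-sized symmetric kernel."""
    kh, kw = k.shape
    ph, pw = kh // 2, kw // 2
    xp = np.pad(x, ((ph, ph), (pw, pw)))
    rows, cols = x.shape
    out = np.zeros((rows, cols))
    for i in range(kh):
        for j in range(kw):
            out += k[i, j] * xp[i:i + rows, j:j + cols]
    return out

def relu(x):
    return np.maximum(x, 0.0)

def make_kernels(u=0.1):
    """Hypothetical kernels (NOT taken from Fig. 2 of the paper).
    h1 sums to 1 (low-pass); h2 sums to 0 (high-pass)."""
    h1 = np.full((3, 3), u)
    h1[1, 1] = 1.0 - 8.0 * u
    h2 = np.full((3, 3), -u)
    h2[1, 1] = 8.0 * u
    return h1, h2

def human_designed_filter(f, u=0.1):
    h1, h2 = make_kernels(u)
    low = relu(conv2d_same(f, h1))        # low-pass channel
    high = relu(conv2d_same(f, h2))       # high-pass channel (salt-like noise)
    return conv2d_same(low - high, h1)    # second layer: smooth the difference

# A flat background with a single salt-like spike in the middle.
f_spike = np.ones((9, 9))
f_spike[4, 4] = 2.0
g_spike = human_designed_filter(f_spike)
print(f"input at spike: {f_spike[4, 4]:.2f}, output at spike: {g_spike[4, 4]:.2f}")
```

With these stand-in kernels, the high-pass channel responds only at the spike, and subtracting it before the final smoothing pulls the spike back toward the background level.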

A typical set of outputs of the human-designed filter is shown in Fig. 3. The low-pass channel output is similar to that from a conventional linear filter, with the minor exception that the negative values are discarded. The high-pass channel captures the salt-like noise, which is removed by the negative sign when the channel is combined with the low-pass channel output. The high-pass channel is new and is not considered in a conventional denoising filter. The FBP reconstructions from the corresponding sinograms are shown in Fig. 4. In the test image evaluation, the MSE is 1.04 × 10^-4 for the human-designed filter and 1.42 × 10^-4 for its machine-learned counterpart (see the first row in Table 3). The human-designed filter is better in our toy example (Figures 3 and 4).

Conclusion

For a sinogram denoising task, this paper observes that the bias term in each neuron is unnecessary; the ReLU function is not necessary for the final output layer; using many layers may not help if there is only one channel; and using many channels may help if the network is shallow. We propose a human-designed 2-channel, 2-layer denoising filter for sinogram denoising (see Eq. (1)). The filter has only one design parameter, u (see Fig. 2). This human-designed filter is compared with a machine-learned 2-channel, 2-layer CNN (see the first row in Table 3). The human-designed filter has a smaller MSE than the machine-learned version. Our experiments are limited, and our claims may not be valid in more general situations. However, it is innovative to adopt the state-of-the-art neural network architecture for human-designed systems.

Acknowledgment

This work is supported by NIH grant 2R15EB024283.

Conflict of Interest

No conflict of interest.

References

1. D Dutton, G Conroy (1997) A review of machine learning. The Knowledge Engineering Review 12(4): 341-367.
2. (2019) FDA Clears GE's Deep Learning Image Reconstruction Engine.
3. L Ritschl, F Bergner, C Fleischmann, M Kachelriess (2011) Improved total variation-based CT image reconstruction applied to clinical data. Physics in Medicine & Biology 56(6): 1545-1561.
4. B Bernstein, S Liu, C Papadaniil, C Fernandez Granda (2020) Sparse recovery beyond compressed sensing: separable nonlinear inverse problems. IEEE Trans Inf Theory 66(9): 5904-5926.
5. R Yamashita, M Nishio, RKG Do, K Togashi (2018) Convolutional neural networks: an overview and application in radiology. Insights Imaging 9(4): 611-629.
6. O Ronneberger, P Fischer, T Brox (2015) U-Net: Convolutional networks for biomedical image segmentation. In: Navab N, Hornegger J, Wells W, Frangi A (eds) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015. Lecture Notes in Computer Science, vol. 9351. Springer, Cham.