Mini Review Article
Neural Network Forecast of the Total Electron Content of the Ionosphere and Optimization
Artem Kharakhashyan, Olga Maltseva*
Institute for Physics, Southern Federal University, Russia
Olga Maltseva, Institute for Physics, Southern Federal University, Rostov-on-Don, Stachki, 194, 344090, Russia
Received Date:March 13, 2025; Published Date:March 25, 2025
Abstract
Forecasting the total electron content of the ionosphere is important for many technological systems, and the use of neural networks allows to increase the accuracy of its forecast. In the previous work of the authors (RS, 2023), 10 models were proposed, which were divided into several groups of unidirectional and bidirectional recurrent and convolutional architectures. The most effective model was BiTCN. However, all proposed models, like most models in other works, used manual hyperparameter tuning. In the work (Chen et al. Biomimetic, 2024), attention was drawn to the role of the optimization procedure and several corresponding algorithms were considered. In the present paper, the Beluga Whale Optimization algorithm was combined with the most effective BiTCN model. Calculations were carried out for the most common forecasting times: 2h, 1d, 7d in advance. The results showed that despite the extremely low initial errors for the BiTCN model, the improvement in the forecast accuracy was achieved due to optimization..
Keywords:Total Electron Content, Global Positioning System, Bidirectional, BiTCN, BWO, Forecasting
Introduction
The information about the state of the ionosphere is very important because the ionosphere critically affects the propagation of radio signals which are widely used in GNSS applications, HF/ VHF/UHF radio communications, satellite control and operations, navigation timing, direction finding and positioning, spectrum management, radar detection, disaster warning and others [1]. The main parameter describing the state of the ionosphere is the total electron content (TEC) of the ionosphere, measured by GPS receivers using transmitters on satellites. The state of the ionosphere can be monitored and predicted using this parameter. In recent years, neural network methods have become prevalent in forecasting ionospheric parameters [2]. First of all, these are the classical LSTM [3] and GRU [4] methods, which, on the one hand, are used as refer ence methods for comparison with newly developed methods, and on the other hand, when combined with other architectures, they provide an increase in forecast accuracy. The most effective was the combination with bidirectional architectures. Papers [5-8] can be cited as examples.
In [5], a method combining random forest with a Bi-LSTM neural network is proposed. The random forest algorithm is used for regression analysis to select the most important input variables. The Bi-LSTM method was used for the prediction of TEC 1 h ahead on a dataset of 9 Chinese stations located near and along the 110- 120° meridians in 2021. The training dataset included values from DOY 001 to DOY 273; the validation dataset, from DOY 274 to DOY 334 in 2021; the test dataset, from DOY 335 to DOY 365. The results obtained are presented as RMSE for three latitude ranges: (1) 10°- 25° E, (2) 25°-40° E, (3) 40°-55° E. The RMSE were: (1) 2.56 TECU, (2) 1.34 TECU, (3) 0.93 TECU. The paper [6] concerns ТЕС map prediction and proposes an improved model, Mixed Convolutional Neural Network (CNN) - Bidirectional Long Short-Term Memory (BiLSTM), for predicting TEC in China. The training array included the longest (GIM)-TEC maps from 1998 to 2023 in China. The model utilizes historical TEC map data for the previous 72 hours, along with inputs including ap, Kp, Dst, AE, disturbance index, F10.7 index, and time factor, to predict the future 1-24 hours of TEC maps. The main focus was on the forecast during the disturbance. Results for 1h, 2h, 24 h ahead are presented. From full 702 storms, the train set comprised 552 geomagnetic storms, the validation set consisted of 100 storms, and the remaining 50 storms (27 minor and moderate storms and 23 strong storms) were used as the test set. The model performs well in short-term predictions, accurately capturing the occurrence, evolution, and classification of ionospheric storms. When the predicted length increases, the accuracy gradually decreases, and some erroneous predictions may occur. RMSE increases with the increase of predicted length and the enhancement of geomagnetic activity. The RMSE exhibits strong geographic dependence, with the maximum values occurring in the EIA region around magnetic latitudes of 20° to 30°. Under strong geomagnetic conditions, the RMSE reaches approximately 4 TECU, 8 TECU, and 10 TECU for 1 h, 12 h, and 24 h predicted lengths, respectively. Under quiet conditions, the RMSE is approximately 1 TECU lower than under moderate geomagnetic conditions and 2 TECU lower than under strong geomagnetic conditions. The paper [7] describes in detail all the stages of the Temporal Convolutional Network (TCN) method and compares its results with the results of LSTM, GRU, BiLSTM. The study uses TEC data of the global CODE map with 2h resolution for 6 mid-latitude stations and 6 low-latitude stations in the period 2017-2021. The data is divided into two parts (training, test) by the ratio of 9:1 and use the past 24 data to predict and verify the next data. It is noted that such parameters of the method as loss, learning rate, optimizer, timesteps, batch size, input dimension and others were selected by the grid search method. Among the input parameters, Dst index, latitude, longitude, SN, F10.7, Kp index, Ap index were used. Data for January 2017 - December 2020 were used for training, and data from 2021 were used for testing. It was found that the results depend on latitude. The mean average error of TCN (1.2385 TECU) is lower in most areas compared with LSTM (1.2727 TECU), GRU (1.2602 TECU) and BiLSTM (1.2767 TECU). The paper [8] proposes a method that is a combination of attention-based bidirectional long short-term memory and gated recurrent unit (Bi-LSTM GRU) to predict TEC and signal delay in the ionosphere. The training array included data from 2009-2017, the test array was from 2018 in Bangalore, India. The results are presented for ionospheric delay and 24 h and showed the advantage of the presented method over LSTM, Bi-LSTM, GRU. Moreover, for Bi- LSTM GRU, an experiment was conducted with the attention mechanism, which showed a better result than without this mechanism.
It should be noted that a common feature of many models and methods is manual assignment of hyperparameters. In the works [9-10], attention was drawn to the role of optimization procedures. The general approach is presented in [9], and this problem is considered in [10] in relation to TEC prediction. The paper [10] notes the growing popularity of deep learning models, but their requirement for a large number of hyperparameters is considered a drawback due to the complexity of their selection. This process can be accomplished using optimization algorithms, in particular, the Beluga Whale Optimization (BWO) algorithm that can be used to optimize hyperparameters of deep learning models. The paper [10] analyzed the drawbacks of BWO and proposed an improved BWO algorithm, named FAMBWO (Firefly Assisted Multi-Strategy Beluga Whale Optimization). This method was compared with 11 state-of-the-art swarm intelligence optimization algorithms on 30 benchmark functions, and the results showed that the improved algorithm had faster convergence speed and better solutions on almost all benchmark functions. This algorithm was then used to optimize the MA-BiLSTM model for TEC prediction and obtained significantly better results than the MA-BiLSTM model optimized by other algorithms. The aim of this work is to use the method [10] to optimize the most efficient BiTCN method from [11].
Materials and Methods
The experimental data included the TEC values of the JPL map calculated from the IONEX files (https//cddis.nasa.gov/archive/ gnss/products/ionex/) for the Juliusruh station (54.6° N, 14.6° E) and 2015. The values of the indices of solar and geomagnetic activity – proton density Np, planetary 10Kp index, disturbance storm time Dst, and solar flux F10.7, solar wind velocity Vsw were taken from SPDF OMNIWeb Service (http://omniweb.gsfc.nasa.gov/ form/dx1.html). Methods: (1) LSTM-FRF, (2) LSTM-RFF, (3) GRUFRF, (4) GRU-RFF, (5) TCN, (6) BiLSTM-FRF, (7) BiLSTM-RFF, (8) BiGRU-FRF, (9) BiGRU-RFF, (10) BiTCN are described in detail in [11], the additions related to the BiTCN optimization procedure are as follows.
All convolutional layers of the model were parameterized, and the following parameters were allocated for each layer: dilation factor (DF), number of filters, filter kernel size (Fsz). Due to the requirements for the tensor shape, the filter number was identical for all convolutional layers, so the final number of parameters was equal to 9 (one dilation factor parameter and filter kernel size parameter per Convolution1D Layer plus a single filter number parameter). Sine and cosine DOY and HOD components were included as additional indices for the initial model and optimized versions. Data preparation procedure is identical to [11]. Beluga Whale Optimization algorithm is applied during training for validation loss at the final training epoch (Figure 1).
Results and Discussion
Calculations were carried out for widely used forecast advance times: 2h, 1d, 7d. The results consisted of a comparison of the metrics: mean absolute error (MAE), mean absolute percentage error (MAPE), root-mean-square error (RMSE), averaged over half a year during a single step of 2h, 24h and 7 days advance time forecasts for different models. A comparison of the initial and optimized models is shown in Figure 2 for BiTCNnew, BiTCN-BWOinit, BiTCN-BWOcorr procedures.
It is evident that the forecast accuracy deteriorates with increasing advance time. Initial estimates for BiTCN-BWOinit were actually worse than estimates for BiTCNnew, where parameters were determined via trial and error. It was caused by the fact that BWO algorithm was not effective in determining the number of filters parameter. Therefore, we kept all the BWO estimates for parameters but corrected the number of filters via trial and error, again (BiTCN-BWOcorr). This allowed one to decrease the errors further, making the model more effective than BiTCNnew and BiTCN-BWOinit. Figure 3 compares the new models with the previously obtained results for bidirectional models [11].



It is evident that the corrected method improved the accuracy of the TEC forecast. It can be assumed that combination of such optimization with other methods and models can provide a much greater improvement.
Conclusion
Methods aimed at improving the accuracy of the forecast constantly appear in publications. This paper shows that the optimization algorithms proposed in [10] can improve the accuracy, but in order to recommend these algorithms for widespread use, it is necessary to weigh the advantages and disadvantages of these algorithms. The advantages of BWO include a good initial approximation for manual optimization. The method is scalable to a large number of parameters and reveals non-obvious dependencies. The disadvantages include a much longer calculation time, and the need for repeated verification, since not all parameters are optimized equally effective.
Acknowledgment
The research was carried out at Southern Federal University with the financial support of the Ministry of Science and Higher Education of the Russian Federation (State contract GZ0110/23- 10-IF).
Conflicts of Interest
No conflict of interest.
References
- Goodman J (2005) Operational communication systems and relationships to the ionosphere and space weather. Adv Space Res 36(12): 2241-2252.
- Radicella SM (2023) New Ways to Modelling and Predicting Ionosphere Variables. Atmosphere 14(12): 1788.
- Hochreiter S, Schmidhuber J (1997) Long Short-term Memory. Neural Computation 9(8): 1735-1780.
- Cho K, van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, et al. (2014) Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) PP. 1724-1734.
- Jiang Z, Zhang Z, He X, Li Y, Yuan H (2024) Efficient and accurate TEC modeling and prediction approach with random forest and Bi-LSTM for large-scale region. Adv Space Res 73(1): 650-662.
- Ren X, Zhao B, Ren Z, Xiong B (2024) Ionospheric TEC Prediction in China during Storm Periods Based on Deep Learning: Mixed CNN-BiLSTM Method. Remote Sens 16(17): 3160.
- Hou D, Liu F, Peng H, Gu Y, Tang G (2024) Temporal convolutional network construction and analysis of single-station TEC model. J Atm Solar-Terr Physics 262(2): 106309.
- Basavarajaiah S, Garudachar R (2024) Prediction of global ionospheric TEC using attention based bidirectional long short-term memory and gated recurrent unit. Bull Electr Eng Inform 13(4): 2797-2808.
- Jia H, Wen Q, Wu D, Wang Z, Wang Y, et al. (2023) Modified beluga whale optimization with multi-strategies for solving engineering problems. J Comput Des Eng 10(6): 2065-2093.
- Chen Y, Liu H, Shan W, Yao Y, Xing L, et al. (2024) Optimizing Deep Learning Models with Improved BWO for TEC Prediction. Biomimetics 9(9): 575.
- Kharakhashyan A, Maltseva O (2023) Comparison of the Forecast Accuracy of Total Electron Content for Bidirectional and Temporal Convolutional Neural Networks in European Region. Remote Sen 15(12): 3069.
-
Artem Kharakhashyan, Olga Maltseva*. Neural Network Forecast of the Total Electron Content of the Ionosphere and Optimization. Iris Jour of Astro & Sat Communicat. 1(5): 2025. IJASC.MS.ID.000522.
-
Total Electron Content, Global Positioning System, Bidirectional, BiTCN, BWO, Forecasting
-
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
- Abstract
- Introduction
- Monistic Elastic Hierarchies with Inelastic Perturbations
- NASA Experiments LLR and GPB can be Reinterpreted in Flatspace
- Why Einstein Finally Rejected the Schwarzschild Solution
- Euclidean Matterspace in Non-Dual Modification of Einstein’s Equation
- Communication Problems in Nonlocal Matterspace with Dissipation
- Conclusion
- References