[Retracted] Urban Ecological Monitoring and Prediction Based on Deep Learning

Yang, Jun He; Chan, Chia Pang

doi:https://doi.org/10.1155/2022/1973777

Wireless Communications and Mobile Computing

On this page

Abstract Introduction Related Work Conclusions Data Availability Conflicts of Interest References Copyright Related Articles

Research Article Retraction

!

This article has been Retracted. To view the article details, please click the ‘Retraction’ tab above.

Special Issue

Artificial Intelligence for Wireless Communications and Control Networks

View this Special Issue

Research Article | Open Access

Volume 2022 | Article ID 1973777 | https://doi.org/10.1155/2022/1973777

[Retracted] Urban Ecological Monitoring and Prediction Based on Deep Learning

Jun He Yang^1,2and Chia Pang Chan^3,4

Academic Editor: Junjuan Xia

Received18 Feb 2022

Accepted19 Mar 2022

Published13 Apr 2022

Abstract

Obtaining comprehensive and accurate air quality information is conducive to people’s daily travel and living arrangements, especially to protect people’s health from air pollutants. Due to the limited number of air quality monitoring stations and the lack of training samples, the generalisation performance of air quality estimation model is often not good enough. Therefore, we propose an urban air quality index (AQI) prediction and AQI level estimation method based on deep multi-task learning. We consider various urban big data information related to air quality (meteorology, transportation, enterprise self-test, POI, road network, etc.), and use machine learning methods such as deep learning and graph embedding learning to learn the representation of relevant information, and establish the relationship between these related representations and air quality. Experiments show that this scheme can estimate the level of urban air quality index joint prediction task and air quality index, and the model has generalisation performance.

1. Introduction

With industrial development and population expansion, various harmful substances are emitted into the air, causing air pollution. Having comprehensive and accurate information on air quality helps to protect the ecological environment and human health from the dangers of air pollution [1]. Many cities have set up air quality monitoring stations to monitor air quality in real time. However, the number of air quality monitoring stations is limited. It is not possible to know with certainty the air quality at locations where no air quality monitoring stations have been set up, and air quality monitoring stations cannot predict future air quality [2]. Therefore, estimating air quality in areas without monitoring stations and predicting future air quality at monitoring stations can provide comprehensive air quality information in space and time, helping people to rationalise their travel plans and assisting relevant organisations in making environmental decisions. Existing air quality estimation methods are based on urban data related to air quality (e.g., meteorological, road network, and POI) to establish the relationship between air quality in two areas, thus using the air quality in areas with monitoring stations to estimate the air quality in areas without monitoring stations [3]. However, existing air quality estimation methods do not adequately consider data related to air quality, and do not take into account information such as company self-measurement and the angle of relative location between areas. Secondly, existing work has defined and extracted features manually, which are often incomplete and take a lot of time to design and verify their validity [4]. Also, due to the limited number of monitoring sites and the lack of training samples, the models often fail to achieve good generalisation performance [5].

To address these problems, this paper proposes a deep multi-task learning-based urban air quality prediction and estimation method, which integrates a variety of air quality-related urban big data information (meteorology, enterprise self-measurement, road network, POI, etc.), learns the representation of this related information through deep learning and machine learning methods such as graph embedding learning, and improves the generalisation performance of single-task models by combining air quality prediction and estimation tasks through multi-task learning [6].

2.1. Air Quality Estimates

Existing air quality estimation methods include mainly physical dispersion simulation-based methods and statistical and data-driven methods.

Physical dispersion simulation-based methods estimate the distribution of pollutants by simulating the pattern of pollutant dispersion [7, 8]. Linear statistical model-based methods use linear models such as spatial interpolation or land-use regression combined with land-use related characteristics to estimate air quality. For example, the kriging-based air quality estimation method [9] uses a spatial interpolation method called fuzzy genetic linear member kriging to estimate the geospatial distribution of air quality. The land-use regression-based approach [10] uses a land-use regression model to establish the relationship between land-use related characteristics (e.g., land use, traffic patterns, and population density) and air quality.

As linear statistical models are unable to establish non-linear relationships between urban air quality and land-use related characteristics, non-linear statistical models are also widely used in air quality estimation [11]. Established methods based on non-linear statistical models include those based on supervised learning and those based on semi-supervised learning [12].

Methods based on supervised learning include those based on generalised additive models and those based on Gaussian process regression. For example, [13] used generalised additive models to establish the relationship between air quality and the relevant explanatory variables. [14] used Gaussian process regression to establish relationships between characteristics such as traffic flow, population density, temperature, and air quality. Such non-linear models also fail to achieve good estimation performance due to the limited number of training samples [15]. [16] proposed an air quality estimation method based on a collaborative training algorithm, which combines a variety of data such as meteorology, traffic flow, road network structure, and POI in cities, and uses conditional random fields and artificial neural networks to model the relationship between relevant features and urban air quality. [17] used unlabelled data to increase the number of training samples through the collaborative training algorithm; however, the collaborative training algorithm did not control the noise introduced during the iterative training process, and therefore, the model could not achieve better results.

2.2. Air Quality Forecasting Methodology

Existing air quality prediction methods are divided into those based on physical dispersion modelling and those based on statistical and data-driven methods.

Methods based on physical dispersion modelling predict the distribution of pollutants by simulating the pattern of pollutant dispersion, such as Gaussian models [18], dynamic street canyon models, and computational fluid dynamics [19], which mostly use functions related to meteorology, street geography, receptor location, traffic flow, and dispersion factors to simulate the dispersion of pollutants. However, such methods usually require empirical assumptions to be met and parameter settings are not generalisable.

Linear statistical modelling-based approaches use linear models to model the linear relationship between air quality over its own time series or with other characteristics to make predictions about future air quality. For example, [20] used an autoregressive sliding average model to model the trend in air pollutant concentrations over their own time series to predict future average air pollutant concentrations. For example, [12] used polynomial regression combined with meteorological data to predict daily maximum concentrations and [13] used kernel regression combined with meteorological data to predict daily maximum concentrations. [14] used artificial neural networks and linear regression to predict future air quality in the area to be predicted by combining meteorological and historical air quality from the area to be predicted and from the surrounding air quality monitoring stations.

3. Learning of Regional Non-Temporal Information Representation Based on Graph Embedding Methods and Convolutional Neural Networks

The POI category and density of an area tend to reflect the land use and traffic patterns of the area, and are directly or indirectly related to the air quality of the area; for example, areas with factories tend to have poorer air quality, and areas with parks tend to have better air quality. Road network structure and traffic patterns are strongly correlated, with traffic-generated emissions being a source of urban air pollutants, and the road network structure to some extent reflecting the air quality of the region. The traditional method extracts relevant features such as road network and POI, simply counts the number of various types of POI contained in each area and the length of various types of road network as relevant features, ignoring the hierarchical information between different categories of POI/road network, and when there are more categories of POI, the statistical features extracted by the traditional method are rather sparse. The graph embedding method LINE is a method of embedding an information network into a low-dimensional vector space by representing each vertex in the network with a vector in the low-dimensional space. In this paper, the LINE method is used to embed the non-temporal information of each region into a low-dimensional vector by combining the information network graph consisting of coordinates, road network, and non-temporal data such as POI of all regions to be estimated and regions to be predicted. In order to extract more non-temporal features relevant to AQI prediction and AQI class estimation, the low-dimensional vector of non-temporal information is further processed by a convolutional neural network to extract local features, and the output of the convolutional neural network is used as the final regional non-temporal information representation for subsequent AQI prediction and AQI class estimation tasks.

3.1. Learning Non-Temporal Information Representation of Regions Based on the Graph Embedding Method LINE

Definition 1. Area-region diagram. The region-area diagram , representing the physical distance relationship between grid regions, is shown in Figure 1, where denotes the set of grid regions, denotes the set of edges between grid regions, and each grid region and grid region is separated by an edge , with edge weight defined as the physical distance between and .

Definition 2. Area-POI diagram. The region-POI diagram , representing the distribution of POIs within a grid influence region, is shown in Figure 2, where denotes the set of grid regions, denotes the set of POI categories, and denotes the set of edges of grid regions and POI categories. If the grid region has POIs of category within the grid influence region, an edge exists between grid region and POI categories; the edge weight is defined as the number of POIs of category contained within the grid influence region of .

Definition 3. Area-road network diagram. The region-road network diagram represents the distribution of road sections within the grid influence area, as shown in Figure 3, where denotes the set of grid regions, denotes the set of road section categories, and denotes the set of edges between grid regions and road section categories. If there are road sections of category within the grid influence area of grid region , there exists an edge between grid region and road section category , and the edge weight is defined as the total length of road sections of category , contained within the grid influence area of . According to the Chinese urban planning guidelines, this paper classifies urban roads into four classes as shown in Table 1.

For the three diagrams defined above, using the LINE method to learn the low-dimensional vector representation of all vertices in each diagram, the diagrams Gdd, Gdp, and Gdr correspond to the objective functions L(Gdd), L(Gdp), and L(Gdr), respectively, and the total objective function as shown in the following equation.

In the calculation reference formula of , by optimising the objective function , the low-dimensional vector representation of all vertices in the graph can be obtained; that is, the low-dimensional vector representation of non-temporal information of each region can be obtained, which represents the vector space of , and the connection relationship of regional location, POI, and road network in the graph can be embedded into the low-dimensional vector, in order to extract more non-temporal information related to AQI prediction and AQI level estimation tasks.

3.2. AQI Forecasting Tasks

In this paper, multiple complex factors related to air quality are taken into account when predicting the AQI of a region to be predicted: historical meteorological, historical air quality, historical traffic, historical business self-measurement, and weather forecast data of the region to be predicted; non-temporal data such as coordinates, P0I, and road network of the region to be predicted; and historical meteorological and historical air quality data of the region of global influence. A graph embedding method and a CNN are used to learn the representation of this relevant non-temporal information, and a recurrent neural network is used to learn the representation of this relevant temporal information, and based on these relevant representations, the AQI of the area to be predicted at multiple times in the future [21–23].

The output LSTM is called the output LSTM, based on and . This paper uses a one-layer LSTM to predict the AQI at future moments. First, all the vectors in and are joined together to obtain a linear transformation and the output of the linear transformation is non-linearly processed by the tanh function to obtain as shown in equation (2), as the initial hidden state information of the output LSTM.

Similarly, all the vectors in are concatenated to obtain , and after a linear transformation and non-linear processing, is obtained as the initial memory cell information for the output LSTM.

When the input sequence of the input recurrent neural network is too long, the output LSTM is often not good enough when the output LSTM is initialized with only the output of the input recurrent neural network at the last moment, so in this paper, when using the output LSTM to predict the AQI at z future moments, an attention mechanism is introduced, and at each step of generating the sequence of AQI prediction values, the input recurrent each position of the input sequence of the neural network is searched, and the most relevant part of the output LSTM at the current moment is selected to calculate the context vector, and the AQI prediction value at the current moment is calculated based on this context vector. Let denote the output of the output LSTM at moment , and let denote the context vector associated with the long-term sequence of weather data at moment for which the attention mechanism has been introduced, as shown in the following equation. where represents the hidden state output of the input recurrent neural network at moment when a long-term sequence of weather data is used as the input sequence of the input recurrent neural network, represents the weight of , score is a scoring function to calculate the correlation between the hidden state output of the input recurrent neural network at moment and the output of the output LSTM at/moment, and is the weighted sum of the hidden states of the input recurrent neural network at all moments.

4. Experimental Comparison of Relevant Urban Air Quality Estimation Methods

The results of the different urban regional air quality estimation methods are shown in Table 2. The results demonstrate that for the Semi-EP method, its average classification accuracy is highest at fc =3, which is consistent with the results of [15]. The results in Table 2 show that the performance of the proposed method is significantly better than that of U-Air and Semi-EP on the dataset used in this paper, which may be due to the fact that, compared to U-Air and Semi-EP, features are defined and constructed manually and the proposed method uses graph embedding methods to classify the features. The non-temporal and temporal feature representations learned by the proposed method using graph embedding and deep neural networks are more representational [24, 25].

The FFA method uses a linear regressor to model the relationship between local characteristics such as historical air quality, current weather, and weather forecasts at the site to be predicted and the future air quality at the site to be predicted, and an artificial neural network to model the relationship between current weather and global characteristics such as historical air quality at neighbouring monitoring sites and the future air quality at the site to be predicted. The results of the two models are then integrated using a regression tree. For the classification of the global impact space, the same approach as for FFA is adopted. The results of the different AQI prediction methods are shown in Table 3. The results in Table 3 show that the mean absolute error of the proposed method is smaller than the mean absolute error of the FFA method. This may be due to the fact that the method proposed in this paper establishes a better non-linear relationship between the air quality of the site to be predicted and the local factors than the FFA method, and the change in the mean absolute error of the FFA method is greater than that of the method proposed in this paper as the distance between the time to be predicted and the current time increases. This may be due to the fact that the relevant feature representations learned by the proposed method using graph embedding and deep neural networks are more representational and take into account the previous AQI prediction when predicting the null AQI at a certain time, introducing more information.

5. Conclusions

This paper proposes a deep multi-task learning-based approach for urban AQI prediction and AQI class estimation. On the one hand, a variety of urban big data information related to air quality (meteorology, traffic, enterprise self-measurement, POI, road network, etc.) is considered, and machine learning methods such as deep learning and graph embedding learning are used to learn representations of relevant information and establish relationships between these relevant representations and air quality, so as to estimate AQI levels in areas without air quality monitoring stations and to estimate AQI levels in areas with air quality monitoring stations, AQI prediction for regions with air quality monitoring stations.

Data Availability

The experimental data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declared that they have no conflicts of interest regarding this work.

References

R. Janarthanan, P. Partheeban, K. Somasundaram, and P. N. Elamparithi, “A deep learning approach for prediction of air quality index in a metropolitan city,” Sustainable Cities and Society, vol. 67, article 102720, 2021.
View at: Publisher Site | Google Scholar
S. Weichenthal, M. Hatzopoulou, and M. Brauer, “A picture tells a thousand…exposures: opportunities and challenges of deep learning image analyses in exposure science and environmental epidemiology,” Environment International, vol. 122, pp. 3–10, 2019.
View at: Publisher Site | Google Scholar
Z. Han and A. Xu, “Ecological evolution path of smart education platform based on deep learning and image detection,” Microprocessors and Microsystems, vol. 80, article 103343, 2021.
View at: Publisher Site | Google Scholar
X. Li, L. Peng, Y. Hu, J. Shao, and T. Chi, “Deep learning architecture for air quality predictions,” Environmental Science and Pollution Research, vol. 23, no. 22, pp. 22408–22417, 2016.
View at: Publisher Site | Google Scholar
D. Qin, J. Yu, G. Zou, R. Yong, Q. Zhao, and B. Zhang, “A novel combined prediction scheme based on CNN and LSTM for urban PM2.5 concentration,” IEEE Access, vol. 7, pp. 20050–20059, 2019.
View at: Publisher Site | Google Scholar
L. Liu and Y. Zhang, “Smart environment design planning for smart city based on deep learning,” Sustainable Energy Technologies and Assessments, vol. 47, article 101425, 2021.
View at: Publisher Site | Google Scholar
A. Heydari, M. Majidi Nezhad, D. Astiaso Garcia, F. Keynia, and L. De Santoli, “Air pollution forecasting application based on deep learning model and optimization algorithm,” Clean Technologies and Environmental Policy, vol. 24, 2021.
View at: Google Scholar
A. Bekkar, B. Hssina, S. Douzi, and K. Douzi, “Air-pollution prediction in smart city, deep learning approach,” Journal of big Data, vol. 8, no. 1, pp. 1–21, 2021.
View at: Publisher Site | Google Scholar
Y. Jiang, C. Li, L. Sun, D. Guo, Y. Zhang, and W. Wang, “A deep learning algorithm for multi-source data fusion to predict water quality of urban sewer networks,” Journal of Cleaner Production, vol. 318, article 128533, 2021.
View at: Publisher Site | Google Scholar
Q. Yuan, H. Shen, T. Li et al., “Deep learning in environmental remote sensing: achievements and challenges,” Remote Sensing of Environment, vol. 241, article 111716, 2020.
View at: Publisher Site | Google Scholar
S. Christin, É. Hervet, and N. Lecomte, “Applications for deep learning in ecology,” Methods in Ecology and Evolution, vol. 10, no. 10, pp. 1632–1644, 2019.
View at: Publisher Site | Google Scholar
E. F. Sherley, A. Kumar, Revathy, and Divyashree, “Detection and prediction of land use and land cover changes using deep learning,” Communication Software and Networks, Springer, Singapore, pp. 359–367, 2021.
View at: Publisher Site | Google Scholar
Z. Chen, H. Liu, C. Xu et al., “Modeling vegetation greenness and its climate sensitivity with deep-learning technology,” Ecology and Evolution, vol. 11, no. 12, pp. 7335–7345, 2021.
View at: Publisher Site | Google Scholar
M. Zamani Joharestani, C. Cao, X. Ni, B. Bashir, and S. Talebiesfandarani, “PM2. 5 prediction based on random forest, XGBoost, and deep learning using multisource remote sensing data,” Atmosphere, vol. 10, no. 7, p. 373, 2019.
View at: Google Scholar
H. Che and J. Wang, “A nonnegative matrix factorization algorithm based on a discrete-time projection neural network,” Neural Networks, vol. 103, pp. 63–71, 2018.
View at: Publisher Site | Google Scholar
H. Che and J. Wang, “A collaborative neurodynamic approach to global and combinatorial optimization,” Neural Networks, vol. 114, pp. 15–27, 2019.
View at: Publisher Site | Google Scholar
H. Che, C. Li, X. He, and T. Huang, “A recurrent neural network for adaptive beamforming and array correction,” Neural Networks, vol. 80, pp. 110–117, 2016.
View at: Publisher Site | Google Scholar
T. Pu, H. Cai, G. He, Y. Luo, and M. Wu, “An air quality prediction model based on deep learning and wavelet analysis considering the COVID-19 pandemic factors,” in 2021 IEEE International Performance, Computing, and Communications Conference (IPCCC), pp. 1-2, Austin, TX, USA, 2021.
View at: Publisher Site | Google Scholar
X. Ning, F. Nan, S. Xu, L. Yu, and L. Zhang, “Multi-view frontal face image generation: a survey,” Concurrency and Computation: Practice and Experience, no. article e6147, 2020.
View at: Publisher Site | Google Scholar
C. Wang, X. Wang, J. Zhang et al., “Uncertainty estimation for stereo matching based on evidential deep learning,” Pattern Recognition, vol. 124, 2022.
View at: Publisher Site | Google Scholar
Z. Yuan, X. Zhou, and T. Yang, “Hetero-convlstm: a deep learning approach to traffic accident prediction on heterogeneous spatio-temporal data,” in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 984–992, 2018.
View at: Google Scholar
W. Zhi, D. Feng, W. P. Tsai et al., “From hydrometeorology to river water quality: can a deep learning model predict dissolved oxygen at the continental scale?” Environmental Science & Technology, vol. 55, no. 4, pp. 2357–2368, 2021.
View at: Publisher Site | Google Scholar
Z. H. A. N. G. Zhengwan, Z. H. A. N. G. Chunjiong, L. I. Hongbing, and X. I. E. Tao, “Multipath transmission selection algorithm based on immune connectivity model,” Journal of Computer Applications, vol. 40, no. 12, p. 3571, 2020.
View at: Google Scholar
X. Zhang, M. Zhao, and R. Dong, “Time-series prediction of environmental noise for urban IoT based on long short-term memory recurrent neural network,” Applied Sciences, vol. 10, no. 3, p. 1144, 2020.
View at: Publisher Site | Google Scholar
G. Grekousis, “Artificial neural networks and deep learning in urban geography: a systematic review and meta-analysis,” Computers, Environment and Urban Systems, vol. 74, pp. 244–256, 2019.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2022 Jun He Yang and Chia Pang Chan. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

423

Downloads

561

Citations

Wireless Communications and Mobile Computing

Artificial Intelligence for Wireless Communications and Control Networks

[Retracted] Urban Ecological Monitoring and Prediction Based on Deep Learning

Abstract

1. Introduction

2. Related Work

2.1. Air Quality Estimates

2.2. Air Quality Forecasting Methodology

3. Learning of Regional Non-Temporal Information Representation Based on Graph Embedding Methods and Convolutional Neural Networks

3.1. Learning Non-Temporal Information Representation of Regions Based on the Graph Embedding Method LINE

3.2. AQI Forecasting Tasks

4. Experimental Comparison of Relevant Urban Air Quality Estimation Methods

5. Conclusions

Data Availability

Conflicts of Interest

References

Copyright