Abstract

A clear understanding of the spatial distribution of earthquake events facilitates the prediction of seismicity and vulnerability among researchers in the social, physical, environmental, and demographic aspects. Generally, there are few studies on seismic risk assessment in United Arab Emirates (UAE) within the geographic information system (GIS) platform. Former researches and recent news events have demonstrated that the eastern part of the country experiences jolts of 3-5 magnitude, specifically near Fujairah city and surrounding towns. This study builds on previous research on the seismic hazard that extracted the eastern part of the UAE as the most hazard-prone zone. Therefore, this study develops an integrated analytical hierarchical process (AHP) and machine learning (ML) for risk mapping considering eight geospatial parameters—distance from shoreline, schools, hospitals, roads, residences, streams, confined area, and confined area slope. Experts’ opinions and literature reviews were the basis of the AHP ranking and weighting system. To validate the AHP system, support vector machine (SVM), decision tree (DT), and random forest (RF) classifiers were applied to the datasets. The datasets were split into 60 : 40 ratio for training and testing. Results show that SVM has the highest accuracy of 79.6% compared to DT and RF with a “predicted high” precision of 87.5% attained from the model. Risk maps from both AHP and ML approaches were developed and compared. Risk analysis was categorised into 5 classes “very high,” “high,” “moderate,” “low,” and “very low.” Both approaches modelled relatable spatial patterns as risk-prone zones. AHP approach concluded 3.6% as “very high” risk zone, whereas only 0.3% of total area was identified from ML. The total area for the “very high” (20 km2) and “high” (114 km2) risk was estimated from ML approach.

1. Introduction

Earthquakes are considered short-term calamities that exert a significant long-term impact on human lives, infrastructure, and the economy that can last for decades or longer [1, 2]. The severity of a tremor can range from light, i.e., nearly no impact, to sufficiently strong to destroy means of livelihood [3]. Tremors are defined on the basis of their epicenter’s geographic location, magnitude, frequency, duration, and onset speed. The seismic actions are measured based on an earthquake’s frequency and magnitude occurring within a certain period. These hazards contribute to severe vulnerability in terms of loss of human life, society, and economy. The vulnerability of built-up areas to earthquakes and other natural hazards is a consequence of construction methodology and the quality of materials [4].

The United Arab Emirates (UAE) experiences low seismic activities, and their hazards [5] are categorised as insignificant by various researchers [6]. With the country’s growing population, dramatic infrastructure development has occurred over the last 30 years. Numerous studies have indicated that the UAE lies in low-seismicity zones, and no massive fatalities from earthquakes have been reported yet. However, in the recent few years, the country experienced earthquake jolts that were primarily concentrated in its eastern part, i.e., the Fujairah Emirate [7]. The UAE has designed infrastructure codes to manage any anticipated earthquake efficiently [811]. Few studies have been carried on earthquake risk management for the country within the geospatial platform. Therefore, it is crucial to analyse the earthquake risk within the country to elude any havoc for future mitigation. This research focuses on developing a technique to ascertain most risk-prone zones in the Fujairah city and surrounding towns in UAE.

Generally, seismicity and vulnerability hazard assessments are conducted by traditional theoretical and empirical algorithms and conventional decision-making processes based on earthquake magnitude and intensity. However, over recent years, remote sensing (RS), geographic information system (GIS), and machine learning (ML) techniques are evolving for earthquake risk assessment. Geotechnical, structural, and social-economic are some of the key points to investigate the earthquake [3, 1214]. Many studies [1527] have combined spatial statistical techniques and the analytical hierarchy process (AHP) integrated with a GIS to investigate seismic hazards, vulnerability, and their associated risks to humans and the environment.

A contemporary study [28] on earthquake risk assessment has been carried in northeast India. The study focussed on distance from railway, railway density, distance from landuse, landuse density, distance from buildings, and building geospatial density layers. These layers were integrated with AHP [29, 30] and convolutional neural network (CNN) to microzonate risk. Three major regions were identified to be likely experiencing higher intensity events and therefore is more risk prone in the southern part of the state. Model developed utilizing CNN showed an accuracy of 0.94 and a precision of 0.98. Another novel research [31] assessed seismic vulnerability in Tehran, Iran. The factors such as peak ground acceleration (PGA), slope, construction (material, quality), population, employment status, open spaces, road network, fire stations, hospitals, gas pipes, and gasoline stations were integrated into radial basis function (RBF) and teaching–learning-based optimization (TLBO) to optimize weights of each factor. Their findings reflected that PGA has a higher liquefaction probability, therefore, higher seismic vulnerability in the region. A contemporary study [3] utilized the population data, anticipated seismic events, infrastructure aggregate, elevation, and earthquake hazard events of the UAE and then integrated AHP to analyse the hazard’s adversity. This study reported that the northern part of the UAE is more hazard-prone and vulnerable than its southern part. The authors deduced that several infrastructure elements only have a minimal degree of protection from seismic activities, and consequently, the seismic design practice is still in its nascent stages in the UAE. We investigated the most commonly used factors and techniques in the literature (Table 1). The following parameters are most utilized for seismicity: PGA, soil type, geology, distance from active faults, epicenter, and slope. Whereas for vulnerability, the most utilized parameters are a distance from residences, hospitals, streams, roads, landuse, landuse density, and topography.

This research utilized AHP supported by ML techniques to zonate seismic vulnerability for Fujairah City and surrounding towns in UAE. Towards generating the seismic hazard map five parameters such as PGA, fault distances, slope, soil, and geology were considered. To analyse the vulnerability, eight parameters such as distances from the shoreline, schools, hospitals, roads, residences, confined areas, streams, and slope were considered. The generated seismic hazard map was then multiplied with vulnerability map to adopt the risk map finally. The primary objectives of the current study can be summarised as follows: (i) investigate risk-prone zones for disaster management within UAE; (ii) analyse the seismicity in the UAE; (iii) identify and map topography, hydrology, and distance from residences, streams, and other parameters for earthquake risk assessment; (iv) employ and compare AHP and ML techniques for preparing a risk map of Fujairah City and its surrounding towns.

The following section provides a brief explanation of the study area. Thereafter, a case study of the UAE is presented to understand the high seismic-prone zones. The subsequent section analyses factors and techniques related to risk assessment from AHP and ML point of view. After the results are provided, the paper summarizes the study’s significant findings and provides suggestions for future research.

2. Study Area

The UAE is located in the eastern part of the Arabian Peninsula, and it shares its borders with Saudi Arabia in the southwest and Oman in the East. It sits on the eastern part of the Arabian Plate (Figure 1), close to the collision zone of the Arabian and Eurasian Plates. Topographically, the country is covered with 95% plains, mostly constituted by desert, with 5% mountainous regions. The Hajjar Mountains are located in the eastern part of the country, where elevation rises up to 2000 m.

Rugged terrains mostly cover eastern Emirates, such as Fujairah and Ras-al Khaimah. With an area of 1450 km2, Fujairah is the fifth largest Emirate. It is situated in the eastern part of the UAE, bordering the Gulf of Oman. The Fujairah Emirate has been the most seismically active region in the UAE, having experienced tremors of magnitude 5 in March 2002 and magnitude 2.2 in September 2011 in the Masafi region [3, 13, 36]. The Al Dibba region also recorded tremors in November 2009 [13, 36, 37]. Although these incidents did not lead to any casualties, researchers are focusing on the risk assessment of the region due to the rapid pace of urbanisation. Primary risk assessment is narrowed down to the eastern part of the Fujairah Emirate, covering the densely populated cities of Fujairah, Kalba, Al Aqdah, Hail, Al Bithnah, and Qurayya, which are regions near the seashore. These cities lie in an area which is bounded between mountains and sea as shown in Figure 2(b) and 2(c). The mountains enclosing Fujairah City and its surrounding areas are generally more than 200?m until 1000?m high.

The current study utilized historical earthquake data to determine the risk-prone zones in UAE. An earthquake catalogue that contains the date, time, latitude, longitude, and magnitude was used as reference; it covers 13,156 events from 1900 to 2015 [36, 38]. It includes the entire Arabian Plate and its neighbouring territories, i.e., every significant hazard for the Arabian Plate [38, 39]. Table 2 lists historical earthquake events and some of their associated attributes [38].

3. Methodology

Figure 3 represents the methodology of this study. The study has been constructed into two parts: (a) seismic analysis of UAE using AHP and demarcating the seismic prone zone based on historic earthquake events, (b) risk analysis for the most hazard-prone zone within UAE. Soil, geology, distance from faults, slope (percent), and PGA parameters were paired with AHP and weighted overlay in ArcGIS Pro to zonate seismic prone regions in UAE. Investigation of earthquake risk assessment was supported consuming parameters such as in situ data of built-up areas, roads network, hospitals, schools, ocean shoreline, and digital terrain model (DTM) were processed to attain a required thematic layer of each parameter, respectively, in the GIS framework. Then, integration of AHP and ML models (support vector machine (SVM), decision tree (DT), and random forest (RF)) paired with weighted overlay facilitated obtaining a risk map. Intensive literature review and expert opinion were the basis for weights and ranks. Pairwise comparison matrix validated the weights of each parameter in the AHP technique. 500 random points were generated and processed in ML platforms as 60 to 40 ratio training and testing datasets. The ML model helped to modify the weights of each criterion. Weighted overlay was applied to develop the risk map.

3.1. Assessment of Seismic Prone Zones of UAE

This section describes the geospatial thematic layers utilized to demarcate seismic-prone zones of UAE paired with AHP technique in GIS environment. Also, to validate the AHP technique, a pairwise comparison matrix was also developed.

3.1.1. Geospatial Parameters for Hazard Assessment

Five thematic layers, namely, PGA, soil classification, distance from fault, slope percentage, and geology, were considered in seismic hazard-prone zonation. Each layer exhibits a correlation with earthquake hazard. PGA is the maximum ground acceleration observed during an earthquake. Previous earthquake seismicity research [9, 40] observed a decline in PGA as distance increases from the epicenter. In the current research, given that no major seismograph events occurred in the UAE, ground acceleration was calculated using the ground motion prediction equation that considers all the historical earthquake events in the entire Arabian Plate from previous researches [36, 41]. As stated in [12], only two major faults exert a direct seismicity effect on the UAE: the Zagros Fold [14] and the thrust belt in Makran zone [3, 11]. In this study, PGA has been established employing attenuation relation by the following equation [41, 42].

where is PGA cm/sec2; is earthquake magnitude moment; is hotspot distance (km); (.399), (-.0019), and (1) are constants of Zagros horizontal component used for this study; is site class; is site condition; is standard deviation; and is constant (0,1).

The thematic layer of distance from faults exhibits an inverse relationship with seismicity. Fault lines were extracted from Landsat 8 satellite images of 30 meters spatial resolution, and then, the Euclidean distance was calculated. Near distances of up to 200 km from fault lines were considered the most seismic prone [41]. The soil layer comprises torripasmments, calciorthids, saliorthids, torrifluvents, gypsiorthids torriorthents, and salorthids. Torripasmments being clay rich is not at risk for seismicity [36], whereas saliorthids, calciorthids, and torrifluvents are more seismically prone in the UAE than the other soil classes [3, 41]. A slope spatial map was derived from the (Advanced Spaceborne Thermal Emission and Reflection Radiometer) ASTER digital elevation model (DEM) with a resolution of 3 m. Although most of the land shares similar slopes, higher steep slopes are found in the north-eastern part of UAE. Therefore, slopes with >30° were considered under higher seismic zones [36]. The geology thematic layer was prepared from Landsat 8 satellite images by applying supervised classification. The layers contain sand, alluvium, limestone, metamorphic rocks, gabbro, and ophiolite. Sand has the least compactness, and thus, it was considered the most seismic prone [36]. Literature review and experts’ opinion were the basis of weights for all seismic hazard parameters (Table 3). Weights and ranks were validated using the AHP technique by preparing a pairwise comparison matrix. AHP is discussed in the next section.

3.1.2. AHP for Hazard Assessment

Each parameter was evaluated using Saaty’s AHP [18, 24, 34, 41], [43]. Saaty’s AHP is a decision-making procedure based on each criterion and alternatives [44]. The parameters were assigned with weights in accordance with the rank of their suitability and importance. The AHP technique consists of three major steps [45]. In the initial step, the decision-making problem was divided into a hierarchical structure that consists of all the parameters. Several factors were utilized to create a hierarchy of the primary goal of identifying hazard-prone areas. The next step was to establish decision tables for each hierarchy level. The matrices denoted pairwise comparisons (PC-matrices) by using comparable data. A nine-point scale was used for comparison, or alternatively, actual data can also be used if available [24]. The nine-point scale includes 9, 8, 7, …, 1/7, 1/8, 1/9, where 9 indicates extreme preference, 7 indicates very strong preference, 5 indicates strong preference, and so on down to 1, which means no preference. An independent evaluation of each factor’s contribution was made due to the pairwise comparison, which helped simplify the decision-making process [18]. The pairwise comparisons were arranged in a square matrix, with the diagonal elements being 1. The relative importance of the criteria was determined by calculating the principal eigenvalue and the corresponding normalized right eigenvector of the comparison matrix. The elements of the normalized eigenvector were weighted with reference to the criteria or subcriteria and rated with respect to the alternatives [18]. Then, an evaluation of the consistency of the matrix of order was performed on the basis of Equations (2)–(4) [41, 43]. where is the consistency index, is the randomised index, is the consistency ratio, and is the order of the compression matrix.

Pairwise comparison matrix was utilized to validate the weights of the parameters (Table 4). To validate the consistency of the model, should be <10% [18]. The of this model was calculated as 3%, validating the ranking and weighting criteria as true. After validating the ranking technique, the weights were assigned in ArcGIS by utilizing the weighted overlay tool to prepare the output. The seismic hazard map of the UAE was reclassified into five zones: very high, high, moderate, low, and very low (Figure 4) [38].

From the analysis of Figure 4, the very high hazard zone is clearly found in Fujairah City, a highly populated region with an area of 202 km2. Approximately 11% of the UAE, including areas within Fujairah, Ras Al Khaimah, and Sharjah, fell within a high hazard zone. Sharjah and Dubai lie from high to moderate zones, whereas Abu Dhabi is located considerably far from seismic hotspots and lies within the low seismic hazard zone. In the long term, risk analysis, contingency strategies, land use action plans, and relief measures should be considered and promoted in these areas for critical disaster management.

3.2. Risk Assessment of Fujairah City and Neighbouring Areas

This section describes the vulnerability and risk-prone zones of Fujairah City and its adjacent towns, the associated factors, and techniques. Finally, the risk map is developed using AHP and the weighted overlay, and also ML techniques have been employed to understand the parameters for the risk associated with earthquake hazard.

3.2.1. Vulnerability Assessment of Fujairah City

Vulnerability is the possible impact of a particular hazard on a community and its environment [46, 47]. It is included within the risk framework. Following the United Nations (UN) (2004), risk has two major components. The first component is the hazard itself, the damaging event, human activity, or phenomenon characterised by location, intensity, frequency, and probability. The second component is vulnerability, which defines the hazard severity’s interdependency and its potential degree of damage [46]. The UN (2004) defined risk assessment with the help of Equation (6) [47].

Different types of earthquake vulnerability are influenced by the selection and mapping of each criterion, and the study and incorporation of these criteria are considered in an efficient earthquake vulnerability mapping process [18, 26, 27, 48].

3.2.2. Geospatial Parameters for Risk Assessment

The vulnerability of a particular area to earthquakes can be predicted using spatial and temporal components. Spatial layers vary depending on the location, nature, and boundary conditions of different regions [45]. Layers, such as built-up, transportation networks, hospitals, and school locations, were first extracted from OpenStreetMap (OSM) in vector format and then georeferenced. The calculation of the Euclidean distance helps in assessing earthquake vulnerability [4]. DTM downloaded from the website of the United States Geological Survey for the region of Fujairah was used to extract a slope map and a stream order. The shoreline of Fujairah was demarcated using the base map in ArcGIS Pro. Further details of the layers are discussed in the succeeding sections.

(1) Spatial Euclidean Distances. In the current study, Euclidean distances from the shoreline, schools, roads, and hospitals were calculated to determine the hazard’s vulnerability. (i)Shoreline. The vulnerability effect decreases as distance from the shoreline increases [49]. An earthquake of high intensity will likely aggravate the adjacent water body (the Gulf of Oman in this case), leading to higher waves, and eventually, floods or tsunami, affecting close areas. The shoreline was spatially mapped utilizing Landsat 8 images. Euclidean distance was then calculated to obtain the thematic layer as presented in Figure 5(a). The layer was reclassified by assigning the highest ranks to a distance of approximately 3 km from the shoreline. The degree of vulnerability decreases as the distance from shoreline increases, with the least effect occurring at distances of more than 9 km, and the most vulnerable areas are those with distances from shore less than 3 km. A buffer of 0.5 km was restricted while reclassifying(ii)Schools. The vulnerability assessment was inversely proportional to a school’s distance because open spaces, such as school playgrounds, are considered evacuation areas during a disaster [9]. The Euclidean distance thematic layer was prepared using a school’s locations from OSM, as shown in Figure 5(b). The farther the assigned distance from schools, the higher the ranking. In this study, the degree of vulnerability increases as the distance from schools increases with the least effect occurring when schools are at an easily accessible distance of less than 2 km and the most vulnerable when distances are more than 6 km(iii)Roads. Similar to schools, roads are also considered evacuation areas during hazards [36]. The farther the distance from roads, the higher the risk of vulnerability, refer to Figure 5(c). Road network shapefile database was prepared utilizing OSM, and then Euclidean distance was calculated. Therefore, the highest rank was assigned to roads that are located at a distance of more than 3.5 km. In this study, the degree of vulnerability increases as the distance from roads increases with the least effect occurring when roads are at a distance less than 1.5 km and the most vulnerable when distances are more than 3.5 km(iv)Hospitals. Open spaces at a hospital’s boundaries are considered evacuation areas during hazards and provided to medical facilities [36]. Euclidean distances were calculated using a hospital’s point shapefile which was obtained from OSM, refer to Figure 5(d). The farther the distances from hospitals, the higher ranks were assigned in accordance with Saaty’s AHP approach, signifying higher vulnerability. In this study, the degree of vulnerability increases as the distance from hospitals increases with the least effect occurring when hospitals are at a distance less than 2 km and the most vulnerable when distances are more than 9 km(v)Distance from Residences. Residential areas were considered highly vulnerable and were assigned the highest rank [2]. This thematic layer was prepared using the polygon shapefile of residential areas utilizing dataset from OSM, refer to Figure 5(e). Areas without residences were assigned the lowest rank

(2) Topographic Factors. The topographic factors considered in the current study were slope and confined area. The thematic layers for both parameters were created using a DEM of the Fujairah Emirate scaled at 30 m spatial resolution. (i)Slope. The slope thematic layer is presented in Figure 5(f). The risk of vulnerability increases as slope increases. The slope map was derived utilizing DEM from USGS website. Five classes were considered when reclassifying the slope layer by using the natural breaks (Jenks) technique. Over the years, built-up areas have expanded along the foothills of the Fujairah Emirate. Steeper slopes are more prone to landslides [2, 27, 46], contributing to additional hazards after a disaster to built-up areas constructed close to foothills. The degree of vulnerability increases as the angle of slope increases, with the least effect occurring at angles less than 10 degrees and the most vulnerable areas are those with slope angles more than 30 degrees(ii)Confined Areas. In the current research, confined areas are considered spaces between mountains and the sea. The confined areas were demarcated by utilizing Landsat 8 images, and subsequently, raster files have been obtained. As observed from the DEM thematic layer, i.e., Figure 5(g), the mountains’ foothills begin at an elevation of approximately 450 m above sea level, and their peak can be observed up to 1008 m. The space covering elevation below 450 m and within the proximity to the shoreline and streams was considered confined space. Confined spaces are more vulnerable to earthquakes as compared to nonconfined spaces

(3) Hydrology Factors. Stream orders were obtained for Fujairah City by using the hydrology tool in ArcGIS, as shown in Figure 5(h). Stream order is one of the important parameters for assessing vulnerability and creating a risk map. (i)Distance from Streams. Areas near streams are more susceptible to risk because a water body will interfere with adjacent built-up areas, causing additional harm to livelihood, refer to Figure 5(h). Streams increase the vibration and lubrication of soil. Hence, built-up areas close to streams exhibit higher chances of collapsing, aggravating the risk [4, 46]. Strahler method [50] was implied to extract the stream order represented in Figure 5(h). Third-order streams are constituted in the eastern part of the study area. Euclidean distance tool was applied to extract the distance from streams thematic layer. Areas near streams were assigned higher ranks in the AHP ranking system. The degree of vulnerability decreases as the distance from streams increases with the least effect occurring at distances of more than 3.5 km, and the most vulnerable areas are those with distances from streams less than 0.5 km

3.2.3. Pairwise Analysis of Parameters

Similar to the seismic spatial analysis discussed in the previous sections, vulnerability assessment was also performed using the AHP approach. Literature review and expert opinions were the basis for weights of each parameter, refer to Table 5. Weights were assigned in a square matrix that represents diagonal elements as 1 on the basis of several vulnerability assessment studies [4, 17, 27].

Each parameter’s weights helped develop a pairwise comparison (Table 6) to understand and validate relative weights among each parameter. The matrix helped to calculate, , and , by utilizing the equations mentioned in Section 3.1.2 to validate the proposed risk assessment model. The resulting was also 3%, validating the model as a good one for the ranking and weighting system.

3.2.4. Machine Learning (ML) Analysis of Parameters

ML techniques are boon for modern-day research in all the scientific domains [28, 33, 5156]. It allows the input data to read, analyse, and train up to maximum accuracy compared to any traditional approaches. Several studies [19, 32, 51, 57] of earthquake risk assessment have utilized ML techniques paired with traditional approaches. This study utilizes three ML models: SVM, DT, and RF to classify most risk-prone zones within Fujairah and its surrounding towns. The research established 500 random points across the study area to train the ML algorithms. Each thematic layer developed for this study was the independent parameter, and the potential risk was the dependent parameter for the ML models. The raw data was first preprocessed to remove any null values or outliers. This is essential so that the ML model is able to train and learn properly. The next important step is to split the data in order to remove the bias from the training process of the ML algorithm. Often, the ML algorithms fit too tightly on the training data, leading to incorrect predictions on the test data.

(1) Support Vector Machine (SVM). SVM is the first ML algorithm utilized to analyse the earthquake datasets. SVM is one of the most regularly utilized supervised learning algorithms for classification and regression analysis and provides practical learning tasks. The SVM takes the input data and predicts the class for each data. For this study, 60% of the dataset was used for training, and 40% was utilized as a test dataset. The classification is performed by identifying hyperplane boundaries between the classes such that the boundary lines are as far as possible from the classes. By using the dot kernel type, the weights for the attributes are also obtained [33, 51, 58, 59]. The hyperplane is constructed using the following function:

where is the attributes at each instance and is the weights. The SVM model in this study performed 79.6% accuracy.

(2) Decision Tree (DT). A decision tree is a supervised learning algorithm that identifies the essential parameters that can help in classification. The rules are worked out based on the structure of the data. The tree starts with a highly influential attribute as the root node and successive rules are applied to move to the next attributes until a leaf node or terminal node is reached [60, 61]. The data set is split repeatedly from the coarsest attributes to the finest attributes. DT have the advantage of generating a visually easy to interpret model. The DT model in this study gave an accuracy of 78.9%. The optimal depth of the tree was obtained to be 4 with an error rate of 16%.

(3) Random Forest (RF). RF is one of the most used supervised learning algorithms in ML. A RF is a group of random decision trees where each node splits the dataset based on a particular parameter. Only a few attributes are considered for the selection at each node. This selection is specified as a parameter while designing the model. Based on the splitting rules at each node, the dataset is classified among the possible outcomes. New nodes are continuously built till a criterion is satisfied at which point the tree terminates. Each tree results in a single outcome, and the final outcome of the RF is the average of all the individual trees [62]. While splitting at a node, RF uses the best attribute among all the attributes rather than selecting the most important attribute. RF gave an accuracy of 78.2% in this study. The optimal parameters were obtained at a tree depth of 4 and the number of trees as 20 with an error rate of 16.3%.

4. Results and Discussion

The earthquake risk was estimated and mapped by two approaches, AHP and ML, spatially (refer to Figures 6(a) and 6(b)). The map was categorised on the basis of ordinal scale into 5 classes: “very high,” “high,” “moderate,” “low,” and “very low.” Though both the maps depicted a similar pattern for adversity, marginal differences in each class area were estimated, refer to Table 7. As can be seen from Table 7, the area under “very high” category in AHP was 12 times more than in ML. Similarly, in “very low” category, ML has almost twice the area than AHP. The “very high” risk zones in the AHP output map were categorised as “high” risk zones in the ML output map. Additionally, two more zones were identified as “high” risk zones in the ML output which were not observed in the AHP output. Three central locations were identified as belonging to the “very high” risk category according to the AHP output Figure 6(a).

The risk map was obtained by multiplying hazard and vulnerability. Three ML models—SVM, DT, and RF—were applied to the dataset using Rapidminer software. 60% of the dataset was categorised as training and 40% as testing for ML models. SVM presented the highest accuracy of 79.6% (Table 8) compared to the DT (78.6%) and RF (78.2%). The SVM model draws a standard deviation of ±6.5%. Table 9 represents the confusion matrix accuracy of the same model. The True "high" accuracy is almost 87% representing the training and testing datasets. SVM facilitated the assignment of weights to each parameter in a more accurate way. The total area for “very high” and “high” risk was estimated to be 20 km2 and 114 km2, respectively, for the ML output. These zones are identified to be very close to streams and shoreline and are classified as confined areas.

Figure 7 represents the comparison of the weight of each input parameter derived from AHP and ML techniques. A key difference in weights can be observed from both techniques. In the AHP model, utmost importance to roads, confined areas, and residential areas have been allotted. The ML approach validated the AHP technique by concluding the highest weights to similar parameters. However, minute upswing in weights for the residential area can be seen for AHP. ML approach moderately assigned more weightage to evacuation centres like schools and hospitals. Also, the weights for distance from shoreline were increased to two times in the ML technique.

The following points were concluded from both maps (Figure 6): (i)Zone A portrays one of the very high-risk zones from both approaches. It is situated in Fujairah’s north-eastern part, within 1 km from the shoreline, and is a compact industrial area with large oil storage tankers, as shown in Figure 2(a). The area is bounded by mountains of height 180?m (above sea level) in the west, as shown in Figure 2(b), and the Gulf of Oman in the east, as shown in Figure 2(c). Being an industrial region, it is far from evacuation areas, such as the open spaces of schools and hospitals which are located at 4?km and 7?km, respectively, thereby, posing a hurdle for evacuation during times of disaster.(ii)Zone B shows identical patterns in both the outputs. AHP has a higher proportion of area under “very high” than ML has under “high” risk. Like Zone A, this zone is also a confined area with mountains of 150 m height in the west and has a compact built-up of residences, as shown in Figures 2(b) and 2(d). It is considerably closer to streams at a distance of 2 km and 6 km far from schools.(iii)Zone C is a combination of confined and residential areas, making it a very high-risk zone (refer to Figure 6(a)). This zone is close to the Gulf of Oman and approximately 3 km from the shoreline, in the Kalba region. This region categorised as “very high” in AHP was observed to be in the “high” category in ML.(iv)Zone D is another high-risk zone, with similar contributing major parameters as Zones A, B, and C, i.e., closeness to residential and confined areas. Other parameters that play a role are its close proximity to streams (within 4 km) and distance from schools (2 km).(v)Zone E is categorised as a low-risk zone. Although the region is closer to the shoreline at a distance of 5 km, the contributing parameters were roads and schools located at a distance less than 2 km. Moreover, this zone is located approximately 4 km away from streams. These factors contribute to the “low” risk categorization of this zone, validating the AHP ranking and weighting approach.(vi)Zone F lies within 1.5 km range from streams and has a moderately higher slope of 35-40 degrees. The region is farther from evacuation centres such as hospitals and roads making it a “moderate” risk zone, Figure 6(a), whereas it is classified as a “high” risk zone in Figure 6(b)(vii)Zone G is closer to Zone E. As the weights are slightly higher for the distance from schools in the ML technique, the output showed more percentage of areas to be “high” risk as compared to the AHP technique. Also, this zone has been identified to be close to streams at a distance less than 1.5 km, unlike Zone E.

The above discussion also leads the scientific society to investigate and study the consequences of coseismic secondary effects [63, 64]. With respect to the present research, the secondary effects might arise from tsunamis, landslides, liquefaction of soil, faults, and cracks through mountains, oil spillage from industry belts, collapse of high rise residential buildings, and fire hazards due to natural gas or oil spillage [63]. These effects might result in the compounding of the earthquake hazard and might lead to more widespread calamities and human life destruction. In the event of future occurrence of an earthquake, the area has a higher possibility of being affected by tsunami as a coseismic effect despite no fatalities in the study area until now [37].

5. Conclusion

This study represents an effort to assess UAE’s vulnerability to seismic activities. Although the UAE has not been directly affected by any major earthquakes to date, the eastern part of the country has experienced high-magnitude (3-5 M) tremors [3, 9, 12, 13, 37, 49], and thus, an earthquake vulnerability assessment is necessary. Spatial statistical techniques obtained from a previous study [32, 57] were utilized to determine the earthquake event pattern over the Arabian Plate and locate hazard-prone areas in the UAE. These techniques helped determine that most high-hazard events are observed in the northern belt of the Arabian Plate covering the Zagros Mountains of Iran. PGA, distance from faults, slope percent, soil classification, and geology were the parameters integrated in Saaty’s AHP to determine seismic-prone zones in UAE. One of the major outcomes from the seismic hazard map was that the eastern part of the UAE is more likely seismically prone, particularly Fujairah City and adjacent towns, such as Kalba, Al Aqdah, Hail, Al Bithnah, and Qurayya. Subsequently, the study evaluated the hazard risk and charted integrated AHP and ML techniques to obtain the risk map. Three ML techniques (i.e. SVM, DT, and RF) were attempted, and accuracy of each was intercompared. T Another major accomplishment in the study is that the SVM model showed the highest accuracy of 79.6% with 60% of the dataset as a training dataset and 40% as testing dataset. The SVM-generated weights were utilized to validate and revise the AHP weights for vulnerability parameters. Finally, the weighted overlay technique facilitated to development of the risk map and categorised the risk zones into very high, high, moderate, low, and very low. Risk map obtained from both approaches AHP and ML was compared. The parameters utilized for the risk assessment were the distance from the shoreline, schools, hospitals, roads, residences, streams, and confined areas. Confined areas and compact built-up regions with residences or industries located closer to the shoreline or streams were the most vulnerable. Schools, hospitals, and roads were considered evacuation areas during hazards. A shorter distance from vulnerable areas to these evacuation areas is more favourable because of their open spaces. The farther the distances of the evacuation areas, the higher the risk. The region with low vulnerability was identified to be located at a distance of approximately 2 km and 5 km from schools and the shoreline/streams, respectively. Approximately, 20 km2 and 114 km2 were estimated to lie under “very high” and “high” risk zones, respectively, in ML. The ML approach demonstrated results in a more refined way and also aided in validation of the conventional AHP approach. The methodology developed in this research will assess seismic-prone areas and the risk associated with earthquake hazard. This approach can be utilized to deal with disasters and is beneficial for the disaster management of a country, such as the UAE. It can also be applied to other geographies.

Data Availability

The data used to support the findings of this study are available from the first author upon request and upon approval of the data source.

Conflicts of Interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

Authors’ Contributions

D.A.-D. and R.A.-R. developed the concept and the methodology of the study. D.A.-D., R.A.-R., K.S., and S.M. conducted the spatial processing to develop the required thematic layers and also carried out the AHP weighting approach for vulnerability analysis. D.A.-D., R.A.-R., K.S., S.A.-M., and S.M. conducted spatial processing and carried out the AHP weighting approach for vulnerability analysis. D.A.-D., R.A.-R., K.S., B.K., and S.A.-A. contributed in selecting the most seismically active region. D.A.-D., R.A.-R., K.S., S.A.-M, S. M, B.K., S.A.-A., and H.A.-A contributed in selecting the most vulnerable region after hazard. R.A.-R., B.K., H.A.-A., and N.U edited, restructured, and professionally optimized the manuscript. D.A.-D., R.A.-R., B.K., K.S., S.A.-M, S.M.,H.A.-A., and N.U. prepared and reviewed the manuscript.