Abstract

Developing models to analyze time series is a very sophisticated, time-consuming, but interesting experience for researchers. Commodity price component determination is challenging due to remarkable price volatility, uncertainty, and complexity in the futures market. This study aims to determine the components that drive the market price of commodity futures. This study utilized the decomposition methods, empirical mode decomposition (EMD), and variational mode decomposition (VMD), to analyze three commodity futures prices data: corn from agricultural products, crude oil from energy, and gold from industrial metal. We applied these techniques to decompose the daily data of each commodity price from different periods and frequencies into individual intrinsic mode functions for EMD and modes for VMD. We used the hierarchical clustering method and Euclidean distance approach to classify the IMFs and modes into high-frequency, low-frequency, and trend. Next, applying statistical measures, particularly, the Pearson product-moment correlation coefficient, Kendall rank correlation, and Spearman rank correlation coefficient, we observed that the trend and low-frequency parts of the market price are the main drivers of commodity futures markets’ price fluctuations. The low-frequencies are caused by special events. In a nutshell, commodity futures prices are affected by economic development rather than short-lived market variations caused by ordinary disequilibrium of supply-demand.

1. Introduction

Globally, crude oil, corn, and gold futures prices have great influence on worldwide economy. These commodities are prone to price fluctuations and have decisive impact on international economic activities as well as both consumers and producers [1], [2]. Price fluctuation is a primary concern as investors are anticipated to make satisfactory profit from their commodity futures market exchange, and the price studies of crude oil, corn, and gold can supply macroeconomic information to investors to minimize risks of loss and maximize profit. The futures market price determinants of these commodities are not affected by demand-supply only but also influenced by several components such as financial policies, growing economy, exacerbation of market speculation, and special events (eruption of war, outbreak of pandemic, 911 gruesome attacks, and 2008 and 2012 global debt crisis) [3]. The remarkable price volatility in the futures market of crude oil, corn, and gold makes it difficult to develop models to analyze the price movements of the aforementioned commodities. Prudent economic and investment decision-making, which can assist investors and policymakers to reduce the danger posed by price swinging, therefore, can only be achieved by accurately analyzing the futures market.

The United States is the leading consumer of corn in the world. In 2019/2020, the United States consumed about 12.30 billion bushels of corn, followed by China. China consumed about 10.98 billion bushels of corn in the same year. The European Union is the third largest consumers of corn globally [4]. These countries are regarded as the world leading economies, which imply that this agricultural commodity plays significant role in the daily planning of the economy of these three countries. According to the International Energy Agency (IEA) 2019 report, United States and China are the leading consumers of crude oil worldwide, which consumed about 19.4 million barrels per day and 14 million barrels per day, respectively. Gold as a precious metal is used as an indicator to hedge against inflation and measure wealth in the world. It is a metal which is fascinated by everyone and has maintained its value over centuries. Unlike crude oil, gold price is not determined by any organization, but mostly depends on the cost of extraction and how much people are willing to pay for it. Gold is expensive due to the high cost associated with its production and also exceeding popularity and can maintain constant weight all the time, which makes it price rise more often. As stated by the futures price discovery mechanism, these commodities are extremely affected by macroeconomic policies; therefore, they are useful in providing price information to spot markets. Hence, discovering the drivers of these commodities is expected not only to minimize the unreliability and decrease the danger in commodity markets but can also help to make strategic and reliable blueprints for government.

Crude oil, corn, and gold were chosen across the commodity price market to conduct this study, which aimed at identifying the underlying components that drive the futures prices’ fluctuations in energy, agriculture, and industrial metal because these commodities are regarded as the most volatile, interactive, and complex in the commodity price market and very sensitive to macroeconomic policies [4]. These commodities play a vital role in the world economy; hence, the futures prices of these commodities market price data are very important for any future development plan since there is strong mutual relationship between price, supply, and demand.

Inspired by the aforementioned reasons, we proposed an enhanced signal detector and EMD and VMD techniques, to decompose price series of crude oil, corn, and gold to establish the components that drive commodity futures spot markets’ prices. This paper utilizes EMD and VMD to analyze the components of futures prices of corn (from agricultural commodities section), crude oil (from energies section), and gold (from industrial metals’ section). By investigating the components that drive commodity futures prices of these selected items, this research contributes to the existing literature as follows:(1)We demonstrate that the EMD and VMD can reveal the underlying factors that causes corn, crude oil, and gold markets price fluctuations.(2)Using Hierarchical Clustering and Euclidean Distance Techniques, the IMFs and modes were grouped into their respective frequencies—high-frequency, low-frequency, and trend components.(3)Subsequently, different statistical measures were used to evaluate the contribution of each frequency to the net variance of the volatility of crude oil, corn, and gold markets’ price fluctuations.(4)The economic interpretations of these components were categorized as near-term fluctuations, as a result of ordinary disequilibrium of supply-demand, unpredictable special events, and a long-term trend. It can, thus, be concluded that commodity futures prices are driven by economic development since low-frequency and trend are dominant components of commodity prices’ series. From these processes, some strategic forecasting approaches are discussed in the conclusion.

The remainder of the paper is divided as follows: in Section 2, we review the relevant studies; we detail the data and introduce the EMD and VMD algorithms in Section 3; we discuss the empirical results in Section 4, where also the derived intrinsic modes by EMD and VMD are presented; Section 5 discusses and concludes our findings, where we also provide some possible explanations for the results and future developments.

Globally, energy, agricultural, and industrial metal products markets are complex, uncertain, volatile, and interdependent. As a result, crude oil, corn, and gold time series are nonlinear and nonstationary [5]. Therefore, determining the drivers of these commodities are demanding task [6]. Much research has been conducted into the analysis of commodity prices, and the usual methods used can be classified into two groups: data-driven approaches and structure-modeling techniques. Data-driven strategies consist of linear regression models such as autoregressive moving average (ARMA) and autoregressive conditional heteroscedasticity- (ARCH-) type models [7], [8] and nonlinear models such as backpropagation neural network (BPNN) and radial basis function neural network (RBFNN) [9]. The structure approach models profile commodity market prices and then uses schedule supply-demand equilibrium to examine the price volatility [10].

The mechanism of commodity price investigation and the factors that influence it can be modeled using structure models; however, the structure method is strenuous to apply due to the distinctive features of the commodity futures market. For instance, oil supplying is difficult to model because it is dispensed by two independent bodies, namely, the non-OPEC and OPEC nations. The non-OPEC countries have no control over oil prices, while the OPEC ones determine the production levels, using several factors, other than reserved capacity [11], Furthermore, the difficulty in modeling commodity futures prices is also due to the unsteady international market environment. The data-driven approaches have proven to be quite versatile for short-term predictions but do not have meaningful economic interpretations, as they cannot justify the inherent forces that cause commodity price movement.

Several studies have cropped up in the literature trying to develop models to study the price fluctuations of commodity futures spot market prices. Studies such as those by Zhu et al. [12], Wang et al. [13], Miao et al. [14], Boubaker and Raza [15], and Wang and Li [16] used singular spectrum analysis (SSA) and wavelet and combined with neural network models such as backpropagation neural network, radial basis function, and wavelet neural network to predict commodity futures prices. They used SSA and wavelet to decompose crude oil, corn, and gold prices series. The experimental results revealed that the neural network models combined with SAA perform better than the benchmark models.

Boubaker and Raza [15] employed the wavelet method to analyze mean and volatility spillovers between oil price and BRICS stock markets at different time intervals. They developed a combined model, VARM-GARCH-cDCC, to examine the effects of spillover on mean and variances of level prices at different time scales. Evidence from the investigation indicates that the information within the oil industry and volatilities have strong influence on oil price and stock market prices but have no effects on the volatilities of other prices. The results also revealed that the proposed VARM-GARCH-cDCC combined with wavelet decomposition predicts the oil price accurately. However, using SAA and wavelet as a data preprocess tool has some drawbacks. SSA and wavelet decompositions extract false cycles forming part of the series since choosing the function basis, and the decomposition levels are subjective. Wavelet is nonadaptive in nature.

An objective data analysis approach, such as empirical mode decomposition (EMD), proposed by Huang et al. [17], and variational mode decomposition (VMD), suggested by Dragomiretskiy and Zosso [18], can be used to address the challenges in modeling and in the absence of meaningful economic interpretations in market price of commodity futures.

EMD is empirical in nature; it is instinctive, straight, and robust data pretreatment technique developed, particularly, for nonlinear and nonstationary data preprocessing. The work of EMD is to break down the data into a discrete number of intrinsic mode functions (IMFs) in such a manner that the data is near periodic based on peculiar characteristics, called the “extrema,” which is the length between two sequential local extrema; thus, based on the scale, the meaning of each IMF and the interpretations can be established; for instance, an intrinsic mode associated with financial time series with a scale of three months is considered as a special event or seasonal event. Exploring data’s intrinsic modes is very important to understand economic interpretation since data is the only relationship we have with real life.

Researchers have applied EMD in studies on economic and financial data. It has been applied, for example, in modeling agricultural products (Wang et al. [13]; Abadan and Shabri [19]), electricity price (Xiong et al. [20]; An et al. [21]; Crosato et al. [22]), on exchange rates (Lin et al. [23]; Premanode and Toumazou [24]; Plakandaras et al. [25]), gold prices (Jian-Hui and Wei [26]; Hua and Jiang [27]; Owusu Junior et al.[28]), crude oil prices (Meng et al. [29]; Chen et al. [30]; Yu et al. [31]; Zhang et al. [32]; Wu and Huang [33], and on carbon prices (Zhu et al. [12]). Premanode and Toumazou [24] used differential empirical mode decomposition (DEMD) for improving forecasting of exchange rates using the support of vector regression (SVR). The simulation report showed that when DEMD integrated with the SVR model, it was more robust than the state-of-art Markov switching generalized autoregressive conditional heteroskedasticity (MS-GARCH) and Markov switching regressive (MSR) models.

Korotin et al. [34] used EMD and combined with the Hurst exponent to assess the effects of economic sanctions on ruble exchange rate in Ukraine. They based their argument on the theory of an effective market and reported that there was no direct impact of sanctions on the ruble exchange rate and concluded that the foreign exchange market has a long memory.

Zhu et al. [35] proposed EMD and combined with the counterfactual analysis to estimate the effects of China’s export policy on the price of tin. They used four different kinds of decompositions in their analysis and selected the EMD method as the best decomposition technique. The empirical results revealed that China’s export policy has no effect on tin price, but the abolition of export policy brought about an increase in supply of global tin market, such that there was downward trend in the tin prices in the long run, which resulted in the short-term fluctuation in the international tin market, and that market policy and price are effective response of trade policy.

Zhu et al. [36] proposed the EMD method to study the spillover effects between carbon market and electricity market. EMD was applied to decompose carbon and electricity prices into several modes with different frequencies and used conditional value at risk was used to detect the spillover effects between carbon market and electricity market. The experimental results indicate that the spillover effects of carbon market affect electricity market positively, while spillover effects of electricity market affect carbon market negatively. It further revealed that, for high and low frequency modes, the spillover effects of carbon market affect electricity market positive, and vice versa. For the case of intermediate frequency, there exist a negative bidirectional spillover effect between the two markets.

Dragomiretskiy and Zosso [18] introduced the VMD model to improve EMD and EEMD by reducing endpoints’ effect and mode mixing. VMD is a nonrecursive and bandwidth-limited optimization signal decomposition method, using the Wiener filtering and Hilbert transform; it can decompose nonlinear and nonstationary time series data. VMD assumes that each mode has a limited bandwidth of center frequency and minimizes the sum of estimated bandwidth of each mode and the sum of each is equal to the observed data; moreover, it is robust to noisy data.

VMD is developed to decompose nonlinear and nonstationary data and has, increasingly, attracted research attention in many fields. For instance, Lahmiri [37] employed VMD as a data preprocessing tool to predict intraday stock prices at different frequencies. The study revealed that the VMD combined with other models forecast stock better than single models.

Aneesh et al. [38] studied performance analysis of VMD over empirical wavelet transform (EWT) using six different categories of power for recognition of power quality disturbances. The report on the identification indicated that VMD was superior to EWT in terms of the trait extraction process and identification accuracy. Seo et al. [39] employed VMD to model daily rainfall-runoff, and they reported that VMD models were robust regarding efficiency and effectiveness.

Niu et al. [40] used VMD-LSTM to forecast the stock price index. VMD was proposed to decompose the actual series of stock price into different number of modes, and the result was compared to EMD-based hybrid models. They reported that the VMD-LSTM model improves the forecasting of the stock price index. The combined model outperforms the single models, and predicting precision of VMD-based models was better than the EMD-based models using complexity-invariant distance (CID) as a statistical measure in rating the predictability of the models.

Zhu et al. [5] analyzed the price series and volatility of natural rubber’s market using VMD as data preprocessing to decompose the rubber futures series from Shanghai Future Exchange into different modes. A hybrid VMD-BiGRU model was formulated to forecast the short-term rubber futures on the Shanghai Future Exchange. The empirical results revealed that the VMD approach emerged as a robust technique for analyzing natural rubber’s market and can detect the hidden factors associated with rubber market price fluctuations. Similarly, Huang and Deng [41] employed VMD to predict the crude oil prices series. VMD was combined with long short-term memory (LSTM) and moving window to form a hybrid model, VMD-LSTM-MW, to forecast monthly and daily crude oil price. They reported that the VMD-LSTM-MW model was superior to a single-energy based (SE). Jianwei et al. [42] predicted the movement of gold price futures fluctuations by utilizing VMD technique to decompose the original gold price series into different modes. A hybrid model was formulated based on independent component analysis (ICA) and gate recurrent unit neural network (GRUNN) approach, called VMD-ICA-GRUNN. The empirical results indicate that the combined method, VMD-ICA-GRUNN, predicts gold price accurately as compared to single benchmark models, such ARIMA, radial basis function neural network (RBFNN), long short-term memory neural network (LSTM), GRUNN, and ICA-LSTM. He et al. [43] proposed forecasting models, combined with VMD quantile regression neural network (QRNN), to predict three crude oil markets, the West Taxes Intermediate (WTI), the Brent, and the OPEC markets, and reported that the proposed hybridization approach to forecast oil price volatility market accurately and increasing forecasting ability of QRNN.

It is clear evidence that EMD and VMD framework have been used recently as data processing in predicting commodity market prices. However, none of the studies analyzed the underlying components that drive commodity futures prices, and as a result, this current study proposes two decompositions, EMD and VMD techniques, as a data pretreatment method to identify the underlying factors that causes fluctuations in crude oil, corn, and gold markets. This paper introduces a new signal decomposition technique, VMD and EMD, to decompose crude oil, corn, and gold prices series into several modes and use statistical measures, hierarchical clustering method and Euclidean distance approach, to group the IMFs and the modes into high frequency, low frequency, and trend. The contribution of each component to the volatility of three commodity markets is analyzed. The contribution of each IMF or mode to the total variance of the volatility of crude oil, corn, and gold markets price fluctuations can be established using the Pearson product-moment correlation coefficient, Kendall rank correlation, and Spearman rank correlation coefficient.

3. Data and Descriptive Statistics

This part of the study presents the data selection, descriptive statistics of the data, data pretreatment, and experimental procedure. As mentioned in the Section 1, to analyze the drivers of commodity futures prices, this study chose three commodities’ prices across the stock market, namely, corn from agricultural products, crude oil from energy, and gold from industrial metal, to conduct this empirical research. Table 1 shows the descriptive statistics of the three commodities, and Figures 13 represents the time series plots of corn, crude oil, and gold, respectively.

Daily closing spot market prices from Bloomberg Commodities Index were used in this empirical exercise. Bloomberg Commodities Index, which is considered the standard price for global commodity markets, was used to analyze futures prices of crude oil, corn, and gold in this experiment. We used 1277 observations for the price series of each commodity from May 2016 to April 2021. The data were obtained from http://www.bloomberg.com.

The corn and gold had positive skewness. This suggests the distribution of the two commodities spot market prices has a right tail. The crude oil had negative skewness, which indicates that the spot prices of crude oil have a left tail, as shown in Table 1. The results are confirmed by the minimum and the maximum and the time series plots of the commodity futures prices in Figures 13. Kurtosis of 11.98, 4.05, and 2.59, respectively, indicate that the distribution of daily spot market prices was leptokurtic. The high Jarque–Bera test statistics suggest that the commodity futures prices did not follow a normal distribution. These confirmed the nonnormality and positive skewness of corn and gold and negative skewness of crude oil daily spot market prices of commodity futures prices as revealed by time series plots in Figures 13, respectively.

4. Methods

4.1. Empirical Mode Decomposition (EMD) Algorithm

The empirical mode decomposition technique was introduced by Huang et al. [17] as a nonlinear and nonstationary historical dataset pretreatment tool. It is a robust and systematic decomposition technique used in analyzing nonlinear and nonstationary time series. The EMD addresses full frequency content that cannot be explained by a single Hilbert transform. In analyzing commodity price series within its time domain, some important features may not be uncovered. EMD presumes that analyzing data in its bulk state may have several hidden modes swinging simultaneously. EMD, as a data processing tool, can bring out inherent modes from the actual series, derived from the distinct features of the actual data. EMD can present each inherent mode as an intrinsic mode function (IMF).

We follow Zhu et al. [12] in presenting the EMD algorithm. The EMD approach is operated on the following presumptions:(i)There must be at least one maximum and one minimum in the actual signal(ii)There must be characteristics’ time scale between the extrema(iii)If there is no extremum in the data but have an inflexion point, then the data can be differentiated once or several times to bring out the extrema

The above assumptions make sure that there is harmony in the IMF and approaching periodic function and put the mean to zero. The following parameters, amplitude, frequency, and mean must be zero to certify that an IMF is a harmonic function. In practice, the act of extracting the IMFs from the data is called the “sifting” process. The EMD method can convert the time series data into a bounded and compact figure of intrinsic functions (IMFs).

The EMD process is as follows:(a)All the maxima and minima of the commodity prices series were identified and denoted by (b)Next, we used cubic spline interpolation to produce the upper and lower envelopes, and (c)The point-by-point mean represented by were estimated from upper and lower envelopes using the following formula:(d)The mean was obtained from the series, , and defined the difference of and by :(e)We checked the characteristics of (i)If is an IMF, we denoted as ith IMF and replaced with residual:The ith IMF is denoted by ci (x), and ci (x) is called its index(ii)If was not IMF, it replaced with (f)We repeated Steps (a)–(e) till the residuals met some stopping criteria. For details about stopping criteria, see the work of Zhu et al. [12].

Hence, the commodity futures prices can be written as the sum of IMFs and a residue, mathematically as follows:where n is the number of IMFs, rn (x) is the final residue, and ci (x) is the ith IMF.

In the extraction of IMFs, c1 is the first extracted component and holds the IMF with the first-class scale of the commodity price series. After the extraction, c1 is the residue and identified as the element with a longer period variation; thus, EMD filters and groups the IMFs into high-frequency and low-frequency modes. Practically, the grouping of IMFs is based on the algorithm called “fin-to-coarse reconstruction.” This study uses the Euclidean distance and hierarchical clustering approaches to classify the IMFs into high and low frequencies.

In general, EMD has the following advantages over other decomposition methods:(1)EMD can break down any sample size data into simple independent intrinsic mode function(2)It differs from the wavelet approach because the decomposition depends on the distinct characteristic time scale of the data and allows only extrema in the sifting process; therefore, it is robust, highly efficient, and distinct [44](3)The extracted IMFs through EMD have an instantaneous frequency based on deriving the phase functions such that we can apply the Hilbert transformation to the IMFs

4.2. Variational Mode Decomposition (VMD) Algorithm

Empirical mode decomposition (EMD) has drawbacks, such as mode mixing, noise sensitivity, and sampling. VMD is more robust than EMD in terms of signal decomposition [18]. To overcome the difficulties with EMD, attempts were made at replacing local mean estimation and envelope in EMD [45]. Synchro-squeezed transform (SST) [46], empirical wavelet transforms (EWT) [47], and VMD are different classes of data-driven approaches that aim to extract EMD-like decomposition. VMD is a nonrepeating signal processing method; it is employed to disintegrate real-valued signals into several independent number of band-limited subsignals, called modes yk, with specific sparsity characteristics. The extracted modes can be reduced to one center known as “pulsation” , accompanied by the decomposition process. In estimating the bandwidth, we follow the process below:(1)Apply Hilbert transform to each extracted mode yk, to calculate the center frequency spectrum(2)The calculated center frequency is mixed with the exponential tuned to adjust the mode’s frequency spectrum to baseband(3)Estimate the bandwidth of each mode yk, by using the H1 Gaussian smoothness of the demodulated signal

Mathematically, a constrained variational problem is formulated as follows:

Let be the actual signal of the series and yk be the kth of the actual signal; then,

The constrained variation can be minimized as follows:where is the actual signal, yk is the kth element of the actual signal, is the center frequency of yk, δ (t) is Dirac distribution, ⊗ is convolution operator, m is the number of modes, and t is time script. Considering both penalty term and Lagrangian multiplier, λ, the above constrain problem can be changed to an unconstrained one aswhere ∝ is the constraint-stabilizing parameter of the data and L is the augmented Lagrangian.

The augmented Lagrangian L can be estimated in equation (7), and it is the associated minimax point in the iterative series. Suboptimization of L and its minimax point can be obtained using the alternate direction method of multipliers (ADMM). The ADMM optimization method presumes that updating the actual signal, yk, and center frequency, , in two different directions help to achieve better VMD analysis results. See [18] for a detailed algorithm of ADMM.

The solution for yk and is mathematically illustrated as follows:where are the Fourier transforms of , respectively.

5. Empirical Results

This section concentrates on the detailed description of the empirical results and analysis of IMFs derived from EMD and VMD, respectively.

One of the significant contributions of this study is using two decomposition methods to establish the components of commodity futures prices. We applied EMD and VMD techniques to preprocess each sample data to establish commodity futures price units. The observed data of each commodity futures prices is disintegrated into many IMFs and a residue, RES, using the EMD approach. The IMFs generated by EMD is fixed for any sample data; thus, in the EMD technique, the decomposition components for a given data cannot be changed manually. The EMD approach is illustrated in Figure 4.

Unlike EMD, with the VMD technique, the number of modes can be changed to suit the experimental needs. In this research, many tests were carried out to select the optimal number of decompositions for each price series. Figure 5 illustrates the VMD process.

5.1. Decomposition Results and Analysis

This section presents the decomposition results of corn, crude oil, and gold using the EMD and VMD processes.

5.1.1. Decomposition Results of Corn

For corn futures using the EMD method, the sifting processes automatically produced 7 IMFs and one residue, defined as IMF1, IMF2, …, IMF7. Figure 6 illustrates all the IMFs and the residue of corn futures. The IMFs arranged itself automatically from high frequency to low frequency, and the last one represents the residue in each case. In other words, the IMFs were arranged in order of magnitude. These decompositions represent the price swinging known as a “trend” and market fluctuations, caused by demand-supply and random events, respectively.

Similarly, we applied the VMD technique to decompose the corn price series into eight bind-limit signals, called modes; the decomposition results are shown in Figure 7. The modes were arranged in order of amplitude, thus, the from low-frequency mode to high-frequency mode. We compared eight different decomposition modes, namely, M1, M2, …, M8, respectively.

5.1.2. Statistics of IMF of Corn Derived through EMD Method

The IMFs were analyzed using the following measures: the mean period of each extracted IMF, the correlation between each IMF and the actual price, the variance, and the variance percentage of each IMF. The statistical values of the IMFs and modes of the two decomposition methods of the price data series are presented in Tables 2 and 3, respectively.

The mean period is obtained by dividing the overall data points by the extrema for each decomposed IMF since the frequency and the amplitude of an IMF keep on changing as time goes by; hence, the periods are not static. We used the three correlation coefficients, namely, the Pearson product-moment correlation coefficient, Kendall rank correlation coefficient, and the Spearman correlation coefficient, to estimate the correlation between the IMFs and the actual price. The contribution of each IMF to the total variance of the actual price can be explained by summing up the variances and finding the percentage of the variance since these IMFs do not become dependent on each other. According to Peel et al. [48], the observed variance, the variances of IMFs, and the residue may differ when added up; this is a result of rounding-off during calculations, the irregularity in the actual data series, and the introduction of variance. It can be observed that there are a positive 57.245% and a negative 28.157% differences, as seen in Tables 2 and 4, respectively.

For the decomposition of corn futures prices’ series, the original data series, the leading mode was not any IMF but the residue with the mean period of 425.667 and contributed more than 41% of the total variability of corn futures prices. It had the highest correlation coefficient with the observed price of 0.61478, 0.10777, and 0.16794, respectively, and is known for the long-term deterministic trend [17]. The next significant mode was IMF7, which contributed more than 32% of the total variation of the corn futures prices, with a correlation coefficient between the observed price of 0.54986, 0.36244, and 0.51175 for Pearson, Kendall, and Spearman in that order.

The combined variance of the residue and IMF7 accounted for more than 73% of the total variance of corn future price volatility. The third important mode was the IMF6, with a correlation coefficient between the observed price of 0.06365, 0.02319, and 0.03523 for Pearson, Kendall, and Spearman, respectively, as shown in Table 2, with a variance contribution of more than 18% of the future price of corn. The IMF5 accounted for 3.823% of the variability of corn future price. The IMF4 and IMF3 contributed 2.23% and 1.076% of the total variance of corn future price, respectively. The IMF1 and IMF2 accounted for less than 1% of the variance of the corn future price series.

5.1.3. Statistics of Mode of Corn Derived through VMD Method

Similarly, the decomposition of corn future price series data using VMD produced eight modes, as shown in Figure 7. The aggregate represents the signal of the observed data. It can be deduced from Table 4 that the leading mode of the actual price was M2, which contributed 50.56% overall variance of corn future price. Next was M1, with a mean period of 13.165, which accounted for more than 40% of the net variability of the actual price data. Besides M1 and M2, M3, with a mean period of 38.697, contributed more than 5% of the variability of corn future price. For M4, M5, M6, M7, and M8, the values of correlation coefficients were very low. These modes (M4–M8) contributed 1.765%, 0.436%, 0.286%, 0.187%, and 0.066%, respectively, of the variability of the observed data.

5.1.4. Constitution of Corn Futures Prices

In addition, following Zhu et al. [12], we used Euclidean distance between pairs of any two IMFs or modes or between an IMF or mode and the residue and hierarchical clustering approach to group the IMFs and the modes into three components. The clustering results are presented in Figure 8. An IMF was classified as a high-frequency component when the Euclidean distance was less than 10. An IMF was categorized as a low-frequency component when the Euclidean distance was between 10 and 20. The Euclidean distance of more than 20 was considered a trend component. Table 3 and Figure9 presented the three components and their statistical measures as derived from EMD.

Following the above definition, IMF1, IMF2, IMF3, IMF6, and IMF7 formed the high-frequency components of EMD decomposition, while IMF4 and IMF5 correspond to the low-frequency part of the corn futures price series. Each component has a unique feature and economic interpretation—the low-frequency and the high-frequency components represent special events and short-term market price fluctuations, respectively. The residue changes gradually around the long-term mean, as mentioned in Section 2, and corresponds to the long-term corn futures price trend.

For VMD, unlike EMD, the decompositions are arranged from the low-frequency mode to high-frequency mode. M6–M8, with a short amplitude, represents the high-frequency component of corn price series and accounts for short-term market fluctuations. M2–M5 constitutes the low-frequency part of corn future price from May 4, 2016, to April 30, 2021, and represents the special events under study, while M1 represents the long-term trend of the corn future price. A plot of the extracted components and their statistical measures through VMD are presented in Figure 10 and Table 5. The classification of the modes using Euclidean distance is presented in Figure 11.

5.1.5. Trend of Corn Futures Prices

Trend accounted for 46.478% and 39.527% of the total variability of corn future price in terms of EMD and VMD, respectively. It had the highest correlation coefficient with the observed price of 0.244, 0.306, and 0.410 and 0.703, 0.616, and 0.819 in terms of Pearson, Kendall, and Spearman, respectively, for EMD and VMD. This indicates that trend is an inevitable factor in determining corn future price eventually, as shown in Tables 3 and 5. The persisting increase in the price of corn is due to worldwide economic evolution; therefore, by comparing the trend with the actual series of corn, it can be deduced that, occasionally, corn future price fluctuates due to unpredictable special events, such pest invasion, adverse weather conditions, and diseases, but reverse to the trend after the factor that triggered the event is controlled.

5.1.6. Effects of Special Events on Corn Price

The IMF6 and M2 from EMD and VMD represent the low-frequency components of corn futures prices, respectively, as shown in Figures 8 and 11. It can be seen from Figures 6 and 7 that the low frequencies are nonlinear and long-term trend. The low frequency contributed 8.028% and 59.893% of the total variance of corn futures prices in terms of EMD and VMD, respectively, as presented in Tables 3 and 5. These low frequencies correspond to significant events, such as bad weather, diseases, and pest invasion, which have significant impact on corn price. Based on the mean period of these modes, the shortest impact period is more than one month for the case of EMD and more than 2 months for VMD. It implies that it takes the market some time to eradicate these special events, and the duration may last for a while, looking at the mean period of these special events.

Furthermore, the amplitudes of some of the data points were more than $5, which suggests that some special events have a severe consequence on corn futures market prices. Large price fluctuations in corn futures occur in the medium term, due to such significant events. The normal market fluctuation takes place at a high frequency, and the trend changes gradually. We can measure the effectiveness of every significant event as low frequency and separate from the main price; hence, these results can be used in predicting any significant impact from the upcoming event of the same nature.

5.1.7. Effects of Ordinary Market Disequilibrium on Corn Price

Apart from the trend and the effect of significant events, corn futures prices are also affected by speculation. High frequency occurs as a result of random events, such as speculations within commodity market, financial policies, trade wars, and great events, such as outbreak of pandemic. The effect of these speculations can be classified as high frequency and accounted for more than 45.587% and 0.581% of the total variance of corn price in terms of EMD and VMD, respectively. The effects of ordinary market speculation are sometimes called “market disequilibrium” and consist of events with short-term impact on corn futures prices. The word, short term, is used to describe these effects since the data used in this research is daily. The disequilibrium of supply-demand as a normal market fluctuation has a serious impact on corn price as when these events persistently occur, they become one of the main drivers for pushing corn prices up. We can ignore the normal market fluctuations in the long-term trend forecasting; however, they are very crucial for short prediction. Thus, $386.75/ton of corn in April 2021 can be explicitly explained as a $106.81 trend price, $242.56 special event price, and an ordinary market fluctuation price of $37.38 through the VMD method.

5.1.8. Crude Oil Decomposition Results

Similarly, the price series of crude oil was decomposed using EMD and VMD methods, and the decomposition curves are shown in Figures 12 and 13, respectively. The EMD method decomposed the crude oil data into eight IMFs, called IMF1–IMF8, and one residue, defined as RES, as shown in Figure 12. The VMD approach produced eight modes, and the decomposition result is illustrated in Figure 13.

5.1.9. Statistics of IMF of Crude Oil Derived through EMD Method

In analyzing the IMFs derived from EMD of crude oil price series, we again followed Zhang et al. [32] by using the same parameters as that of corn to estimate each IMF contribution to the variability of crude oil price series. The mode which accounted most for the variance of the observed data was IMF8. It contributed more than 37% of the total variability with the mean period of 319.250 and the correlation coefficients between the observed data was 0.615, 0.426, and 0.604 in terms of Pearson, Kendall, and Spearman, respectively, as shown in Table 6. The IMF7 contributed more than 32% of the observed data’s variability, and the correlation coefficients between the observed data was 0.498, 0.315, and 0.471 for Pearson, Kendall, and Spearman. The two most important IMFs—IMF8 and IMF7—contributed more than 69% of the crude oil price variance. The IMF5 and IMF6 accounted for 6.079% and 8.647% of the variance of crude oil price series, respectively. The remaining IMFs did not contribute much to the original price series variance, indicating that the effect of these IMFs on the crude oil price is minimal.

5.1.10. Statistics of Mode of Crude Oil Derived through VMD Method

Once more, we based our analysis of the modes derived from VMD on the following parameters: a mean period of each mode, the correlation between each mode and the observed data, the variance, and the variance percentage of each mode. Table7 presents the statistics of the modes produced by the VMD method.

The M1 emerged as the dominant mode, with correlation coefficients of 0.731, 0.607, and 0.778 between the mode and the observed data regarding the Pearson, Kendall, and Spearman correlation coefficients, and contributed more than 42% of the entire variance of the actual prices data. The M1 also exhibits a long cycle and continuously increased the trend, and the observed price series also followed the same pattern. The next important mode was M2 which caused more than 39% of the crude oil price volatility. The combined impact of M1 and M2 constituted 81.44% of the overall variance of the crude oil price series. The M3 and M4 contributed 15.42% and 1.72%, respectively, of the variance of crude oil price. On the contrary, the M5–M8 exhibited small correlation coefficients with the actual data and accounted for less than 2% of the net variability of crude oil price, suggesting that their impact on the futures prices of crude oil is insignificant.

5.1.11. Constitution of Crude Oil Price Series

Similarly, we grouped the IMFs of crude oil price series into their respective components using hierarchical clustering and the Euclidean distance techniques. The IMF1–IMF4 were classified as a high-frequency component of EMD decomposition, while IMF5–IMF8 represented the low-frequency component of the crude oil price, as shown in Figure 14. These components have unique features and economic meaning—the residue with long-term mean is classified as the trend component of crude oil futures prices, while the low-frequency part represents the effect of significant events. The high-frequency component is identified by its short amplitudes and constitutes shocks caused by short-lived market fluctuations.

For the VMD approach, M4–M8 form the high-frequency part of the oil price series and constitute short-term market fluctuations. The M2 and M3 correspond to the low-frequency component of the oil series from 2016 to 2021, representing the significant events of the period under consideration. The M1 represents the long-term trend of the crude oil futures prices, as shown in Table 8 and Figure 15.

5.1.12. The Trend of Crude Oil Futures Prices

The second deterministic factor of crude oil prices is the trend; it accounted for more than 10% of the variability of crude oil future price volatility through EMD, as shown in Table 9. This was confirmed by the VMD process, and the trend was still the second determinant of the crude oil price fluctuations, contributed more than 27% of the total variability of crude oil futures, and holds a correlation coefficient of 0.731, 0.607, and 0.778 between the original price series for Pearson, Kendall, and Spearman, suggesting that the trend cannot be downplayed in discovering prices of crude oil in the long run, as presented in Table 8. Indeed, the continuing rise in the price of crude oil is in line with worldwide economic growth, and by comparing the trend with the original price series, we can deduce that crude oil prices could fluctuate due to momentous events. The price, however, reverts to trend after the effect of the special event is ended.

5.1.13. Effects of Special Events on Crude Oil Price

Besides the trend, special events are the major contributor of the crude oil price fluctuations. The IMF5–IMF8 and M2 and M3 represent the low-frequency components of decomposition of futures prices of crude oil by EMD and VMD. From EMD, the special events contributed more than 83% and hold a high correlation coefficient of 0.929, 0.765, and 0.928 between the original price series in terms of Pearson, Kendall, and Spearman, respectively. The low frequency accounted for more than 67% through VMD of the variance of crude oil price, indicating that the special events were the major component of the wide price swings of crude oil. Interestingly, both EMD and VMD have the same mean period of 55.522 of the low-frequency components, respectively, as presented in Tables 7 and 9, indicate that the period of special events may persist for a more extended period. Examining the mean period of the IMFs and the modes, it takes an average of almost two months for the market to adjust itself to these special events.

Furthermore, the amplitude of some data points could reach a high level of $15, indicating that some special events have a remarkable effect on crude oil price determination. The medium-term crude oil price fluctuation occurs through special events due to the gradual changes in trend and ordinary market fluctuation that take place at a high frequency. In forecasting, we can estimate every special event, and the results can be used to predict incoming special events of the same nature by isolating special events from the overall price series. Our study conformed to a similar investigation conducted by Zhang et al. [32] on crude oil price analysis using EMD and EEMD methods.

5.1.14. Effects of Ordinary Market Disequilibrium on Crude Oil Price

In addition to special events and trends, crude oil prices are also affected by the political situation, atmospheric conditions, inventory exhaustion, and strike actions. The combined effects of these factors constitute the high-frequency component of the crude oil futures price series since their effects are often short. The effect of the ordinary market disequilibrium on the crude oil price is not serious because it takes place at high frequency; thus, considering the amplitudes of the IMFs and modes, their duration is very short. We regarded this element as the consequences of ordinary market disequilibrium and considered it a set of events with near term influence on the crude oil price; we used, short-term or near term, to describe these effects because the data used in this research is daily. The disequilibrium of supply-demand from ordinary market fluctuation has no serious effect on the crude oil price. These events have frequently occurred and have become one of the major drivers that raise crude oil prices. The ordinary market fluctuations, thus, can be eliminated from the overall series in long-term trend forecasting but they are very crucial for near term prediction; for instance, through the VMD technique, the price of $64.81 per barrel in April 2021 can be explicitly explained as a trend price of $17.86, a special event price of $43.96, and an ordinary market fluctuation of $2.99. The three components derived from EMD and VMD methods are presented in Figures 16 and 17, respectively.

5.1.15. Decomposition Results of Gold through EMD and VMD Methods

Likewise, the EMD and VMD techniques were used to break up gold price data into their respective IMFs and modes in this section. The decomposition curves obtained are illustrated in Figures 18 and 19, respectively. The EMD method generated eight decomposed IMFs of gold futures price series, defined as IMF1, IMF2, …, IMF8, and one residue, as shown in Figure 18, while the VMD approach decomposed the gold series into eight modes. The decomposition result is presented in Figure 19.

5.1.16. Statistics of IMF of Gold Derived through EMD

The residue contributed most to the overall variability of the price series of gold and accounted for over 34% of the net variance. The correlation coefficients between the residue and the actual price of gold were 0.962, 0.559, and 0.743 in terms of Pearson, Kendall, and Spearman, respectively, as indicated in Table 10. The next significant contributor was IMF7 and accounted for more than 27% of the observed data variance. The IMF8 contributed more than 13% of the variance of the observed data with correlation coefficients of 0.285, 0.186, and 0.259 for Pearson, Kendall, and Spearman, in that order. The IMF5, IMF6, and IMF4 accounted for more than 11%, 5%, and 4%, respectively, of the total variance of gold price series. These IMFs, IMF4, IMF5, IMF6, IMF7, and IMF8, and residue contributed more than 96% of the overall variance of the gold futures series. The rest of the IMFs did not contribute much to the total variance of the gold price series, suggesting that the influence of these IMFs on gold price is insignificant.

5.1.17. Statistics of Mode of Gold through VMD

Similarly, the VMD method decomposed the price series of gold into eight modes, M1, M2, …, M8, as shown in Figure 19. The dominant mode was M1, which contributed more than 93% of the total variability of the gold price volatility, with correlation coefficients between the gold price series and the modes of 0.966, 0.731, and 0.902 in terms of Pearson, Kendall, and Spearman, respectively, as presented in Table 11. The next significant contributor to the variability in the gold price was M2, which accounted for 5.420% of the total variance of the gold futures price series. The correlation coefficients between the observed data and M2 were 0.417, 0.327, and 0.427 for Pearson, Kendall, and Spearman, respectively. The M1 and the M2 were considered the two most significant modes, which contributed over 98% of the entire variability of the volatility of the gold futures series. The contribution of the remaining modes, M3–M8, was less than 2% of the gold price variance. These modes also showed low correlations’ coefficients with the price series of gold, suggesting that their impact on gold price determination is insignificant.

5.1.18. Constitution of Gold Price Series

In this section, we discuss the composition of the gold price in terms of the IMFs and modes’ contributions separately. Using the hierarchical clustering and the Euclidean distance between pairs of any two IMFs or modes, the IMFs and the modes were categorized as low frequency, high frequency, and trend. The IMF1–IMF6 formed the high-frequency components of EMD, while IMF7-IMF8 were grouped as the low-frequency components of the gold price series, as shown in Figure 20. These two grouped IMFs and the residue have economic interpretations and provide vital information on gold futures prices. The gradual change of the mean of the residue corresponds to the long-lasting trend of the gold price. The low-frequency component is classified as special events. The effects of short-lived market fluctuations are represented by the high-frequency parts, which can be identified by their shorter amplitudes, as represented in Table 12 and Figure 18.

For VMD, M3–M8 correspond to the high-frequency components of the gold price series and constitute the short-term effects of normal market fluctuations. M2 represents the low-frequency part of gold futures prices from May 2016 to April 2021 and shows the significant events under study. The long-term trend of the gold futures prices corresponds to M1, as illustrated in Table 13 and Figure 21.

5.1.19. The Trend of Gold Price Series

Trend contributed more than 91% and 93% of the total gold price series variance in EMD and VMD, respectively. It holds the highest correlation coefficient with the observed data, indicating that it is the main force behind the long-term fluctuations of the gold price, as shown in Tables 12 and 13. By comparing the trend with the price series of gold, it demonstrates that, although, gold prices can fluctuate due to special events, but they come back to the trend after the special event is over. Figures 22 and 23 represent the three components of gold futures prices through EMD and VMD, respectively.

5.1.20. Effects of Special Events

The IMF7-IMF8 and M2 represent the low-frequency components of decomposition of futures prices of gold by EMD and VMD, respectively. Low frequency contributed more than 7% and 5% of the total variance of the price series of gold for EMD and VMD, while the low frequency constitutes the shock caused by special events of gold price. Based on the average period of the IMFs that constituted the low frequencies, the duration of the shocks takes a minimum of two months for EMD and a minimum of one month and two weeks for VMD; this means that it is difficult for the market to adjust itself to the effects of these special events. The time frame of the effects of special events was longer, which suggests that most special events have a serious impact on the gold price; hence, intermittent fluctuations in gold prices arise through special events.

5.1.21. Effects of Ordinary Market Disequilibrium

Apart from special events and trends, the price of gold is also influenced by political situations, bad weather conditions, industrial action, and inventory depletion. The combined effects of these conditions are classified as high frequency. The effect has a short-term impact on gold price because the time frames of some of these conditions are very short, which suggest that the market disequilibrium has no serious impact on the price of gold. The disequilibrium of supply-demand as a normal market fluctuation, thus, has no serious impact on the gold price. On the one hand, if these events are not checked for a long period, they become one of the major drivers for increasing gold prices. In long-range trend forecasting, the ordinary market fluctuations can be set aside, but crucial for short period prediction.

In a nutshell, gold price composes of the trend (which contributes most of gold price variations in the long period), the special events (which were the leading causes of sporadic price increases of gold in the medium term), and the ordinary market fluxes (with short-lived effects on the gold price). Using the VMD approach, the price of $1254.23 per ounce during April 2021 can be analyzed as a trend price of $1169.81, a special event price of $67.84, and a normal market fluctuation $16.58, as shown in Figure 23.

6. Conclusion

A new model is proposed to study the underlying factors that causes crude oil, corn, and gold markets price fluctuations based on EMD and VMD techniques. EMD and VMD methods are used to decompose daily closing prices of corn, crude, and gold obtained from Bloomberg Commodity Index from May 2016 to April 2021 into their respective IMFs or modes and a residue to determine the components that derived commodity futures market prices.

Using the Pearson product-moment correlation coefficient, Kendall rank correlation, and Spearman rank correlation as statistical measures enabled us to evaluate each IMF’s effects or mode on the commodity futures prices. Subsequently, we used hierarchical clustering and the Euclidean distance approach to classify the IMFs, residue, and modes into high frequency, low frequency, and trend components, with the trend and low-frequency components being the main drivers of commodity futures prices. The three commodities’ prices could be analyzed as a combination of long-term trend, special events, and near term fluctuations resulting from usual market activities, such as imbalance of supply-demand. In general, the three commodities prices are driven by trends and special events. The effects from the trend deviates gradually and changes around the long-term mean. The unpredictable special events are responsible for irregular commodity price movements, and the effect could last for several years; this shows that the short-term commodity price changes are mainly triggered by normal market activities, and the effects have a very short duration; hence, their impact on commodity price is not serious.

Essentially, in using the decomposition approach in analyzing components of commodity prices’ data series explicitly, several forecasting techniques can be considered: firstly, depending on the characteristics of each IMF or mode, an appropriate forecasting method can be chosen to predict every IMF or mode; for instance, a polynomial function can be used to estimate the residue and Fourier analysis to predict low-frequency IMFs’ modes, using a nonlinear method to stimulate high-frequency IMFs or modes and summing them up to obtain the final results. Secondly, the IMFs or modes can be categorized as linear and nonlinear components which can be forecasted separately and then integrating the individual parts to obtain the final predicted results; for instance, the trend can be dealt with using curve fitting, using nonlinear forecasting techniques such as the backpropagation neural network to forecast short-term fluctuations. Special events are difficult to predict since they are influenced by several elements, such as trade wars, outbreak of pandemic, atmospheric conditions, financial policies, and many other sophisticated factors.

Finally, we suggest that a new framework or integrated forecasting method should be developed to handle special events’ effects on prices since no one knows when and where they will happen.

Data Availability

Data can be found at http://www.bloomberg.com.

Additional Points

Highlights. (i) We investigate futures prices of the following commodities: crude oil, corn, and gold. (ii) We propose a novel price determinant for these commodities: crude oil, corn, and gold. (iii) Empirical mode decomposition and variational mode decomposition are used to decompose crude oil, corn, and gold prices into different components. (iv) We examine the effects of each component on the market price fluctuations. (v) We indicate that empirical mode decomposition and variational mode decomposition can bring out the underlying factors of commodity market price volatility.

Conflicts of Interest

The authors declare that they have no conflicts of interest.