Research Article  Open Access
Muhammad Aslam, Osama H. Arif, "Multivariate Analysis under Indeterminacy: An Application to Chemical Content Data", Journal of Analytical Methods in Chemistry, vol. 2020, Article ID 1406028, 6 pages, 2020. https://doi.org/10.1155/2020/1406028
Multivariate Analysis under Indeterminacy: An Application to Chemical Content Data
Abstract
The Hotelling Tsquared statistic has been widely used for the testing of differences in means for the multivariate data. The existing statistic under classical statistics is applied when observations in multivariate data are determined, precise, and exact. In practice, it is not necessary that all observations in the data are determined and precise due to measurement in complex situations and under uncertainty environment. In this paper, we will introduce the Hotelling Tsquared statistic under neutrosophic statistics (NS) which is the generalization of classical statistics and applied under uncertainty environment. We will discuss the application and advantage of the neutrosophic Hotelling Tsquared statistic with the aid of data. From the comparison, we will conclude that the proposed statistic is more adequate and effective in uncertainty.
1. Introduction
In classical statistics (CS), the univariate analysis is the technique to analyze the singlevariable data. The multivariate analysis has been widely used to analyze data having more than one variable. In the multivariate technique under the CS, the Hotelling Tsquared statistic has been widely applied in the variety of fields (see, for example, [1, 2]), for the testing either the means for more than one populations are equal or not. This statistic is the extension of the ttest, which is applied for the testing of the mean for the single population. Brereton [3] used the Hotelling Tsquared statistic to detect the outlier in chemical data. In [4], Varmuza and Filzmoser worked on multivariate analysis for chemometric data. Hervé et al. [5] applied the multivariate technique on biological data. Kitaga ki et al. [6] used Hotelling Tsquared statistic in chemical and electrochemical oscillator issues. For more details about the applications of the Hotelling Tsquared statistic, the reader may read [3, 7] and [8].
The Hotelling Tsquared statistic derived under the CS can be only applied for the analysis when all observations in the multivariate data are determined, precise, and certain. In practice, the data under study are not always precise but linguistic. For example, the temperature of a certain city may be high, low, and medium or the measurement of variable data in a complex system may lead to being in an interval rather than the determined values. In such situations, the Hotelling Tsquared statistic under the CS cannot be used for the analysis of the data. When observations are uncertain or fuzzy, the fuzzy Hotelling Tsquared statistic can be applied for the testing of means of multivariate populations. Taleb et al. [9] applied the fuzzy Hotelling Tsquared statistic to design a control chart. D’Urso [10] provided a review on fuzzy multivariate analysis. Bakdi and Kouadri [11] presented a new adoptive principle component analysis technique to detect fault in a complex system. In [12], Ammiche et al. introduced principle component analysis for the Tennessee Eastman process using a fuzzy approach. More applications can be read in [13–15].
Recently, the neutrosophic logic, which is the extension of the fuzzy logic, attracted many researchers due to its applications in the variety of fields. The neutrosophic logic considered the measure of indeterminacy which fuzzy logic does not consider (see [16]). The neutrosophic statistics (NS) which is based on the neutrosophic numbers is the generalization of the CS (see [17, 18]). The NS has been applied widely in the rockmeasuring issues (see, for example, [19, 20]). The application of the NS for the inspection of the product can be seen in [21, 22]. The applications of the NS in the area of the process control can be seen in [23, 24]. The application of the NS in medical can be read in [25]. For more information on neutrosophic theory, the reader may refer to [26, 27].
Aslam and Smarandache [17, 18] pointed out some suggestions to extend the several concepts of CS to the NS. By exploring the literature and best of our knowledge, there is no work on the development of Hotelling Tsquared statistic under the NS. In this paper, we will introduce the Hotelling Tsquared statistic under NS, which is the generalization of classical statistics and applied under uncertainty environment. We will discuss the application and advantage of neutrosophic Hotelling Tsquared statistic with the aid of data. We expect that the proposed neutrosophic Hotelling Tsquared statistic will perform better than the existing Hotelling Tsquared statistic in uncertainty.
2. Preliminaries
Let be a neutrosophic random variable, which represents the particular neutrosophic observation of the variable that is noted from the item. Note here that is expressed in the indeterminacy interval having the smaller value and the larger value . The neutrosophic form of having determinate part and indeterminate part can be written as follows: . Note here that the neutrosophic random variable reduces to the variable under classical statistics if no indeterminacy is recorded in the data. The neutrosophic data matrix having neutrosophic observations of neutrosophic variables is given as follows:
The neutrosophic form of can be written as
Note here that is the generalization of the data matrix under classical statistics. The data matrix under reduces to the data matrix under classical statistics when = 0.
The neutrosophic sample mean and neutrosophic sample variance from measurements from neutrosophic variables are computed as follows:
The neutrosophic form of can be written as
Note here that is the generalization of the sample mean under classical statistics. The data matrix under reduces to the sample mean under classical statistics when = 0:
The neutrosophic form of can be written as
Note here that is the generalization of sample variance under classical statistics. The data matrix under reduces to the sample variance under classical statistics when = 0.
The neutrosophic sample covariance between two neutrosophic variables are given by
The neutrosophic form of can be written as
Note here that is the generalization of sample covariance under classical statistics. The data matrix under reduces to the sample covariance under classical statistics when no indeterminate observations.
Finally, neutrosophic sample correlation between the and variables is given by
The neutrosophic form of can be written as
Note here that is the generalization of sample correlation under classical statistics. The data matrix under reduces to the sample correlation under classical statistics when no indeterminate observations.
The neutrosophic descriptive statistics for measurements and on variables can be presented into the following arrays.
The neutrosophic sample mean variance and covariance and correlation are presented by the array
3. Neutrosophic Hotelling Statistic
In this section, we discuss the proposed neutrosophic Hotelling statistic. In classical statistics, the student test is applied for the testing of the mean for the univariate case. As mentioned by [28], rejecting the null hypothesis that means are equal when is large is the same as rejecting the null hypothesis of its square:
The neutrosophic form of can be written as
Note here that is the generalization of Hotelling Tsquared statistic under classical statistics. The data matrix under reduces to the Hotelling Tsquared statistic under classical statistics when no indeterminate observations.
For the given values of and , the null hypothesis will be rejected ifwhere is the level of significance and is upper percentiles of the neutrosophic distribution with the neutrosophic degree of freedom .
The generalization of equations (1) and (2) for the multivariate case under the neutrosophic statistical interval method (NSIM) is given bywhere
The neutrosophic form of can be written as
The statistic is given in equation (14) is called neutrosophic Hotelling statistic and has neutrosophic distribution with neutrosophic degree of freedom (ndf) and :
The neutrosophic Hotelling statistic can be used for the testing of hypothesis and alternative hypothesis . The will be rejected if
The software provides the value in making a decision about the acceptance or the rejection of the null hypothesis. According to [18], “a neutrosophic value is defined in the same way as in classical statistics: the smallest level of significance at which a null hypothesis can be rejected.” Note here that the neutrosophic value is not an exact or determined value as in the case of classical statistics. Smarandache [18] discussed criteria to accept or reject the null hypothesis using the neutrosophic value.
4. Application
Now, we discuss the application of the proposed neutrosophic Hotelling statistic using data selected from the healthcare department. The data are collected from 20 healthy women and three variables, which are sweat rate, sodium, and potassium contents are measured. The observations of variables underinvestigated will be obtained from the measurement process. It is expected that not all observations in the data are precise and exact. Therefore, it cannot be analyzed using CS. Similar data for classical statistics are given by [28]. The data having some neutrosophic observations are shown in Table 1. We want to test that the means of three groups for the healthy women have the same population means. We state null and alternative hypotheses as follows: Step 1: vs . Step 2: some basic calculations for the data are given in Table 1 are shown as Step 3: let be the level of significance. Step 4: the neutrosophic Hotelling statistic is Step 5: the critical region is using equation (5) is given as Step 6: as , we reject .

5. Comparisons
In Section 4, we presented the testing procedure for the proposed neutrosophic Hotelling . The proposed neutrosophic Hotelling is the generalization of CS. The proposed neutrosophic Hotelling testing procure reduces to the testing procedure under CS when all observations of sweat data are precise. From neutrosophic sweat data, we note that the proposed testing procedure provides the analysis values in the indeterminacy interval rather than the determined values. The neutrosophic form of proposed Hotelling statistic is . For example, the proposed Hotelling statistic has the indeterminacy interval from 9.73 to 11.41. It means, under uncertainty environment, one can expect the values of from 9.73 to 11.41. The first value 9.73 of the indeterminacy interval of shows the determined part, and 11.41 is an indeterminate part. When imprecise observations are noted in the sweat data, the value of is 9.73 which is under the CS. In other words, when the level of significance is 5%, the probabilities that the null hypothesis is accepted, rejected, and indeterminate are 0.95, 0.50, and 0.1470. By comparing the proposed test with the test under CS, we note that the existing test is unable to tell about the probability of the indeterminacy. As mentioned by [19, 20] that a method that provides the values in an indeterminacy interval under uncertainty is considered as the most effective and adequate method. By comparing the proposed testing procedure with the existing under CS, our theory is the same as in [19, 20].
6. Concluding Remarks
In this paper, we introduced the Hotelling Tsquared statistic under neutrosophic statistics (NS) which is the generalization of classical statistics and applied under uncertainty environment. We discussed the application and advantage of neutrosophic Hotelling Tsquared statistic with the aid of data. The proposed neutrosophic Hotelling Tsquared statistic is expressed in the indeterminacy interval and hence more flexible and information than the Hotelling Tsquared statistic under classical statistics. Based on the comparison, we recommend using the proposed neutrosophic Hotelling Tsquared statistic for the analysis of the data under uncertainty. Some more properties of the proposed neutrosophic Hotelling Tsquared statistic can be studied as future research. The sensitivity of the proposed statistic to uncertainty and measurement errors can be studied in future work.
Data Availability
The data used to support the findings of this study are included in the paper.
Conflicts of Interest
The authors declare that they have no conflicts of interest regarding this paper.
Acknowledgments
This article was funded by the Deanship of Scientific Research (DSR) at King Abdulaziz University, Jeddah. The authors, therefore, acknowledge DSR technical and financial support with thanks.
References
 L. Caucci, H. H. Barrett, N. Devaney, and J. J. Rodríguez, “Application of the Hotelling and ideal observers to detection and localization of exoplanets,” Journal of the Optical Society of America A, vol. 24, no. 12, pp. B13–B24, 2007. View at: Publisher Site  Google Scholar
 A. Shabbak and H. Midi, “An improvement of the Hotelling T^{2} statistic in monitoring multivariate quality characteristics,” Mathematical Problems in Engineering, vol. 2012, Article ID 531864, 15 pages, 2012. View at: Publisher Site  Google Scholar
 R. G. Brereton, “Hotelling’s Tsquared distribution, its relationship to the Fdistribution and its use in multivariate space,” Journal of Chemometrics, vol. 30, no. 1, pp. 18–21, 2016. View at: Publisher Site  Google Scholar
 K. Varmuza and P. Filzmoser, Introduction to Multivariate Statistical Analysis in Chemometrics, CRC Press, Boca Raton, FL, USA, 2016.
 M. R. Hervé, F. Nicolè, and K.A. Lê Cao, “Multivariate analysis of multiple datasets: a practical guide for chemical ecology,” Journal of Chemical Ecology, vol. 44, no. 3, pp. 215–234, 2018. View at: Publisher Site  Google Scholar
 B. T. Kitagaki, M. R. Pinto, A. C. Queiroz, M. C. Breitkreitz, F. Rossi, and R. Nagao, “Multivariate statistical analysis of chemical and electrochemical oscillators for an accurate frequency selection,” Physical Chemistry Chemical Physics, vol. 21, no. 30, pp. 16423–16434, 2019. View at: Publisher Site  Google Scholar
 R. L. Mason, Y.M. Chou, and J. C. Young, “Applying Hotelling’s T^{2} statistic to batch processes,” Journal of Quality Technology, vol. 33, no. 4, pp. 466–479, 2001. View at: Publisher Site  Google Scholar
 N. Zhao, X. Zhan, K. A. Guthrie, C. M. Mitchell, and J. Larson, “Generalized Hotelling’s test for paired compositional data with application to human microbiome studies,” Genetic Epidemiology, vol. 42, no. 5, pp. 459–469, 2018. View at: Publisher Site  Google Scholar
 H. Taleb, M. Limam, and K. Hirota, “Multivariate fuzzy multinomial control charts,” Quality Technology & Quantitative Management, vol. 3, no. 4, pp. 437–453, 2006. View at: Publisher Site  Google Scholar
 P. D’Urso, “Exploratory multivariate analysis for empirical information affected by uncertainty and modeled in a fuzzy manner: a review,” Granular Computing, vol. 2, no. 4, pp. 225–247, 2017. View at: Publisher Site  Google Scholar
 A. Bakdi and A. Kouadri, “A new adaptive PCA based thresholding scheme for fault detection in complex systems,” Chemometrics and Intelligent Laboratory Systems, vol. 162, pp. 83–93, 2017. View at: Publisher Site  Google Scholar
 M. Ammiche, A. Kouadri, and A. Bakdi, “A combined monitoring scheme with fuzzy logic filter for plantwide Tennessee Eastman Process fault detection,” Chemical Engineering Science, vol. 187, pp. 269–279, 2018. View at: Publisher Site  Google Scholar
 A. Azadeh, S. F. Ghaderi, S. Pashapour, A. Keramati, M. R. Malek, and M. Esmizadeh, “A unique fuzzy multivariate modeling approach for performance optimization of maintenance workshops with cognitive factors,” The International Journal of Advanced Manufacturing Technology, vol. 90, no. 1–4, pp. 499–525, 2017. View at: Publisher Site  Google Scholar
 O. Sunanta, “Generalized point estimators for fuzzy multivariate data,” Austrian Journal of Statistics, vol. 47, no. 1, pp. 33–44, 2018. View at: Publisher Site  Google Scholar
 C. K. Yoo, P. A. Vanrolleghem, and I.B. Lee, “Nonlinear modeling and adaptive monitoring with fuzzy and multivariate statistical methods in biological wastewater treatment plants,” Journal of Biotechnology, vol. 105, no. 12, pp. 135–163, 2003. View at: Publisher Site  Google Scholar
 F. Smarandache, “Neutrosophic logicA generalization of the intuitionistic fuzzy logic,” in Multispace & Multistructure. Neutrosophic Transdisciplinarity (100 Collected Papers of Science), vol. 4, p. 396, Hanko, Hanko, Finland, 2010. View at: Publisher Site  Google Scholar
 M. Aslam, “A new sampling plan using neutrosophic process loss consideration,” Symmetry, vol. 10, no. 5, p. 132, 2018. View at: Publisher Site  Google Scholar
 F. Smarandache, “Introduction to neutrosophic statistics,” in Infinite Study, SiTech, Keswick, Australia, 2014. View at: Publisher Site  Google Scholar
 J. Chen, J. Ye, and S. Du, “Scale effect and anisotropy analyzed for neutrosophic numbers of rock joint roughness coefficient based on neutrosophic statistics,” Symmetry, vol. 9, no. 10, p. 208, 2017. View at: Publisher Site  Google Scholar
 J. Chen, J. Ye, S. Du, and R. Yong, “Expressions of rock joint roughness coefficient using neutrosophic interval statistical numbers,” Symmetry, vol. 9, no. 7, p. 123, 2017. View at: Publisher Site  Google Scholar
 M. Aslam, “Design of sampling plan for exponential distribution under neutrosophic statistical interval method,” IEEE Access, vol. 6, pp. 64153–64158, 2018. View at: Publisher Site  Google Scholar
 M. Aslam, “Product acceptance determination with measurement error using the neutrosophic statistics,” Advances in Fuzzy Systems, vol. 2019, Article ID 8953051, 8 pages, 2019. View at: Publisher Site  Google Scholar
 M. Aslam, “Attribute control chart using the repetitive sampling under neutrosophic system,” IEEE Access, vol. 7, pp. 15367–15374, 2019. View at: Publisher Site  Google Scholar
 M. Aslam, N. Khan, and M. Khan, “Monitoring the variability in the process using neutrosophic statistical interval method,” Symmetry, vol. 10, no. 11, p. 562, 2018. View at: Publisher Site  Google Scholar
 M. Aslam and M. Albassam, “Application of neutrosophic logic to evaluate correlation between prostate cancer mortality and dietary fat assumption,” Symmetry, vol. 11, no. 3, p. 330, 2019. View at: Publisher Site  Google Scholar
 M. AbdelBasset, M. Gunasekaran, M. Mohamed, and F. Smarandache, “A novel method for solving the fully neutrosophic linear programming problems,” Neural Computing and Applications, vol. 31, no. 5, pp. 1595–11605, 2019. View at: Publisher Site  Google Scholar
 M. AbdelBasset, G. Manogaran, M. Mohamed, and N. Chilamkurti, “Threeway decisions based on neutrosophic sets and AHPQFD framework for supplier selection problem,” Future Generation Computer Systems, vol. 89, pp. 19–30, 2018. View at: Publisher Site  Google Scholar
 R. A. Johnson and D. W. Wichern, Applied Multivariate Statistical Analysis, PrenticeHall, Upper Saddle River, NJ, USA, 5 edition, 2002.
Copyright
Copyright © 2020 Muhammad Aslam and Osama H. Arif. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.