Abstract

In this paper, the maximal relevance measure and the minimal redundancy maximal relevance (mRMR) algorithm (under the D-R and D/R criteria) have been applied to select features and to compose different feature subsets from observed motion sensor events for human activity recognition in smart home environments. The selected feature subsets have then been evaluated, and the activity recognition accuracy rates compared, with two probabilistic algorithms: the naïve Bayes (NB) classifier and the hidden Markov model (HMM). The experimental results show that not all features are beneficial to human activity recognition and that different feature subsets yield different recognition accuracy rates. Furthermore, the same feature subset can affect the recognition accuracy rate of different activity classifiers differently. It is therefore important for researchers performing human activity recognition to consider both the relevance between features and activities and the redundancy among features. Generally, both the maximal relevance measure and the mRMR algorithm are feasible for feature selection and benefit activity recognition.

1. Introduction

The aging of the population and the increasing number of elderly people who choose to live on their own [1–3] are an indisputable social reality. Smart home technology can play an important role in detecting and analyzing health events [4] and in providing medical assistance and caregiving for frail elderly and disabled people who are unable to live independently for long periods at home, far from a hospital or their families, e.g., by reminding them when to take medicine or see the doctor and by assisting them in turning off the water or the oven.

Accurate assessment of human Activities of Daily Living (ADLs) is the prerequisite for a smart home to deliver the correct service, whether for health monitoring of the elderly or the severely disabled or for providing them with other relevant help. Once dangerous behavior is detected, the smart home itself can respond and eliminate as much of the inhabitant's risk as possible. Therefore, accurate recognition of human activity in smart homes is of great significance and also provides a pattern for realizing healthcare for solitary elderly or disabled people; it is the most important step in incorporating ambient intelligence into smart environments [5–8].

Recently, human activity discovery and recognition has gained considerable interest due to its enormous potential in context-aware computing systems, including smart home environments. The primary objective of human activity recognition in a smart home environment is to find interesting patterns of behavior in the gathered sensor data and to recognize such patterns. Currently, one of the primary challenges of human activity recognition is choosing machine learning algorithms that perform well on the sequences of sensor data collected in the smart home environment during activities. In recent years, several intelligent algorithms for human activity recognition in smart homes have been reported. Singla, Crandall, and Cook et al. described applications of probability-based algorithms, including the naïve Bayes (NB) classifier, the Markov model (MM), and the hidden Markov model (HMM) [9, 10], which are trained on partially labeled motion sensor event data to obtain prior parameter values and then validated on the remaining labeled sensor data. Liu et al. presented a Bayesian-network-based probabilistic generative framework, built on Allen's temporal relations over primitive events, to characterize the structural variability of complex activities [11]. Gayathri et al. proposed a statistical relational learning approach that augments ontology-based activity recognition with probabilistic reasoning through a Markov Logic Network (MLN) [12]. Kim et al. proposed a discriminative and generative probabilistic model, the conditional random field (CRF), as a more flexible alternative to HMM [13]. Zhu et al. presented a two-layer CRF model that represents action segments and activities hierarchically, allowing the integration of motion and various context features at different levels and automatically learning the statistics that capture the patterns of the features [14]. Chen et al. introduced a knowledge-driven approach to continuous activity recognition based on multisensor data streams in smart homes [15]. Fahim et al. introduced a novel Evolutionary Ensembles Model (EEM), based on a Genetic Algorithm (GA) to handle the nondeterministic nature of activities, which values both minor and major activities by processing each of them independently [16]. Fleury et al. proposed support vector machine- (SVM-) based multimodal classification of ADLs in health smart homes [17]. Wen and Wang combined Latent Dirichlet Allocation (LDA) and AdaBoost to jointly train a general activity recognition model with partially labeled data [18]. Hong et al. composed a hybrid model of Bayesian networks and support vector machines to accurately recognize human activity [19].

Besides the suitable choice of machine learning algorithm, another key point for human activity recognition in smart homes is selecting valid features from the sensor event datasets collected in the smart home environment. Usually, such datasets include a large number of observed sensor events generated by various activities, and any activity annotated in the dataset has many features, including redundant and irrelevant ones [20]. However, in most studies these features are selected in a single fixed way across all tests, and their influence on classifier performance is seldom addressed. Actually, features that are irrelevant to activity recognition or redundant with other features should be removed prior to activity recognition. Feature selection thus means selecting the feature subset that is most favorable for activity recognition while successfully compressing the data.

Recently, the minimal redundancy maximal relevance (mRMR) feature selection algorithm has been widely applied in many research fields; it aims to achieve the best classification performance by reducing the redundancy among the selected features while maximizing their relevance to the target class. Mohamed et al. implemented the mRMR filter and a metaheuristic approach as the feature selection process for drug response microarray classification [21]. Che et al. presented a novel mutual information feature selection method based on the normalization of the maximum relevance and minimum common redundancy for nonlinear classification or regression problems [22]. Xu et al. proposed a new distributed monitoring scheme for plant-wide processes which integrates mRMR, Bayesian inference, and principal component analysis [23]. Li et al. provided a granular feature selection method with an mRMR criterion based on mutual information (MI) for multilabel learning [24]. Escalona-Vargas et al. proposed a method that uses mRMR as the criterion to automatically select references for the frequency-dependent subtraction method, attenuating maternal and fetal magnetocardiograms in fetal magnetoencephalography recordings [25]. Mallik et al. developed a new framework for identifying statistically significant epigenetic biomarkers using mRMR-criterion-based feature (gene) selection for multiomics datasets [26]. Tiwari calculated weighted distances to improve the prediction performance for G-protein coupled receptor families and their subfamilies using sequence-derived properties, and provided a feature selection method based on the fusion of mRMR and other supervised filters [27]. Chen and Yan developed an optimized multilayer feedforward network, using mRMR-partial mutual information clustering integrated with least squares regression, to construct a soft sensor for controlling the naphtha dry point [28]. Wang et al. presented a multiobjective evolutionary algorithm which employs Pareto optimality to evaluate candidate feature subsets and finds compact feature subsets with both maximal relevance and minimal redundancy [29]. Morgado and Silveira proposed a multivariate procedure, inspired by the mRMR algorithm, capable of selecting nonredundant subsets of features significantly faster than other similar methods, for the diagnosis of Alzheimer's disease and related disorders [30]. Kamandar and Ghassemian used a modified mRMR as the criterion for feature extraction for hyperspectral image classification based on information-theoretic learning [31]. Kandaswamy et al. extracted the best features using the mRMR feature selection algorithm and used the random forest algorithm to predict extracellular matrix proteins [32]. Jin et al. proposed a novel method for health monitoring and anomaly detection of cooling fans in electronic products based on the Mahalanobis distance with mRMR features [33]. Unler et al. presented a hybrid filter-wrapper feature subset selection algorithm based on particle swarm optimization for support vector machine classification; the filter model is based on mutual information and is a composite measure of feature relevance and redundancy with respect to the selected feature subset [34]. Zdravevski et al. proposed a generic feature engineering method for selecting robust features from a variety of sensors, in which a variety of time and frequency domain features are extracted from the originally recorded time series and from newly generated time series (i.e., magnitudes, first derivatives, delta series, and fast Fourier transformation- (FFT-) based series) [35, 36].

Inspired by the ideas mentioned above, in this paper, the maximal relevance measure and the mRMR feature selection algorithm (under the D-R and D/R criteria) have been applied to select features and to compose different feature subsets from the observed motion sensor events for human activity recognition in smart home environments. The selected feature subsets have then been evaluated, and the activity recognition accuracy rates compared, with two probabilistic algorithms: the NB classifier and HMM.

The rest of the paper is organized as follows. Section 2 describes the smart apartment testbed, the data collection, and the 13 features of the observed sensor events for human activity recognition. Section 3 presents the concepts of information entropy, mutual information (MI), and the minimal redundancy maximal relevance (mRMR) feature selection algorithm. Section 4 gives the training and testing activities and the feature subsets selected through the maximal relevance measure and the mRMR feature selection algorithm, and then presents the comparison of activity recognition accuracy rates with the selected feature subsets and the performance measures of the NB classifier and HMM. Section 5 summarizes the main contributions.

2. Smart Apartment Testbed and Data Collection for Human Activity Recognition

2.1. Smart Apartment Testbed and Data Collection

The smart apartment testbed for this research is located on the Washington State University campus and is maintained as part of the Center for Advanced Studies in Adaptive Systems (CASAS) smart home project; it includes three bedrooms, one bathroom, a kitchen, and a living/dining room. The smart apartment is equipped with motion sensors distributed approximately 1 meter apart on the ceilings throughout the space, as shown in Figure 1. In addition, other installed sensors provide ambient temperature readings, and custom-built analog sensors provide readings for hot water, cold water, and stove burner use. Sensor data are captured using a sensor network designed in-house and stored in a Structured Query Language (SQL) database. After data are collected from the smart apartment testbed, the sensor events are annotated with ADLs, which are used for training and testing the activity recognition algorithms [3, 4, 7, 9, 10].

The data gathered by the CASAS smart home are represented by the following parameters, which specify the features used to describe the observed sensor events. The generalized syntax of the dataset (Cairo Dataset, 2009) is

Date Time SensorID SensorValue <label>

An example from the dataset for the Night_wandering activity is:

2009-06-10 03:20:59.08 M006 ON Night_wandering begin
2009-06-10 03:25:19.05 M012 ON
2009-06-10 03:25:19.08 M011 ON
2009-06-10 03:25:24.05 M011 OFF
2009-06-10 03:25:24.07 M012 OFF Night_wandering end

This example shows that the observed sensor events correspond to the Night_wandering activity with concrete Date, Time, Sensor ID, Sensor Value, and activity label parameters.
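For illustration, a minimal reader for this record layout; the `SensorEvent` structure, its field names, and the parsing function are our own constructions, not part of the CASAS tooling (lines may carry an optional trailing activity label):

```python
from dataclasses import dataclass
from datetime import datetime

@dataclass
class SensorEvent:
    timestamp: datetime  # combined Date and Time fields
    sensor_id: str       # e.g., "M006"
    value: str           # e.g., "ON" or "OFF"
    label: str           # e.g., "Night_wandering begin"; "" if unlabeled

def parse_event(line: str) -> SensorEvent:
    """Parse one 'Date Time SensorID SensorValue <label>' record."""
    parts = line.split()
    ts = datetime.strptime(parts[0] + " " + parts[1], "%Y-%m-%d %H:%M:%S.%f")
    return SensorEvent(ts, parts[2], parts[3], " ".join(parts[4:]))

print(parse_event("2009-06-10 03:20:59.08 M006 ON Night_wandering begin"))
```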

2.2. Features of Observed Sensor Events

Considering the actual situation, each activity has 13 features of the observed sensor events:

(1) The mean of the logical values of the Sensor IDs of each activity's sensor events is f1. Since the place where each activity happens is relatively stable, the average of the Sensor IDs indicates the focus area where the activity occurs:

$$\bar{s}_i = \frac{1}{n_i}\sum_{j=1}^{n_i} s_{ij}, \tag{1}$$

where $\bar{s}_i$ is the mean of the Sensor IDs of activity $i$, $n_i$ is the number of sensor events of activity $i$, and $s_{ij}$ is the $j$th Sensor ID of activity $i$.

(2) The variance of all Sensor IDs triggered by the current activity, f2, is

$$\sigma_i^2 = \frac{1}{n_i}\sum_{j=1}^{n_i}\left(s_{ij} - \bar{s}_i\right)^2. \tag{2}$$

(A computational sketch of f1 and f2 is given after this list.)

(3) Day of the week, converted into a value in the range 0 to 6, is f3.

(4) Previous activity, which represents the activity that occurred before the current activity, is f4.

(5) Activity length, which represents the length of the current activity measured in number of sensor events, is f5.

(6) The logical value of the first Sensor ID triggered by the current activity is f6.

(7) The logical value of the last Sensor ID triggered by the current activity is f7.

(8) The duration of the current activity, i.e., the time interval between the last sensor and the first sensor triggered by the current activity, is f8.

(9) The beginning time of the current activity is f9.

(10) The ending time of the current activity is f10.

(11) Next activity, which represents the activity that occurred after the current activity, is f11.

(12) The mode of the Sensor IDs triggered by the current activity is f12.

(13) The median of the Sensor IDs triggered by the current activity is f13.
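As a concrete sketch of (1) and (2), assuming each Sensor ID such as M006 is mapped to its numeric value 6; the mapping and function names are our own illustrative choices:

```python
def mean_sensor_id(sensor_ids: list) -> float:
    """f1: mean of the numeric Sensor IDs triggered by one activity, eq. (1)."""
    return sum(sensor_ids) / len(sensor_ids)

def var_sensor_id(sensor_ids: list) -> float:
    """f2: variance of the numeric Sensor IDs of one activity, eq. (2)."""
    m = mean_sensor_id(sensor_ids)
    return sum((s - m) ** 2 for s in sensor_ids) / len(sensor_ids)

# Sensor IDs of the Night_wandering example in Section 2.1: M006, M012, M011, M011, M012
ids = [6, 12, 11, 11, 12]
print(mean_sensor_id(ids), var_sensor_id(ids))  # 10.4 5.04
```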

Usually, the optimal feature subset contains the smallest number of dimensions that contribute to a higher recognition accuracy rate. Therefore, it is necessary to remove the redundant and unimportant features.

3. Minimal Redundancy Maximal Relevance Feature Selection Algorithm

3.1. Entropy

Information entropy is an abstract mathematical concept that can be understood in terms of the probability of occurrence of specific information. Generally, a higher probability for a piece of information indicates that it is spread more widely or cited more often; information entropy can therefore represent the value of information. An information source is the source of messages and message sequences. For example, the simplest discrete information source is $X = \{x_1, x_2, \ldots, x_n\}$, where $p(x_i)$ is the probability of $x_i$; then, the entropy of $X$ can be defined as

$$H(X) = -\sum_{i=1}^{n} p(x_i)\log_2 p(x_i). \tag{3}$$
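As an illustration, a minimal empirical sketch of (3); the function name and the relative-frequency estimator are our own choices, not from the paper:

```python
import math
from collections import Counter

def entropy(samples: list) -> float:
    """Empirical Shannon entropy H(X) = -sum_i p(x_i) log2 p(x_i), eq. (3),
    with probabilities estimated as relative frequencies in the sample."""
    n = len(samples)
    return -sum((c / n) * math.log2(c / n) for c in Counter(samples).values())

# For p = (0.75, 0.25): H = -(0.75 log2 0.75 + 0.25 log2 0.25) ≈ 0.811 bits
print(entropy(["ON", "ON", "ON", "OFF"]))  # 0.8112781244591328
```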

3.2. Conditional Entropy, Joint Entropy, and Mutual Information
3.2.1. Conditional Entropy

In information theory, the conditional entropy quantifies the amount of information needed to describe the outcome of a random variable X given that the value of another random variable Y is known. If X and Y are dependent on each other, then, under the condition $Y = y$, the conditional entropy of X is

$$H(X \mid Y = y) = -\sum_{x} p(x \mid y)\log_2 p(x \mid y). \tag{4}$$

If Y is given, then the conditional entropy of X is

$$H(X \mid Y) = -\sum_{x}\sum_{y} p(x, y)\log_2 p(x \mid y), \tag{5}$$

where $p(x, y)$ is the joint probability of $x$ and $y$.

3.2.2. Joint Entropy

Joint entropy is a measure of the uncertainty associated with a set of random variables. Supposing two random variables X and Y, each taking finitely many values, the joint entropy is

$$H(X, Y) = -\sum_{x}\sum_{y} p(x, y)\log_2 p(x, y). \tag{6}$$

The joint entropy is a measure of the correlation of X and Y. If X and Y are independent, then the joint entropy is

$$H(X, Y) = H(X) + H(Y). \tag{7}$$

3.2.3. Mutual Information

Mutual information (MI) is a quantity that measures the similarity and correlation of random variables [33, 37]. Supposing two random variables X and Y, where Y contains some information about X, the MI between X and Y can be defined as

$$I(X; Y) = \sum_{x}\sum_{y} p(x, y)\log_2 \frac{p(x, y)}{p(x)\,p(y)}. \tag{8}$$

MI is typically defined as a measure of the mutual dependence of two random variables. A larger MI value means a closer relationship, i.e., a larger correlation, between the two random variables. If the MI value is zero, the two random variables are uncorrelated and independent. Therefore, in this paper, MI is used to measure both the similarity among features and the correlation between a feature and an activity.
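For instance, a minimal empirical MI estimator over discrete samples; the paper does not specify its estimator, so this is only a sketch, and continuous features such as durations would first need discretization:

```python
import math
from collections import Counter

def mutual_information(xs: list, ys: list) -> float:
    """Empirical MI I(X;Y) = sum_{x,y} p(x,y) log2[p(x,y) / (p(x) p(y))], eq. (8)."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum((c / n) * math.log2((c / n) / ((px[x] / n) * (py[y] / n)))
               for (x, y), c in pxy.items())

# Identical variables: I(X;X) = H(X); independent variables: I close to 0
print(mutual_information([0, 1, 0, 1], [0, 1, 0, 1]))  # 1.0 bit
```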

3.3. Minimal Redundancy Maximal Relevance (mRMR) Algorithm

Although MI is widely applied in the feature selection field, some deficiencies remain: most feature selection algorithms only consider the relationship between features and classification categories but ignore the mutual influence among features. Instead, the mRMR feature selection algorithm applied in this paper considers not only the amount of information the features provide about the categorical attribute but also the influence of the interaction among features on classification [37].

MI can weigh the quantity of information shared between feature variables X and Y; furthermore, it can measure how much information a feature provides for classifying the activity. Therefore, MI can show both the ability of each feature to identify the activities and whether there is redundancy among features. Based on these traits of MI, two criteria can be derived to evaluate the features: redundancy and relevance.

3.3.1. Redundancy Measure

The redundancy measure utilizes the MI between features. If the MI value is large, there is a large amount of information duplication between the two features, i.e., redundancy between them. A lower value of the redundancy measure indicates a better feature subset. Utilizing the redundancy measure means finding the features with the minimal MI among all features.

According to the idea that the smaller the redundancy of information between features, the more beneficial it is to activity classification, which can be expressed by minimizing the MI among features, the minimal redundancy condition is

$$\min R(S), \quad R = \frac{1}{|S|^2}\sum_{f_i, f_j \in S} I(f_i; f_j), \tag{9}$$

where $|S|$ is the number of features in the feature subset S and $I(f_i; f_j)$ is the MI between features $f_i$ and $f_j$.

3.3.2. Relevance Measure

The relevance measure utilizes the MI between a feature and the target activity. If the MI value is small, the correlation between the feature and the target activity is weak; on the contrary, a larger MI value means that the feature carries a greater amount of information for classifying the activity. Therefore, it is necessary to select the maximal MI between the features and the target activity; the maximal relevance criterion can be expressed as

$$\max D(S, c), \quad D = \frac{1}{|S|}\sum_{f_i \in S} I(f_i; c), \tag{10}$$

where c is the target activity and $I(f_i; c)$ is the MI between feature $f_i$ and the target activity c.

Feature selection based on the maximal relevance measure composes the optimal feature subset by selecting the features with the largest MI to the target activity.

3.3.3. Algorithm Designing

The mRMR feature selection algorithm is based on the relevance and redundancy measures described above. It combines the relevance between the features and the target activity with the redundancy among the features [33, 37]. According to (9) and (10), the influences of the relevance measure and the redundancy measure are both taken into consideration in feature selection. The mRMR feature selection algorithm has two evaluation criteria, which optimize D and R simultaneously:

$$\max \Phi(D, R), \quad \Phi = D - R, \tag{11}$$

$$\max \Phi(D, R), \quad \Phi = D / R. \tag{12}$$

Supposing that a feature subset $S_m$ composed of m features has already been selected from the full feature set F, the next step is to extract the optimal (m+1)th feature from the remaining features according to (11) or (12), through

$$\max_{f_j \in F - S_m}\left[I(f_j; c) - \frac{1}{m}\sum_{f_i \in S_m} I(f_j; f_i)\right], \tag{13}$$

$$\max_{f_j \in F - S_m}\left[I(f_j; c) \bigg/ \frac{1}{m}\sum_{f_i \in S_m} I(f_j; f_i)\right]. \tag{14}$$

The incremental procedure of the mRMR feature selection algorithm is as follows (a sketch of the procedure is given after these steps):

Step 1. In the original feature set F, select the optimal feature by $\max_{f_i \in F} I(f_i; c)$ and put it into the optimal feature subset S;

Step 2. From the remaining features F − S, select the next optimal feature, which satisfies (11) or (12);

Step 3. Repeat Step 2 until the optimal feature subset S meets the required size.
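A compact sketch of this greedy procedure under both criteria, reusing the empirical MI estimator from Section 3.2.3; it assumes discretized feature values, and the small epsilon guarding the D/R division as well as the function names are our own additions:

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum((c / n) * math.log2((c / n) / ((px[x] / n) * (py[y] / n)))
               for (x, y), c in pxy.items())

def mrmr(features: dict, target: list, k: int, criterion: str = "D-R") -> list:
    """Greedy mRMR selection (Steps 1-3): `features` maps feature name to its
    list of discrete values (one per instance), `target` is the activity label
    per instance, and `k` is the requested subset size."""
    relevance = {f: mutual_information(v, target) for f, v in features.items()}
    selected = [max(relevance, key=relevance.get)]        # Step 1: max I(f; c)
    while len(selected) < k:                              # Steps 2 and 3
        best, best_score = None, -math.inf
        for f in features:
            if f in selected:
                continue
            # Mean redundancy of candidate f against the already selected set
            red = sum(mutual_information(features[f], features[s])
                      for s in selected) / len(selected)
            score = (relevance[f] - red if criterion == "D-R"
                     else relevance[f] / (red + 1e-12))   # eq. (13) / eq. (14)
            if score > best_score:
                best, best_score = f, score
        selected.append(best)
    return selected
```

Running this with the 13 discretized features and the activity labels would produce a feature ordering analogous to those in Tables 4 and 5, up to differences in MI estimation.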

4. Experimental Results

4.1. Training and Testing Activities

A total of 10 activities were performed in the CASAS smart apartment by 2 volunteers to provide training data for the NB classifier and HMM. These activities include both basic and more complex ADLs that are found in clinical questionnaires. These activities are the following:

(1) Bed_to_toilet (activity 0): transition between bed and toilet during the night (30 instances)

(2) Breakfast (activity 1): the resident has breakfast (48 instances)

(3) Bed (activity 2): the activity of sleeping in bed (207 instances)

(4) C_work (activity 3): the resident works in the office space (46 instances)

(5) Dinner (activity 4): the resident has dinner (42 instances)

(6) Laundry (activity 5): the resident cleans clothes using the laundry machine (10 instances)

(7) Leave_home (activity 6): the resident leaves the smart home (69 instances)

(8) Lunch (activity 7): the resident has lunch (37 instances)

(9) Night_wandering (activity 8): the resident wanders during the night (67 instances)

(10) R_medicine (activity 9): the resident takes medicine (44 instances)

The data have been collected in the CASAS smart apartment testbed for 55 days, resulting in a total of 600 instances of these activities and 647,485 collected motion sensor events. Threefold cross validation is applied to the data for the NB classifier and HMM under the same conditions to ensure that the experimental comparison is fair [38].
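For reference, a sketch of such an evaluation with scikit-learn; the paper does not state its NB implementation, so the Gaussian NB below is only a stand-in, and `X` is assumed to hold one row of selected feature values per activity instance:

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.naive_bayes import GaussianNB

def threefold_accuracy(X: np.ndarray, y: np.ndarray) -> float:
    """Mean accuracy over a 3-fold cross validation, one fold held out per run."""
    folds = StratifiedKFold(n_splits=3, shuffle=True, random_state=0)
    scores = []
    for train, test in folds.split(X, y):
        clf = GaussianNB().fit(X[train], y[train])
        scores.append(clf.score(X[test], y[test]))
    return float(np.mean(scores))
```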

4.2. Feature Selection with Maximal Relevance Measure

First, the maximal relevance (MR) measure of the 13 features has been analyzed. Usually, MI is employed as the evaluation criterion: since a feature with a larger MI value has a closer relationship to the target activity, the MI values between the features and the activities must be calculated. By analyzing the relevance measure, the 13 features are ranked according to their MI values, as listed in Table 1; the features, sorted in descending order and added one at a time, compose 13 feature subsets [39].

It can be found that feature f10 has the largest MI value, which means that f10 is the most important feature for activity recognition and has the closest relationship to the target activity. On the contrary, f3 has the smallest MI value, which means that f3 is not discriminative for activity recognition and has the weakest link to the target activity.

Then, the 13 feature subsets under the MR measure are the following:

Feature subset 1: (f10)
Feature subset 2: (f10, f9)
Feature subset 3: (f10, f9, f6)
Feature subset 4: (f10, f9, f6, f8)
Feature subset 5: (f10, f9, f6, f8, f7)
Feature subset 6: (f10, f9, f6, f8, f7, f1)
Feature subset 7: (f10, f9, f6, f8, f7, f1, f12)
Feature subset 8: (f10, f9, f6, f8, f7, f1, f12, f4)
Feature subset 9: (f10, f9, f6, f8, f7, f1, f12, f4, f11)
Feature subset 10: (f10, f9, f6, f8, f7, f1, f12, f4, f11, f5)
Feature subset 11: (f10, f9, f6, f8, f7, f1, f12, f4, f11, f5, f2)
Feature subset 12: (f10, f9, f6, f8, f7, f1, f12, f4, f11, f5, f2, f13)
Feature subset 13: (f10, f9, f6, f8, f7, f1, f12, f4, f11, f5, f2, f13, f3)

The activity recognition accuracy rate can be divided into two categories: the individual activity recognition accuracy rate (IARAR) and the total activities recognition accuracy rate (TARAR), defined as

$$\mathrm{IARAR}_i = \frac{N_i^{\mathrm{correct}}}{N_i} \times 100\%, \tag{15}$$

$$\mathrm{TARAR} = \frac{\sum_{i} N_i^{\mathrm{correct}}}{\sum_{i} N_i} \times 100\%, \tag{16}$$

where $N_i$ is the number of instances of activity i and $N_i^{\mathrm{correct}}$ is the number of correctly recognized instances of activity i.
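In code, assuming lists of true and predicted activity labels per instance (the helper names are ours):

```python
def iarar(true_labels: list, predicted: list, activity) -> float:
    """Individual accuracy, eq. (15): correctly recognized instances of one
    activity divided by the total number of instances of that activity."""
    total = sum(1 for t in true_labels if t == activity)
    correct = sum(1 for t, p in zip(true_labels, predicted)
                  if t == activity and p == activity)
    return 100.0 * correct / total

def tarar(true_labels: list, predicted: list) -> float:
    """Total accuracy, eq. (16): correctly recognized instances over all instances."""
    correct = sum(1 for t, p in zip(true_labels, predicted) if t == p)
    return 100.0 * correct / len(true_labels)
```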

The activity recognition accuracy rates with the 13 feature subsets of the NB classifier based on the MR measure are shown in Table 2, and the corresponding results for HMM are shown in Table 3.

From Tables 2 and 3, it can be seen that the 13 features affect the classifiers' recognition accuracy rates differently. For both the NB classifier and HMM, TARAR varies across feature subsets. Generally, the activity recognition accuracy rate increases with the size of the feature subset, and TARAR eventually stabilizes as the subset grows. However, the trend is not monotonic: for the NB classifier, TARAR does not improve from feature subset 6 (88.8%) to feature subset 7 (88.7%), which introduces feature f12. This means that not all features contribute positively to activity recognition and that different combinations of features have different effects on the recognition accuracy rate.

It can also be found that the maximal TARAR of the NB classifier is slightly higher than that of HMM. With feature subset 12, the TARAR of the NB classifier reaches its maximal value of 90.3%, while for HMM the maximal TARAR is 88.0%, with feature subset 13.

Furthermore, the trends of the IARARs of the NB classifier and HMM are similar as well; i.e., IARAR increases with the number of features in the feature subset up to a certain size. Again, the trends are not monotonic. For example, for activity 1, the IARARs of the NB classifier and HMM rise with the size of the feature subset. For the NB classifier, the optimal IARAR of 100.0% is reached with feature subset 8 and then drops as the subset grows further; for HMM, the optimal IARAR of 93.8% is reached with feature subset 6 and then likewise drops. Moreover, the IARARs of activity 1 show that the NB classifier and HMM respond differently to the introduction of the same feature, such as f4: comparing feature subsets 7 (97.9%) and 8 (100.0%), the NB classifier benefits from this feature, whereas the recognition performance of HMM does not improve, decreasing from 89.6% to 83.3%. Even with the same feature subset, the recognition accuracy rates may differ considerably between the classifiers, as can be seen for individual activities, e.g., activity 0 with feature subset 1 (0.0% vs. 90.0%) and activity 2 with feature subset 3 (88.4% vs. 39.1%).

Furthermore, Table 2 indicates that the relatively better feature subset differs for each activity; e.g., for the NB classifier, feature subset 10 yields the best result for activity 3, with an accuracy rate of 76.1%, while feature subset 2 yields the best result for activity 2, with an accuracy rate of 100.0%. With feature subset 10, the IARARs of activities 3, 4, 5, and 6 are better than or equal to those of the other feature subsets; the proportion is 40% of all activities. However, with feature subsets 12 and 13, the IARARs of activities 0 through 2, 4, and 6 through 8 are better than or equal to those of feature subset 10; the proportion is 70%. Therefore, the optimal TARAR (90.3%) is obtained with feature subsets 12 and 13. Relatively, features f3 and f13 are not discriminative for activity recognition with the NB classifier.

For HMM, Table 3 shows that feature subset 10 generates the highest proportion of optimal IARARs over all activities, 40%: with feature subset 10, the IARARs of activities 4, 5, 7, and 9 are better than or equal to those of the other subsets. However, feature subset 13 yields the optimal TARAR of 88.0%, slightly higher than that of feature subset 10 (87.3%). Again, it can be concluded that feature f3 is not discriminative for activity recognition with HMM.

4.3. Feature Selection with mRMR Algorithm

The mRMR feature selection algorithm can be adopted to sort the 13 features and reduce the dimensionality of the feature subset. This is done by first analyzing the relevance of the features to the target activities on the original feature set, removing irrelevant features and keeping those with strong correlations to the target activities, and then analyzing the redundancy of the selected features to further reduce the dimensionality of the feature subset.

According to (11), the information values of the different features are obtained; the iteration results under the D-R criterion are listed in Table 4. At the beginning, the MI equation is applied to select the first optimal feature, which is f10. In the next iteration, still according to (11), feature f6 is the optimal feature among the remaining 12 features. The rank of each feature is then obtained successively.

Finally, the order of the sorted features is f10, f6, f5, f12, f11, f1, f4, f7, f8, f9, f2, f13, and f3. Again, it can be found that feature f10 is the most important feature and that feature f3 is not discriminative for activity recognition.

Similarly, the information values of the different features can be obtained according to (12); the iteration results under the D/R criterion are listed in Table 5.

The order of the sorted features is f10, f6, f5, f12, f11, f2, f4, f7, f13, f8, f1, f9, and f3.

Although the orders of the sorted features under the D-R and D/R criteria are different, the first feature is still f10, the most important feature; the second feature is f6; and the last feature is f3, which is not discriminative for activity recognition.

According to the orders of the sorted features under the D-R and D/R criteria, 13 feature subsets can be obtained by successively adding one feature at a time to the previous feature subset.

The 13 feature subsets under the D-R criterion are the following:

Feature subset 1: (f10)
Feature subset 2: (f10, f6)
Feature subset 3: (f10, f6, f5)
Feature subset 4: (f10, f6, f5, f12)
Feature subset 5: (f10, f6, f5, f12, f11)
Feature subset 6: (f10, f6, f5, f12, f11, f1)
Feature subset 7: (f10, f6, f5, f12, f11, f1, f4)
Feature subset 8: (f10, f6, f5, f12, f11, f1, f4, f7)
Feature subset 9: (f10, f6, f5, f12, f11, f1, f4, f7, f8)
Feature subset 10: (f10, f6, f5, f12, f11, f1, f4, f7, f8, f9)
Feature subset 11: (f10, f6, f5, f12, f11, f1, f4, f7, f8, f9, f2)
Feature subset 12: (f10, f6, f5, f12, f11, f1, f4, f7, f8, f9, f2, f13)
Feature subset 13: (f10, f6, f5, f12, f11, f1, f4, f7, f8, f9, f2, f13, f3)

And the 13 feature subsets under the D/R criterion are the following:

Feature subset 1: (f10)
Feature subset 2: (f10, f6)
Feature subset 3: (f10, f6, f5)
Feature subset 4: (f10, f6, f5, f12)
Feature subset 5: (f10, f6, f5, f12, f11)
Feature subset 6: (f10, f6, f5, f12, f11, f2)
Feature subset 7: (f10, f6, f5, f12, f11, f2, f4)
Feature subset 8: (f10, f6, f5, f12, f11, f2, f4, f7)
Feature subset 9: (f10, f6, f5, f12, f11, f2, f4, f7, f13)
Feature subset 10: (f10, f6, f5, f12, f11, f2, f4, f7, f13, f8)
Feature subset 11: (f10, f6, f5, f12, f11, f2, f4, f7, f13, f8, f1)
Feature subset 12: (f10, f6, f5, f12, f11, f2, f4, f7, f13, f8, f1, f9)
Feature subset 13: (f10, f6, f5, f12, f11, f2, f4, f7, f13, f8, f1, f9, f3)

The selected feature subsets have been validated with the NB classifier and HMM to recognize activities. The TARARs of the NB classifier and HMM with each feature subset are listed in Table 6.

It can be found that although the D-R and D/R criteria are different, the feature subsets containing the same number of features are similar, or even identical, e.g., when the number of features is less than 6. Accordingly, the results of the same classifier with the same number of features differ only very slightly under the two criteria; if the number of features is smaller than 6 or larger than 11, the TARARs are identical.

Moreover, the performances of the NB classifier and HMM differ even with the same feature subset. Generally, TARAR increases with the size of the feature subset until the size reaches a certain value; still, the trend is not monotonic. For the NB classifier, the optimal results are obtained with feature subsets 9, 12, and 13 under the D-R criterion (90.3%) and with feature subsets 11 through 13 under the D/R criterion (90.3%); for HMM, the optimal result is obtained with feature subset 13 under both the D-R (88.0%) and D/R (88.0%) criteria. The results show that the NB classifier and HMM yield relatively higher TARARs with feature subsets 5 through 13, and that the TARARs of the NB classifier are slightly higher than those of HMM with feature subsets 8 through 13, but much higher with subsets 1 through 7, under the same criterion.

Basically, it can be concluded that both the D-R and D/R criteria can be applied to sort the features and then compose feature subsets effectively.

Tables 7 and 8 (under the D-R criterion) show that the optimal IARARs of activities 0, 3, 5, 6, and 9 are higher for HMM than for the NB classifier, while the optimal IARARs of activities 1, 2, 7, and 8 are higher for the NB classifier than for HMM. The optimal IARARs of activity 4 are the same for the two classifiers. Generally, the trends of the two classifiers are nearly the same as the feature subset grows; again, the trends are not monotonic. Moreover, it can be observed that different features have different effects on activity recognition. For example, feature subset 5 introduces feature f11 into feature subset 4; for the NB classifier, the IARARs of some activities, such as activities 0, 2, 3, 4, and 7 through 9, improve, whereas those of the other activities degrade or do not improve, as shown in Table 7.

HMM shows a similar behavior. It can also be found that, owing to the different performance traits of the NB classifier and HMM, adding the same feature has different effects on the IARARs. For example, introducing feature f5 into feature subset 2 to compose feature subset 3 clearly improves the IARAR of activity 0 for the NB classifier but degrades it for HMM. Introducing feature f11 into feature subset 4 to compose feature subset 5 degrades the IARAR of activity 9 for the NB classifier but improves it for HMM.

Furthermore, Table 7 also shows that, with feature subset 13, the IARARs of activities 0, 1, 2, 4, and 6 are better than or equal to those of the other subsets; the proportion is 50% of all activities. With feature subset 9, the proportion is 40% (activities 1, 5, 6, and 9). However, with feature subset 9, the IARARs of activities 1, 3, 5, 6, 7, and 9 are better than or equal to those of feature subset 13. Therefore, feature subset 9 yields the optimal TARAR of 90.3%, the same as that of feature subsets 12 and 13. Again, feature f3 is not discriminative for activity recognition with the NB classifier.

It can also be found from Table 8 that, with feature subset 13, the IARARs of activities 2 and 8 are better than or equal to those of the other feature subsets; the proportion is 20% of all activities. Nevertheless, feature subset 13 gives the optimal TARAR of 88.0%, slightly higher than that of feature subset 12. Moreover, with feature subset 12, the IARARs of activities 0, 1, 3 through 7, and 9 are better than or equal to those of feature subset 13; the proportion is 80%. Again, it can be concluded that feature f3 is not discriminative for activity recognition with HMM.

Under the D/R criterion, Tables 9 and 10 show the activity recognition accuracy rates with each feature subset for the NB classifier and HMM. The experimental results show that the performances of the NB classifier and HMM with the feature subsets obtained under the D/R criterion are similar to those under the D-R criterion.

For example, for the NB classifier, the optimal TARARs are 90.3% with feature subsets 9, 12, and 13 under the D-R criterion, the same as those of feature subsets 11, 12, and 13 under the D/R criterion. For HMM, the optimal TARAR is 88.0% with feature subset 13 under the D-R criterion, the same as that under the D/R criterion.

Table 9 indicates that, for the NB classifier, the IARARs of activities 1, 4, 6, and 9 with feature subsets 12 and 13 are better than or equal to those of the other feature subsets; the proportion is 40% of all activities. However, with feature subset 11, the IARARs of activities 1, 3, 5, 6, and 9 are better than or equal to those of feature subsets 12 and 13; the proportion is 50%. Feature subsets 11, 12, and 13 yield the same optimal TARAR of 90.3%. Again, features f3 and f13 are not discriminative for activity recognition with the NB classifier.

Table 10 shows that, for HMM, with feature subset 13, the IARARs of activities 1, 2, 4, and 8 are better than or equal to those of the other feature subsets. Feature subset 13 yields the optimal TARAR of 88.0%, slightly higher than that of feature subset 12. Actually, with feature subset 12, the IARARs of activities 0, 1, 3, 4, 5, 6, 7, and 9 are better than or equal to those of feature subset 13; the proportion is 80%. Again, features f3 and f13 are not discriminative for activity recognition with HMM.

Table 11 shows the comparison of the optimal IARARs and TARARs of the NB classifier and HMM under the three evaluation criteria (MR, D-R, and D/R), together with the required minimal size of the feature subset for each.

It indicates that although the feature subsets differ under the three evaluation criteria, the optimal IARARs and TARARs are similar for the same classifier. For example, for the NB classifier, the optimal IARARs of activities 0, 2, 4, 6, 7, and 8 under the MR criterion are the same as those under the D-R criterion; furthermore, the optimal IARARs of activities 2, 4, 6, 7, and 8 are the same under all three evaluation criteria, and even the optimal TARARs are the same.

Similarly, for HMM, the optimal IARARs of activities 0, 2, and 6 through 8 are the same under all three evaluation criteria, and the optimal TARARs are also the same under the three criteria. Additionally, the required minimal sizes of the feature subset, determined by the optimal IARAR of each activity and the optimal TARAR, are also similar for the same classifier. For example, for the NB classifier, the required minimal sizes of the feature subset for activities 1 to 5 and 7 to 8 under the D-R criterion are the same as those under the D/R criterion; moreover, for activities 2, 4, 7, and 8, the required minimal sizes are the same under all three evaluation criteria. For HMM, the required minimal sizes of the feature subset for activities 0, 2, 6, and 8 are the same under the three criteria; the proportion is 40%. Besides, the required minimal sizes of the feature subset for the TARARs are also the same under the three evaluation criteria.

Obviously, the maximal relevance measure and the mRMR feature selection algorithm (under the D-R and D/R criteria) are effective for feature selection for human activity recognition.

5. Conclusions

This paper has applied the maximal relevance measure and the minimal redundancy maximal relevance (mRMR) algorithm (under the D-R and D/R criteria) to select features and to compose different feature subsets from the observed sensor events for human activity recognition in smart home environments. The selected feature subsets have then been validated with the NB classifier and HMM to recognize human activities. The experimental results show that, as expected, not all features are beneficial to activity recognition: different combinations of features lead to different recognition results, and even the same feature subset affects the recognition accuracy rate of different classifiers differently. Feature f10 (the ending time of the current activity) is the most important feature for activity recognition, while feature f3 (day of the week) is not discriminative. Therefore, a suitable feature subset must be selected in advance; selecting an unsuitable feature set increases the computational complexity and degrades the human activity recognition accuracy rate. Moreover, it is important for activity recognition to consider both the relevance between features and activities and the redundancy among features. Generally, the maximal relevance measure and the mRMR algorithm are beneficial to feature selection and benefit the activity recognition of both the NB classifier and HMM.

Data Availability

The data were collected from the smart home testbed located on the Washington State University campus and can be downloaded from Dr. Cook's homepage.

Conflicts of Interest

The authors declare no conflicts of interest.

Acknowledgments

Thanks are due to Dr. Diane J. Cook whose positive and constructive comments helped to enhance the quality and presentation of this paper. Also, the authors gratefully acknowledge the data preparation work that was carried out by Dr. Cook’s graduate students.