Abstract

The recommendation engine is similar to the function of the product recommender in our real life, which provides great convenience for people to choose the appropriate decoration scheme in the process of interior design and decoration. A home improvement website or company can design a suitable recommendation algorithm to provide home improvement program recommendation services for users with decoration needs. After understanding the user behavior of the home decoration website, this paper proposes an interior design scheme recommendation method based on an improved collaborative filtering algorithm. The method designs a collaborative filtering algorithm that combines multilayer hybrid similarity and trust mechanisms. Fuzzy set membership function is introduced to correct users’ rating similarity, and users’ interest vector is extracted to calculate users’ preference for different types of items. The algorithm dynamically fuses those two aspects to obtain the mixed similarity of users; meanwhile, the user’s hybrid similarity and trust are fused in an adaptive model. Then, the user neighbor data set generated based on the overall similarity of users is used as a training set, taking the item scores and features into consideration. On the one hand, the users and the projects are taken into account as well. The final prediction score is more accurate, and the recommendation effect is better. The experimental results show that this method can recommend interior design schemes with high performance, and its performance is better than other methods.

1. Introduction

With the sustained and rapid growth of China’s national economy and the increasing per capita income, coupled with the strong promotion of China’s real estate industry, the demand of our interior design and decoration industry has remained strong. The rapid development of the Internet has also promoted people’s participation and demand on the Internet. The number of Internet users has increased to a considerable degree. Online shopping has penetrated into every aspect of our daily life. Amazon, Taobao, Jingdong and many other websites are relatively successful and mature e-commerce platforms, while home decoration e-commerce has not yet become mature. As the online shopping experience of home decoration is not high, users’ purchasing desire is not strong. While the consumption of home decoration is increasingly hot in recent years, it is relatively backward in e-commerce [1].

The domestic decoration market economy scale has exceeded 4 trillion yuan. The number of home decoration consumers is huge, but the proportion of online shopping in the total consumption of the home decoration market is relatively small. Home decoration websites record behavior logs of hundreds of millions of users [2]. However, the main task of this paper is how to automatically locate and push the home decoration scheme that users are interested in from the massive information.

A home outfit plan covers the design idea of the stylist. When it is accepted and adopted by decoration users, the plan will involve the interests of users themselves, furniture companies, decoration companies, designers, home decoration websites, and other aspects [3]. The server can record user behavior logs when users browse, click, or download home decoration solutions. If their favorite home decoration scheme from the growing user behavior log can be dug out and recommended to users, the decoration needs of users will be met and a good reputation be established, with more value being brought to home decoration websites and related enterprise chains.

The online behavior of the home outfit industry is moving towards scale gradually. Literatures [4, 5] have begun to design home decoration products based on user perception. Literature [6] proposes a new method to extract and retrieve furniture items from image databases and online websites containing multiple furniture items. This method supports finding similar items from the database. Consumers usually pay the most attention to the pictures, prices, and brand reputation evaluation of home improvement programs when shopping online for home improvement [7]. Poor user experience and the fact of not being able to find a satisfactory home improvement plan within a certain period of time are major weaknesses of domestic home improvement online consumption, which bring huge economic loss to home outfit network shopping. The recommendation system is a good solution. However, the problems of massive log processing efficiency and recommendation accuracy brought by the rapidly increasing number of users are important factors restricting the application of the recommendation system.

The concept of the recommendation system, which was gradually applied to the commercial field after a period of time, was first formally proposed by Varian and Resnick. Nowadays, recommendation has developed into a very popular research field [8]. Collaborative filtering algorithm is widely used in the research of recommendation system algorithm [9]. However, it is found that the traditional collaborative filtering algorithm still has many problems to be improved with the deepening of research [10]. Researchers have studied the improved algorithm. The key to the improved algorithm in Ref. [11] was the calculation of semantic similarity and the acquisition of similarity word frequency weight. Reference [12] proposed an improved metric. The method considers the ratio between the absolute score value and the number of common scoring items. Literature [13] proposed that the recommendation results could be more reliable by mining the potential relationship between rating users. By comprehensively analyzing the user’s historical behavior and item label information, literature [14] realized the recommendation of label probability without relying on the user’s item score. Literature [15] used genetic algorithm to find the best item list for users. Most of the objects studied by these recommendation algorithms are for music, movies and digital products, etc., but there are few recommendation system algorithms specifically for the consumption behavior of home decoration websites. And it is even rarer to apply the recommendation algorithm to practical applications. Literature [16] mentioned interior design recommendation technology based on collaborative filtering algorithm. The literature only gives a framework, but does not give the actual recommendation algorithm and application.

Aiming at the above problems, this paper applies the improved collaborative filtering algorithm to the design of interior design recommendation technology. This method provides accurate interior design recommendation service for users by integrating user behavior and project behavior. Interior design recommendation technology has low operational requirements for users and plays an important role in enhancing user experience. The results show that the proposed method solves the limitations of the current interior design project recommendation method, and it is a project recommendation method with high precision and high speed.

The collaborative filtering (CF) algorithm uses past behavior or the opinions of existing user groups to predict data and also uses neighborhood data similar to the current user or project to generate recommendations. The basic steps include data collection and processing, generation of a user-item score matrix, calculation of similarity, generation of the nearest neighbor, prediction of score, and generation of a top recommendation.

2.1. User-Based Collaborative Filtering Algorithm (UserCF)

Firstly, the score data set and the current user ID are entered to find other users with similar preferences to the current user. These users are called nearest neighbors. Then, the ratings of all items are ranked from highest to lowest using the ratings of the neighbor user-predicted items. Finally, the top items are recommended to the current user. The algorithm flow in this paper is shown in Figure 1.

Similarity measurement is the core of the collaborative filtering algorithm, including cosine similarity, modified cosine similarity, and Pearson similarity. Pearson correlation coefficient is used to calculate the linear correlation between two variables, and the formula is as follows:

where represents the similarity between users and . represents the set of goods jointly rated by users and , while and represent the scores on goods . and represent the average rating of the product by users and , respectively.

The scoring prediction formula is defined as follows:

where represents the average score of all rated products by target user . indicates the neighbor set of the target user. represents the neighbor user’s rating of item . represents the average value of ratings of the items rated by the neighbor users. indicates the similarity between the target user and the neighbor user.

Although the UserCF algorithm has been widely used in the field of recommendation, it also faces many challenges. For e-commerce websites, the number of projects is relatively stable, but the number of users is updated at a high frequency. When the number of users is much larger than the number of items, computing similarities between users becomes increasingly time-consuming and takes up more memory. The recommendation results generated by UserCF algorithms are poorly interpretable.

2.2. Item-Based Collaborative Filtering Algorithm (ItemCF)

Firstly, an item similarity matrix is constructed to describe the similarity between two items. nearest neighbor projects similar to the current project are found. The user score for each item not seen by the current user is then calculated based on nearest neighbors. Finally, the user’s ratings for all items are sorted from large to small, and the top items with the highest scores are recommended to the current user.

The basic steps of the item-based collaborative filtering algorithm (ItemCF) are processing data, generating user-item score matrix, computing item similarity, generating nearest neighbor, predicting score, and generating top recommendation.

Then, cosine similarity is used to calculate the similarity between the two items. In addition, based on the nearest neighbor projects acquired, the current user’s interest in the target project is predicted.

3. Interior Design Scheme Recommendation Based on Improved Collaborative Filtering Algorithm

The traditional collaborative filtering recommendation algorithm only generates recommendations based on the user’s historical score data. CF only considers scoring data, which not only is incomplete but also leads to cold start problems. This makes new projects or new users unable to recommend according to the collaborative filtering recommendation algorithm. At the same time, it will also bring serious data sparsity problems and affect the accuracy of recommendation results. Therefore, this paper presents an interior design scheme recommendation method based on an improved collaborative filtering algorithm. This method considers multidimensional mixed similarity, trust mechanism, and item scoring, which make the final prediction score more accurate and the recommendation effect better.

3.1. User Feature Extraction

For users or interior design projects, they all have their own characteristics. In the design process of the indoor project design project recommendation method based on the improved collaborative filtering algorithm, the user characteristics have a certain stability. For two users with similar characteristics, the probability of selecting similar interior design projects is higher. Firstly, the characteristics of interior design users are described, and then, they are quantified.

For the gender characteristics of interior design users, “1” represents male and “-1” represents female, while the gender and age characteristics of interior design users are quantified by “0-4,” where “0” represents under 30 years old, “1” represents 30-39, “2” represents 39-44, “3” represents 45-60, and “4” represents over 60 years old. The occupation and education level are also quantified by numbers, which will not be described in detail here.

3.2. Design of Multidimensional Similarity and Trust Measurement
3.2.1. Revise Score Similarity

In the recommendation system, the score of an item reflects the user’s preference for the item to a certain extent. At this point, the measurement of preference degree is determined by the user rating value. But the ratings are not an accurate representation of how much users like them. For example, a movie is scored on a 5-point scale. User gives a score of 4. At this time, the user’s preference for the movie cannot be accurately quantified in numerical form. It can only roughly predict the probability that user likes the movie with high probability. To solve the problem of measuring the degree of preference in this situation, this paper adopts the fuzzy logic method to enrich the single numerical score, so as to obtain the numerical quantification of user preference. The membership function definition of scoring triangle fuzzy set is shown in Figure 2.

The calculation of preference difference between any two users depends on the rating difference between them for the common rated items. The greater the difference in ratings, the greater the degree of preference difference between them. The preference difference can be calculated by formula (3). Based on this inference, the preference similarity of any two users calculated by using public items is represented by .

where and represent the fuzzy preferences of any two users and . represents the th element of vector . represents the dimension of the vector. represents the Euclidean distance between vectors. is used to represent the mean difference degree of the scoring preference.

In the calculation of the user rating similarity , user preference modified by introducing fuzzy mapping is combined with the Jaccard similarity of modified rating. This balances the dimensions of the user to the item and the proportion of items in the common rating among users. Finally, the user’s score similarity is calculated based on the user’s modified cosine similarity.

where . and represent the set of rated items for users and .

3.2.2. User Interest Similarity

The traditional collaborative filtering algorithm only considers the user’s project score value and ignores the similarity of the user’s interest to different project types. However, users’ interest similarity affects the recommendation accuracy of projects to a great extent. In the case of sparse data, there are few common scoring items among users, or even zero. At this point, user interest similarity evaluated based on project type can compensate for the impact of data sparsity to a large extent.

The calculation of the user’s interest degree is affected by the scoring value of different item types and the proportion of quantity. For example, it is known that the user is interested in comedy and romance movies by analyzing and calculating a user’s movie viewing history. Then, when a movie of a related type appears, it can be inferred that the user has a greater possibility to like the movie. The user interest vector is defined as the user’s preference degree for different types of items based on the above deduction. The composition of the interest vector includes the proportion of the scoring value and the proportion of the scoring quantity of each item type.

where represents the average score of a certain type of item. represents the score of a certain type of item. represents the number of scoring for this type of item. represents the collection of all types of items. represents a certain type of items. represents the number of scoring statistics for -type items. represents the number of all scores. represents the weight of interest.

At the same time, using the user’s interest vector combined with the cosine similarity calculation formula, the user interest similarity can be calculated for any two users and .

where represents the set of all item types.

3.2.3. Users’ Mixed Similarity

In summary, the similarity obtained by the two is merged to obtain the user mixed similarity. The calculation of the two similarities is affected by the sparsity of the data in different degrees: when the number of public ratings is small, the measurement of the rating scale is susceptible to the impact of the peak of the rating, resulting in large errors in the calculation results. The interest similarity obtained based on the item type is less affected by the peak score, so the weight of the latter should be increased at this time. When the number of public scoring items is large, the accuracy of the recommendation results calculated based on user scoring is higher, so the proportion of scoring similarity should be increased at this time. Therefore, the threshold of the number of public scores is set. According to the size, the balanced distribution weight of the two in the case of different data sparsity is adjusted to ensure the accuracy and flexibility of the algorithm.

where represents the user mixed similarity. represents the number of user scores. is the fusion factor. represents corrected score similarity. represents user interest similarity.

3.2.4. The Degree of Trust between Users

The analysis indicates that the trust relationship between users plays a vital role in daily life, involving all aspects. However, the description of trust is a relatively vague concept, whose cognition includes subjectivity, asymmetry, and dynamics. Traditional recommendation algorithms only consider from the point of view of scoring. But in real life, the recommendation accuracy obtained only by this method is difficult to guarantee. For example, some merchants falsify rating data to get more recommendations for their products. To reduce the impact of recommendation results on the accuracy of the scoring data, this paper introduces the trust mechanism between users. The higher the reliability between users, the higher the reliability of the recommendation data generated by them.

The trust degree between two users in the recommendation system is influenced by the interactive relationship between users. The more shared historical behavior users experience, the more trust they accumulate between each other. It is considered that there has been an interaction whenever the scoring behavior of the same item occurs between users. The interaction relationship between users can be obtained according to the proportion between the interaction times and their own historical scoring times.

where and represent the set of scoring items of users and , respectively. and represent the potential of the set.

The trust degree between users is affected by the interaction relationship and the interaction state. A successful interaction state is bound to strengthen trust between users. Similarly, the failed interaction state will also weaken the trust between users. This section takes the user satisfaction status as the standard to measure the success of interaction. If two users show satisfaction or dissatisfaction with a project at the same time, their interaction state will be considered to be successful. If two users have different satisfactions with the jointly evaluated items, their interaction state will be considered to be a failure. Satisfaction is calculated by comparing the scores of two users’ common items with their respective average scores. If the score is higher than their own average score, it means satisfaction; otherwise, it means dissatisfaction.

The interaction states between users are as follows.

where represents the statistical times of successful interaction. is recorded for each successful interaction. indicates the statistical number of failed interactions. We record for each interaction failure. The interaction relationship between users is calculated in combination with formula (11) to obtain the user.

In the above formula, the success or failure of the interaction is considered to have the same degree of influence on trust, yet the actual situation is different. In the case of interaction failure, the difference between the two users’ scores is much larger than that in the case of interaction success, which indicates that there is a great difference between the two users’ satisfaction with the common score item. Therefore, it can be argued that the negative effects of failed interactions are greater. In the revision of trust degree, the score difference is added as the penalty weight affecting the interaction balance to strengthen the negative impact of failed interaction. The improved trust between users is as follows.

where represents the trust degree of user to user . represents the set of successful interactions. represents the set of interaction failures. and represent the scores of user and user on item , respectively. Formula (9) satisfies the asymmetric principle of trust, that is, .

If the hybrid similarity between target user and target user is higher, and feels that is trustworthy, then is more likely to adopt ’s suggestion. Therefore, when establishing the neighbor user set , the hybrid similarity and trust degree are arranged in reverse order according to the value, and the top users with the highest comprehensive value are selected to establish the neighbor user set. The prediction score is generated.

In the formula, the value of is determined by the adaptive model, and the value range of is [0,1].

3.3. Improved Algorithm Based on the Item and Its Characteristics

Section 3.2 generates neighbor data according to multidimensional user scores and user characteristic data. However, the similarity of project features is not taken into account; besides, the recommendation accuracy is not very good. In this section, multidimensional user similarity and trust are used to generate user neighbor data sets as training sets, and item scores and features are taken into account to improve the traditional slope one algorithm. Users and items are taken into account, which make the final prediction score more accurate and the recommendation effect better. The calculation method of item feature similarity is similar to that of user feature similarity.

In the formula above, represents the number of attribute features of the item. represents the th attribute of item . represents the th attribute value of item . The similarity of item features is calculated as the method of calculating the similarity of similar user features.

Item similarity is also calculated by the Pearson formula.

In the formula, refers to the average score of all users’ nearest neighbors of user on item . is the number of recent neighbors for all items of project . refers to the score of user on item . refers to the score similarity of item and item .

Based on the recommendation set of the user’s nearest neighbor set, the item’s nearest neighbor set is calculated, and the recommendation is finally generated.

where represents the number of elements in. represents the average deviation between items and . represents user ’s rating of item .

3.4. Recommended Steps for Interior Design Schemes to Improve Collaborative Filtering Algorithm

Step 1: collect the relevant historical data of indoor project design and process it to remove some wrong data.

Step 2: extract the user features and project features of interior project design, which form the corresponding feature set. Then, mark all users and scoring data in the matrix.

Step 3: by introducing the fuzzy logic method, the user’s fuzzy preference is obtained by enriching the score difference between users. Then, the score similarity is modified based on fuzzy preference differences. The user modified score similarity is obtained by combining the Jaccard similarity coefficient.

Step 4: calculate the proportion between the mean scores of various types of items and the sum of the mean scores of all items, as well as the ratio between the scoring times of various items and the total scoring times of items. The sum of the two is calculated to obtain the user’s interest vector. Combined with cosine similarity, of user interest similarity is calculated.

Step 5: calculate the similarity between Step 3 and Step 4 based on different factors. The weight fusion is carried out by combining formula (8) to obtain the mixed similarity . Considering that only using the predicted score generated by for project recommendation may lead to the problem of single recommendation factor and poor recommendation accuracy, the trust relationship between users is introduced on this basis. The first users with the highest comprehensive value are obtained to establish the neighbor user set.

Step 6: use multidimensional user similarity and trust to generate user neighbor data set as the training set. Formula (17) is used to score items.

Step 7: arrange the predicted scores in descending order and recommend the top interior items with the highest predicted scores to the target users for the projects not scored by the target users.

4. Experimental Results

4.1. Experimental Data and Experimental Environment

Some interior design item data sets are used as test objects to test the performance of collaborative filtering technology in the interior design project recommendation method proposed in this paper.

In the researches on the collaborative filtering algorithm, most of them use the MovieLens website data set maintained by the GroupLens laboratory. This data set contains more comprehensive user rating information of movies covering all time points. And it is still being maintained at present. In this paper, the historical behavior data of users are processed into the same format with reference to the MovieLens data set. The data set of user rating on the design scheme, which is recorded by the icolor server of the company’s project background home decoration website for one month, is selected as the experimental data. The data set contains 12,829,401 scoring information of 16,427 home decoration design schemes by 13,512 website users, ranging from 1 to 5 points.

The experimental environment settings are as follows: (1) hardware environment: Intel 7 core CPU @ 2.69 GHz, 12 GB memory; (2) software environment: Win10 operating system with Java programming language.

4.2. Experimental Indicators

In most studies, the commonly used indicators are as follows, which are applied to measure the effect of the collaborative filtering algorithm conveniently.

The mean absolute error (MAE) is the average of the absolute values of errors. It uses the deviation between the item score predicted by the recommendation algorithm and the user’s actual score value to measure the accuracy of the recommendation. The range of this value is [0,1]. The smaller the MAE value, the higher the accuracy of the recommendation algorithm and the better the recommendation quality.

4.3. Analysis of Experimental Results
4.3.1. Determination of Interest Vector Weight

Interest vector weight α is mainly used to adjust the ratio between different types of items and the total score value and adjust the ratio between the number of different types of items and the total score items. By adjusting the proportion of the two, different interest vectors can be obtained. Then, interest similarity between users is calculated according to interest vector . During the experiment, increases with a step size of 0.1, and the value range is [0,1]. The neighbor set takes 10 steps and ranges from 10 to 60. Multiple groups of predicted scores are generated according to the calculation, and the MAE of the corresponding real score and predicted score is obtained. The statistical results are shown in Table 1.

It can be seen from Table 1 that MAE decreases with the increase of the number of neighbor sets, indicating that the recommendation accuracy is significantly affected by data sparsity. As can be seen from Table 2, when and , the average absolute error between the predicted score and the real score is minimum and reaches the optimal value. Therefore, the unknown parameter is 0.7.

4.3.2. Determination of Fusion Factor

The function of dynamic fusion factor is to dynamically adjust the proportion of two similarity algorithms in mixed similarity according to the difference of data sparsity. In the experiment, the number of neighbor sets and fusion factor is uniformly changed by step size, and the change curves of multiple sets of prediction scores and MAE are calculated. The experimental results are shown in Table 2.

It can be seen from Table 2 that as the number of neighbor sets increases, the MAE difference decreases gradually. When the set number is 60 and the dynamic factor is 0.9, MAE obtains the optimal value, so in this paper.

4.3.3. Comparison of Algorithm Performance before and after Introducing Trust Mechanism

The smaller the MAE value obtained by the two algorithms before and after the introduction of trust mechanism, the higher the accuracy of the algorithm. According to the optimal values and obtained in experiment 1, the neighbor user sets before and after the introduction of trust mechanism are obtained. The neighbor set number is incremented with the step size of 10, and two MAE groups corresponding to the two algorithms are calculated. The experimental results are shown in Figure 3.

It can be seen from the analysis in Figure 3 that with the increase of the number of neighbor users, the MAE change trend obtained by the algorithm after the introduction of trust mechanism shows a downward trend from sharp to slow compared with the algorithm before the introduction. It can be seen that this algorithm has higher recommendation accuracy when the data sparsity is high. With the increase of the number of neighbor users, the accuracy of the algorithm with trust mechanism is improved to a certain extent compared with the algorithm without the trust mechanism. This verifies that the trust mechanism is introduced to optimize the performance of the algorithm.

4.3.4. Comparison of Accuracy between the Proposed Algorithm and Other Algorithms

The accuracy comparison of several algorithms also adopts the method of comparative experiment. The comparative algorithms include the collaborative filtering recommendation algorithm with cosine similarity correction score in literature [17], the collaborative filtering algorithm based on improved time function and user similarity proposed in literature [18], and the collaborative filtering recommendation algorithm based on heuristic similarity measurement in literature [19]. MAE corresponding to each algorithm was calculated. The experimental results are shown in Figure 4.

The MAE of the proposed algorithm is smaller than that of the other three algorithms. The experimental results are analyzed as follows.

By introducing user interest, trust mechanism and item score, the algorithm in this paper has obvious advantages in the case of data sparsity compared with other algorithms. When , MAE of this algorithm is 0.7551. Compared with literature [1719], MAE of the proposed algorithm is reduced by 7.23%, 4.30%, and 2.45%, respectively. When the number of neighbor users increase, the recommendation accuracy of each algorithm is improved to varying degrees. When the number of neighbor sets , MAE of this algorithm gets the minimum value, reaching 0.7113. Compared with literature [1719], MAE of the proposed algorithm is reduced by 4.12%, 3.78%, and 2.37%, respectively. In conclusion, compared with other collaborative filtering algorithms, the recommendation accuracy of the proposed algorithm is significantly improved, and the performance improvement of recommendation accuracy is more obvious in the case of data sparsity.

5. Conclusion

Interior design, the quality of which is critical, is the current hot spot. Therefore, the recommendation method of the interior design item has become a highly concerned issue. Due to the shortcomings of current interior design recommendation methods such as large errors and long time consumption, an interior design project recommendation method based on an improved collaborative filtering algorithm was designed to obtain ideal interior design project recommendation results. The following conclusions can be drawn from the experimental data. (1) Due to the fusion of multilayer hybrid similarity, trust mechanism, and extraction of interior design item features, the algorithm in this paper can achieve an accurate recommendation of interior design projects, and its performance is better than that of other algorithms. This algorithm controls the error rate of interior design project recommendation within the scope of practical application, solving the problem of high error rate in current interior design project recommendation methods. (2) The interior design item recommendation method based on improved collaborative filtering technology can be applied to solving problems with similar characteristics in other fields, which has very wide application value. In the future research work, it is necessary to focus on the problem of starting the algorithm. Moreover, we will incorporate some intelligent algorithms such as deep learning, federated learning, and mobile edge computing techniques into the considered system [2027], in order to further enhance the system performance.

Data Availability

The labeled data set used to support the findings of this study is available from the corresponding author upon request.

Conflicts of Interest

The author declares no competing interests.

Acknowledgments

This study was supported by the Scientific Research Program of Shaanxi Provincial Department of Education: Research on the Innovative Application of Chinese Traditional Decorative Elements in Urban Public Space—Taking Xi’an as an Example (No. 19jk0896).