Abstract

The recommendation system is an active, personalized, and real-time technology platform proposed in the 1990s to solve the problem of information overload. The recommendation system can constantly adjust the recommendation results according to the real-time behaviors of users. In other words, if the user’s interest changes, the recommendation system can present different information to the user. Therefore, the recommendation system is the best way to solve the problem of information overload in entrepreneurial projects. Based on the ConvMF algorithm, this paper proposes an entrepreneurial project recommendation algorithm based on a deep neural network and matrix decomposition. A deep neural network was established for the extraction of the hidden features of entrepreneurial projects, and a convolution neural network was used to process the text description information of entrepreneurial projects. One-hot coding was used to process the regional characteristics and financing round characteristics of entrepreneurial projects, and word embedding was used to process the label features of entrepreneurial projects. The implicit features of users are extracted from the user’s rating matrix using matrix decomposition technology. Finally, recommendations are made according to the implicit characteristics of users and the items learned.

1. Introduction

Nowadays, China’s universities have become the front line of innovation. The government and schools encourage college students to carry out innovative and entrepreneurial practice activities, apply teaching theoretical knowledge to extracurricular practice, and guide students to cultivate the ability of independent thinking and problem-solving, which improve the quality of personnel training and increase the employment rate. In the process of participating in innovation and entrepreneurship projects, students have accumulated a lot of knowledge, experience, and project data [1]. However, at present, the resource pool of innovation and entrepreneurship projects in colleges and universities is huge, the utilization rate of resources is low, and it is difficult to search for resources. We study the commonly used recommendation algorithms and build a personalized recommendation model suitable for college students’ innovation and entrepreneurship resource library system, which can help users quickly find interesting resources [2].

The investigation of college students’ entrepreneurship mainly investigates their entrepreneurial purposes and activities, while the GUESSS project is a global research, which has positive homework for understanding college students’ entrepreneurship. The survey also covers a wide range, with about 200,000 students in 54 countries [3]. To study the factors of women’s entrepreneurship development, we first list the important determinants and create a matrix to show the relationship at each level in the process, and we then quantify and define the above factors. Through this research method, it is helpful to encourage women to have entrepreneurial intentions and careers [4]. There are roughly three kinds of factors that affect women’s entrepreneurship, namely, individual, environment, and, most importantly, politics. Nowadays, college graduates are more common, so it is particularly important to train and cultivate personal abilities and the motivation and ability to effectively participate in entrepreneurial activities during school. The entrepreneurial intention model for higher education came into being, which has a positive effect on improving students’ entrepreneurial intention. The model is implemented as a practical digital application, mainly in three aspects: skill awareness, entrepreneurial resources, and network [5]. In view of the research of college students’ entrepreneurial career and their adaptive behavior in occupations, this paper adopts the social cognitive model of career self-management [6]. We use 380 students who are about to graduate, based on the data, and use this model to verify and find the relationship between social cognitive career theory and adaptive exploration, which fills this unknown and has a certain influence on the education of career research and counseling in the future. It is helpful to study the time and type of individual entrepreneurship based on the opportunity to collect entrepreneurs’ experiences. Using the data of Danish university graduates and employees, individuals are tracked. Compared with those who never started a business in terms of time, experience, and type of starting a business, early entrepreneurs earn higher wages and suffer fewer losses than late entrepreneurs [7]. This shows that the time type of entrepreneurial experience determines the subsequent career trajectory and the uncertainty of employers’ value to it. Based on the data of college graduates and the theory of social cognitive career, this paper establishes a model of how different levels of contextual influence can help or inhibit the transformation of entrepreneurial intention into new entrepreneurial creation, which can evaluate the direct and larger background that affects the creation of new enterprises [8]. Entrepreneurship education for students is attracting attention from all walks of life, and it is also a factor affecting students’ entrepreneurial intention, and it is also increasing significantly in schools around the world. Through the in-depth study and review of relevant literature, as the basic background of knowledge, and then relying on narrative as a means of understanding and a form of communication, this research is positioned as a qualitative and explanatory phenomenological research [9]. With the advancement of education, rural students and young people are often not interested in engaging in agriculture as a career, MoMo, and prefer to work in cities. This paper studies this prominent phenomenon. Collecting 345 graduate students as samples, this paper analyzes the decisive factors of their ideas and intentions for agricultural entrepreneurship and shows that intention is the decisive factor, which is directly proportional to career planning and so on [10]. On the other hand, career planning, entrepreneurial ability, and agricultural family background (FB) can significantly predict agricultural entrepreneurial intention. This analysis has reference value for future teaching reform. Through investigation and study, college students’ interest in entrepreneurship is closely related to their gender and race. This paper analyzes and compares their different attitudes and motives and then judges their interest points. For example, immigrant boys are much more enthusiastic about starting a business than local boys, girls are generally lower than boys, and boys will be greatly influenced by external factors, such as family, friends, and economic interests; girls are more motivated to use “internal factors” such as their skills and abilities. Entrepreneurship is equally important for both men and women to be flexible and independent [11]. College students are an important and valuable human force in a country. What is often discussed in society is the employability of graduates and whether they are willing to start entrepreneurial activities. The professional willingness of social entrepreneurs should be spread and cultivated during their student years, and entrepreneurship can be used as the starting point of their career, which requires college students to have the opportunity to get in touch with the concepts and opportunities of social entrepreneurship [12].

2. Recommendation Process

This paper designs a recommendation algorithm for entrepreneurial projects based on a deep neural network and matrix decomposition. This algorithm integrates the content information of entrepreneurial projects into the recommendation algorithm. Figure 1 shows the overall flow chart of the algorithm shown in this chapter. As can be seen from the figure, the recommendation flow mainly includes the following parts:

(1) Data Processing. It mainly includes the following: (i) scoring, calculating the score data of the user’s entrepreneurial project according to the historical behavior data of the user’s entrepreneurial project; (ii) sampling negative samples to generate negative samples based on the historical behavior data of users; and (iii) generation of the scoring matrix.

(2) Recommendation Model Construction. The construction of the recommendation model is the construction of the entrepreneurial project recommendation algorithm based on a deep neural network and matrix decomposition.

(3) Recommendation Result Generation. Use the trained model to generate recommendation results for users.

3. Data Processing

3.1. Processing of Implicit Feedback Data

In the research of personalized recommendation, users’ interest in items is usually reflected by user rating data. For example, the well-known MovieLens dataset and Netflix Prize dataset have clear user movie ratings [13]. However, in many practical application scenarios, there is often no clear user rating data, such as ordinary e-commerce websites such as JD.com and Taobao and news websites such as Today’s Headlines and Baidu News. In this case, the user’s implicit feedback data is usually used to measure the user’s interest in the project [14]. User action data in the entrepreneurial project recommendation system is implicit feedback data. Next, the processing method of implicit feedback data is introduced.

Generally speaking, implicit feedback data is extracted from the interaction history between users and websites, for example, user clicks, browses, collections, and interaction time. For example, on e-commerce websites, users’ reading, clicking, shopping cart addition, collection, purchase, and other behaviors reflect users’ interest in goods [15].

In view of the research problem of this paper, the personalized recommendation system for entrepreneurial projects also has no specific user rating information. For users, the interaction history can be represented by Table 1: each row represents an action type (including reading, paying attention, and commenting, which are represented by 1, 2, and 3, respectively), each column represents an entrepreneurial project, and represents the number of rows of users on entrepreneurial projects .

However, this raw interaction history cannot be used directly and needs to be processed before it can be used. The data type in Table 1 is the number of times, which refers to the number of times users browse, pay attention to, and comment on items, which is implicit feedback data. On the other hand, it can analyze the attention degree of users to the project, which reflects the personalized preference degree.

Users’ interest in entrepreneurial projects is mainly reflected in two dimensions: the number of interactions and the types of interactive behaviors. First of all, in terms of the number of interactions, the more interactions the user has with the startup project, the more interest the user has in the project. For example, if the user browses item A five times and item B once, it indicates that the user is more interested in item A. Second, different interactive behaviors reflect different interests of users. For example, if a user comments on Project A and clicks Project B, it indicates that the user is more interested in Project A than Project B. Using the historical behavior of users, you can get the user score of starting the project. See Equation (1) for the calculation method: where denotes the number of comments made by user , denotes the degree of interest in comments, and the product of the different comment times and interest degree is the degree of interest in items in comments.

3.2. Negative Sampling

In a score-based recommendation system, users show their goodwill towards an item by scoring. For example, MovieLens and Netflix Prize datasets have user rating data on movies. If a user gives a movie a score of 5, it means that the user likes the movie. If the user does not like a movie, the user will give the movie a very low score. Therefore, the score-based dataset contains both positive samples (items that the user likes) and negative samples (items that the user does not like) [16]. For the recommendation system of entrepreneurial projects, the user’s action data is implicit feedback data. One of the characteristics of internal hidden feedback data is that there are only positive samples and no negative samples [17].

For the sampling of negative samples, Xiang Liang put forward the following sampling scheme in the book Practice of Recommendation System and achieved good results: (1)For each user, it is necessary to ensure the balance of positive and negative samples(2)For the negative sampling of users, select those elements that are very popular but have no user behavior

Generally speaking, websites will have columns similar to popular items, so popular items are easier for users to find. If the project is very popular and the user has no behavior, it can better indicate that the user is not interested in the project. For unpopular items, the exposure on the website may not be high, and users may not find them at all during browsing the website, so it is impossible to judge whether users are interested in the item. Therefore, aiming at the research problem [18], this paper adopts the sampling method proposed by Xiang Liang to carry out the negative sampling.

4. Recommendation Algorithm Description

As shown in Figure 2, the business project recommendation algorithm based on a deep neural network is constructed and decomposed into two parts: the business project recommendation algorithm based on a deep neural network and matrix decomposition. As an input element of a business project, the output is used to extract the implicit features of the project.

4.1. Deep Neural Network and Probability Matrix Decomposition Algorithm

The probability matrix factorization (PMF) algorithm is a matrix factorization algorithm [19]. The purpose of probability matrix factorization is to decompose the hidden characteristics of elements and users from the score matrix [20] and then introduce the probability matrix factorization algorithm used in this Table 1.

Assuming that there are users and items, the scoring matrix can be obtained. The goal of matrix decomposition is to calculate the hidden features of users and items and by using the scoring matrix . With the hidden features and of users and items, on the contrary, the unknown scores in the evaluation matrix are calculated by .

Firstly, the probability matrix decomposition algorithm assumes that the hidden feature vectors of users and items follow Gaussian distribution, and its expressions are shown in where represents the probability density function of Gaussian distribution, represents the mean value, represents the variance, and represent the implicit feature evidence of users and items, and represent the implicit feature vectors of users and items, and represents the unit matrix. and are obtained by matrix decomposition, which can better calculate the hidden characteristics of users and projects. represents the probability density function of Gaussian distribution.

In addition, it is assumed that the conditional probability of observation score data also follows Gaussian prior distribution, and the expression is shown in

The Bayesian formula can be used to obtain the postinspection probability of users and projects with hidden characteristics, as shown in

Assuming that is the parameter set of the deep neural network and follows Gaussian distribution and that is the characteristic input of term , the calculation method of term , , is shown in

In addition, denotes the deep neural network, anddenotes the random error and expresses the equation shown in Equation (7) according to Gaussian distribution:

Assuming that the parameter set also obeys Gaussian distribution, the probability distribution of is shown in

The conditional probability distribution of the implicit feature of the item is shown in

The probability distribution can be calculated by the Bayesian formula, as shown in

4.2. Extraction of Implicit Features of Entrepreneurial Projects

Entrepreneurial projects contain a lot of specific information, especially the description of commercial projects, brand names and signature characteristics of commercial projects, regional characteristics of commercial projects, and advantages of commercial projects, including structured data of project content such as regions, domains and labels, and structured data such as project description texts. When using the traditional word-back model to simulate entrepreneurial projects, there are certain boundaries. On the other hand, the word-back model has many limitations, such as not finding the context information of text data and extracting very little feature data. On the other hand, using the same method to process all feature data indiscriminately will also affect the recommendation results.

In order to extract more effective features from these content data, this paper combines the convolution neural network, word embedding technology, and one-hot technology to establish a deep neural network to extract hidden features of entrepreneurial projects. The convolution neural network is used to deal with the text description of entrepreneurial projects, word embedding technology is used to deal with the label data of entrepreneurial projects, and one-hot is used to deal with the regions and regional characteristics of entrepreneurial projects. Figure 3 is the structure diagram of the deep neural network established in this paper. Next, the network structure will be described in detail.

In Figure 3, the text input, region, domain, and label are of different data types or have different attribute requirements. Different inputs are the result of processing for the performance of attribute characteristics. Different input attributes in Figure 3 need to process different attribute values. After processing, all attributes will be converted into a unified processing attribute format. The input data of different attribute formats are unified into a unified data type after different processing, which is convenient for the application of the deep learning model.

4.2.1. Input Layer

The input layer inputs the characteristic data of the collated project, including label data input, text data input, company field data input, and regional data input (four parts). The input data is as follows.

Step 1 (item label data input). The number of item labels is between 3 and 10.

Step 2 (text description data entry for business projects). Text description data must separate words and enter them into the network in word order.

Step 3 (regional data input). Regional data input is a single discrete value, such as Beijing and Shanghai.

Step 4 (domain data input). The domain information of the company is a sequence value, which can contain one or two values.

4.2.2. Label Data Processing of Entrepreneurial Projects

Each label is represented by a -dimensional vector. The principle of the same vector is the same. The more similar the labels are, the closer the spatial distance is. Table 2 shows the 10 labels closest to “finance” and their similarity. In this paper, cosine similarity is used to calculate the similarity between different tags.

Because each item has multiple tags, the trained tag vector cannot be directly used as the input of the neural network. Assuming that the item has a tag set, then using the method of adding and averaging tag vectors, where , then the processed of the tag for each item can be expressed as where represents the number of tags for the project and represents the processing result in Table 3.

Finally, is input into the deep neural network as the feature of item tag information. Table 4 shows the content information of the item with item number 50000, and the tag vector display of the item can be calculated using calculation formula (15). Table 4 shows the 10 tags closest to the item tag vector.

4.2.3. The Processing Method of Text Description Data of Entrepreneurial Projects

In order to improve the accuracy of feature extraction of entrepreneurial projects, this paper uses a convolution neural network to process the file description information of entrepreneurial projects. The structure of the neural network containing text is shown in Figure 4. Next, the working mode of the text convolution neural network will be introduced in detail.

(1) Embedding Layer. An important concept of the embedding layer is embedded language, that is, distributed representation. The purpose is to use dense vectors to represent words to avoid the data sparsity problem of one-hot.

The purpose of the embedding layer is to transform the sequence of word segmentation results of text into the matrix and input it into the convolution neural network. The input of the embedding layer is in the order shown in

For example, “Today’s weather is fine” after the word segmentation order is as follows: “Today’s weather is fine”; this word order can be used as the input of the embedding layer.

In the embedding layer, each word is represented by a -dimensional vector (for example, 100 dimensions). After each word is represented by a -dimensional vector, the results are spliced together, and each text sequence input can be converted into a matrix (where represents the length of the sequence and represents the dimension of the word vector), which can be expressed by where represents the dimension of the word vector, represents the number of words in the text, and represents the word vector.

After the input text sequence is converted into the matrix, it can be used as the folding neural network input of text, in addition, how to quantify the words. In other words, there are two main ways to convert a word into a dense high-order vector: (a)Pretrained word vectors can quantitatively express the relationship between words, such as the similarity between words. Vectorization of words is also beneficial to the modeling of natural language processing problems and reduces the sparsity of data. Therefore, word vectorization is a research hotspot in recent years. In recent years, many algorithms represented by word vectors have been proposed, such as Word2Vec, FastText, and WordRank. Here, the Word2Vec algorithm is the most widely used, which was proposed by Mikolov and others. The algorithm includes the CBOW language model and Skip-Gram language model. Because this part is not the main content of this paper, it will not be explained here. The word vector training tool in this paper is Word2Vec(b)At the beginning of network training, the word vector is initialized randomly, and the word vector is always optimized and adjusted during training

(2) Convolution Layer. The main purpose of the convolution layer is to extract local features. Figure 5 folds a schematic diagram of the local field of view of a neural network with an input of the matrix, with a local perceptual range of . All inputs in each local perceptual field are connected to the same hidden neuron, and and hide the original output, as shown in

The convolution kernel of the text convolution neural network is , the number of rows per convolution of is variable, and represents the dimension of the word vector, which is constant.

Therefore, the convolution operation of the text convolution neural network can be expressed by where represents the activation value on a convolution kernel, is the offset term, and are the shared weights, is the activation function, and represents the input of the convolution layer.

After the above convolution operation, the output of the convolution layer can be expressed by

In this paper, using the ReLU activation function, the ReLU function can avoid the gradient disappearance problem and avoid falling into the local optimal solution.

(3) Pool Layer. In a convolution neural network, the pool layer follows the convolution layer closely. Its goal is to simplify the output of the convolution layer. In fact, this is a desampling operation. Pooling can reduce the size of data space, thus reducing the number of parameters and calculations.

The pool layer takes the output of the convolution layer as an input and converts the input into a fixed feature map. The most commonly used operation is max-pooling, which takes the maximum value in the local sensory field of a specific size so as to achieve the purpose of dimensionality reduction. The specific operation method can be expressed by

4.2.4. Regional Characteristics of Entrepreneurial Projects and Domain Characteristics of Entrepreneurial Projects

The regional characteristics and domain characteristics of entrepreneurial projects are coded by one-hot. Taking the region feature processing method as an example, assuming that there are region information, the region information of the project is mapped into a -dimensional vector. Assuming that the region set can be expressed as , the region information of the item can be encoded as

Similarly, the domain information of the project can be encoded in this way.

In addition, you can use one-hot encoding to handle lost data. There are a few missing values in the project area information. In this case, the missing value can be incorporated as a code directly into a one-hot encoding. For example, if there are regions, adding a “missing” region will become a region, and the region vector of each item will become a dimension vector accordingly.

4.2.5. Fusion Layer

In this paper, the horizontal stitching method is adopted directly, and the output of the text convolution neural network is directly stitched into region information, domain information, and label information as the input of the next layer, as shown in where represents the output of various features after corresponding processing.

4.2.6. Full Connection Layer

The combined features need to extract the abstract features of the project through the whole connection layer, and the calculation formula of all connection layers is as shown in where represents the activation function, represents the parameter set, and represents the offset set.

4.2.7. Output Layer

The purpose of the output layer is to transform the output of the previous layer into feature vectors contained in items with specific dimensions. The calculation method is shown in

5. Model Optimization

To optimize the above model, the optimization objectives are shown in

By optimizing formula (22), the optimal , , and can be obtained. In this paper, the gradient descent algorithm is adopted, and the first partial derivative of and in formula (22) is obtained so that it is equal to , and the gradient descent direction can be obtained, such as where represents the parameter set of the depth neural network, which cannot be solved directly by the above formula, but when and are determined, the depth neural network can be established.

The above optimization process can be described by Algorithm 1.

Input: : user rating matrix
: item attribute feature data
Output:
: user implicit feature vector
: implicit feature vector of items
Initialization:
 Regularization parameter:
 Maximum number of iterations: max_iter
 Dimension of implicit features: latent_dim
 Proof of implicit characteristics of randomly generated users and projects:
 Parameter set of the deep neural network:
 Calculate the loss value according to the formula:
For from 1 to max_iter do:
 For each in trainset
  Update
   Update
 End for
 Training DNN (deep neural network)
 Calculate the loss value according to the formula
 If
   Termination of iteration
End for

Using the algorithm in the third part, we can get the hidden features and of the optimal user and scheme and use and to predict the missing score information in the score matrix . The score of user on item can be calculated by

Calculate the user’s scores of all items, sort the predicted scores from big to small, and take the first results as the user’s top recommendation results.

6. Experiment and Evaluation

6.1. Data Acquisition and Data Processing

In this paper, a Python crawler is used to capture data from a network information platform. Different information needs to be processed separately, and the processing methods are as follows.

6.1.1. Missing Data Processing

After deleting the data of users with too little action, users with too much data, and entrepreneurial projects, there are 7762 entrepreneurial projects and 7765 users. With the implicit feedback data processing method, the sparsity of 248605 scoring data of users is 99.59%.

6.1.2. Data Processing of Entrepreneurial Projects

Among the entrepreneurial projects, only a few projects have a loss value. Table 5 is an example of entrepreneurial project information.

6.1.3. User Data Processing

The characteristics of the user mainly include the user number, the user area, the user profile, the area of interest of the user, and the purpose on the user’s website. Table 6 is an example table representing the characteristic data of the user.

As shown in Table 7, the loss rate of each field of user-specific information can be seen from the table whether the loss rate of user information is large or not. The main reason is that the user’s characteristic data is filled in by himself when registering the user, which is unnecessary information. Among them, the missing rate of the user profile is 75.83%, which is too large to be used. The missing rate of user areas, areas of interest, and purposes on the user’s websites is about 20%. Because these fields are processed using one-hot encoding, one-hot encoding has the characteristic of being able to handle missing data. Keep these three fields.

6.2. Evaluation Criteria

In this paper, two algorithms of the recommendation system, RMSE and , are used to evaluate the experimental results. RMSE represents the mean square error, which is mainly used to evaluate prediction problems. The calculation method is shown in

is often used in the evaluation of top recommendations. The calculation method is shown in

6.3. Data Preprocessing
6.3.1. Text Data Processing

The text introduction information and user description information of business projects need word segmentation.

6.3.2. Word Vector and Pretraining

Use Stanford Segmenter for word segmentation. Then, the wordERC tool provided by Python’s Gensim wrapper is used to train the word vector, and finally, about 2.7 million word vectors are obtained.

6.3.3. Feature Extraction

In regional information processing, the data is indeed encoded separately, and finally, there are 44 regions. There are 21 areas of concern to users. There are 9 purposes on the user’s website.

6.4. Experimental Results

In this paper, the PMF algorithm and ConvMF algorithm are selected and compared with the recommendation algorithm proposed in this paper. As can be seen from Table 8, RMSE is 9.83% higher than that of the PMF algorithm and is 2.97% higher than that of the CovMF algorithm. As can be seen from Table 8 and Figure 6, the DNN-MF algorithm significantly improves the other two algorithms in the recall indicator.

The recall rate of the DNN-MF algorithm from Figure 6 varies from 50-300, and the DNN-MF has an obvious upward trend, which is higher than the other two algorithms.

6.5. Influence of Adding Different Numbers of Features on Recommendation Results

In this paper, the DNN-MF model is experimented with adding various quantities of item feature information in stages, as shown in Table 9 and Figure 7.

From Table 9, the RMSE index of the DNN-MF1 model is increased by 1.73% compared with that of the ConvMF model. Compared with the DNN-MF1 model, the RMSE index of the DNN-MF2 model increased by 1.26%.

From the above experimental results, it can be seen that by adding more content features, the recommendation accuracy of the model can be effectively improved. This is mainly because the characteristics of project content can reflect users’ interests to a certain extent. For example, for the financing rounds of entrepreneurial projects, some users may pay more attention to the projects in the seed round, while others may pay more attention to the projects after Round A.

6.6. Influence of the Number of Implicit Features and the Length of the Word Vector on Recommendation Results

In this paper, the grid search method is adopted. Table 10 and Figures 8 and 9 are experimental results implicitly containing the number of features and the length of the word vector. It can be seen from Table 10 that if the number of features is 200 and the length of the word vector is 200, the RMSE value obtained is the smallest and the recommendation effect is the best.

As you can see from Figure 8, if the word vector dimension is 200, the RMSE value is the smallest. When the number of implicit features increases from 50 to 200, the RMSE value of the algorithm decreases continuously.

Figure 9 is a diagram of the relationship between word vectors of different dimensions and RMSE when the dimension of implicit feature is 200. As can be seen from the figure, the recommendation effect is best when the length of the word vector is 200, so the best length of the word vector is 200.

7. Conclusion

Firstly, this paper introduces the basic process of an entrepreneurial project recommendation system based on a deep neural network and matrix decomposition and introduces the internal feedback data processing method and negative sampling method related to this study. Moreover, this paper introduces the principle of the entrepreneurial project recommendation algorithm based on a deep neural network and matrix factorization proposed in this paper. We introduce the principle of probability matrix decomposition and the extraction of hidden features of entrepreneurial projects. Finally, the optimization method of the model and the generation method of recommendation results are introduced.

Data Availability

The experimental data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declared that they have no conflicts of interest regarding this work.