#### Abstract

Scientific talents can make great contributions, including scientific breakthrough innovations and discoveries, and coordinate and guide the actions of many others, propelling the scientific knowledge frontier. We investigate international scientific talent migration from 2001 to 2013 with the quantitative method. The relationship between complex network and international talent migration is introduced. Considering most of talents migrate between some countries with good economy and innovation, the migration network including 37 countries is analysed. The countries are noted by nodes of the migration network, and the migratory flow of talents from one country to another country is viewed as the directed weight edge between the corresponding nodes. The discrete dynamics of talent migration under complex network is proposed. The unknown parameters of the proposed model are identified. The overall situation and time evolution of international talent migration from 2001 to 2013 are given from the discussion on the indicators of complex network. Furthermore, we study the talent migration flows in the view of obstacle factors. It is found that the great majority of talents migrate between developed countries and emerging economies from 2001 to 2013, and this phenomenon becomes more significant. The USA has attracted a great number of talents all over the world, and the country is also the ideal destination for talents who want to live or work in another country for more job opportunities, attractive payment, and better innovation environment. China and India begin to attract talents. Talents emigrate from more and more original countries. It becomes more convenient for talents to immigrate to other countries. The effectiveness of obstructs to migration has become weakening. For immigrating to a certain country, the obstacles have a relationship with the country’s innovation.

#### 1. Introduction

With the rapid growth of knowledge economy and economic globalization, the number of scientific talents working or studying in foreign countries becomes increasing. As mentioned by Beechler and Woodward [1], scientific talents are a kind of professional class who have strong desire to move. They are used to immigrating to other countries, which can give them secure environment, good job opportunities, attractive remuneration, sophisticated laboratory conditions, or other benefits. In another way, the frequent international migration of scientific talents also leads to new scientific communication, cooperative research, and chances for industrial development [2]. Moreover, scientific talents are also a kind of key factors for developing science and innovation [3]. The countries attracting a great number of scientific talents will be at the top of science and technology in the world. Hence, many countries are actively engaged in the global talent competition and introduce substantive policies to attract talents, intensifying the degree of international talent migration [4]. Therefore, the phenomenon of international talent migration is unavoidable. Then, investigation of talent migration and its causes is very important to attract more talents from other countries. It follows that more scholarly and policy attention has been paid to this topic.

The international migration of scientific talents is complex and multidimensional. The initial study of this topic just focused on the phenomenon of brain drain or brain gain, for example, in [3, 5–8]. The work [3] reviewed analytical and policy issues related to the international migration of talented individuals, examining the main types of talent who moved internationally, their specific traits, characteristics, and the implications of migration for original and host countries and for global development. In [6], the authors examined how Chinese- and Indian-born engineers were accelerating the development of the information technology industries in their home countries, initially by tapping the low-cost skill in their home countries and over time by contributing to highly localized processes of entrepreneurial experimentation and upgrading, while maintaining close tied to the technology and markets in Silicon Valley. According to [7], more and more the migration pattern changed from a blue-collar migration of low qualified workers to a white-collar migration of highly skilled professionals, and there was a substantial brain drain from 1991 to 2000. In [8], the authors analysed the importance of foreign talents for host countries, considered the determinants of international talent flows at the individual and firm levels, and sketched some important implications.

However, in these above results, only permanent migration was discussed through case analysis in empirical method. Furthermore, these results might not give an overall situation and might not be very convincing. Considering these imperfections, a lot of scholars and policy makers try to analyse international talent migration and its causes with the quantitative approach.

The most general quantitative method to investigate talent migration is according to bibliometrics to tracking international scientific migration, based on an analysis of the affiliation countries of authors publishing in peer-reviewed journals. For example, in [9, 10], the authors declared this approach was promising, and a bibliometric study of scientific migration provided significant outcomes. As discussed by Czaika and Orazbayev [11], they found an increasing diversity of original and host countries integrated in global scientific migration and significantly lower migration frictions for internationally migratory scientists compared to nonscientist migrants with the help of a quantitative assessment of global scientific migration over the past four decades based on bibliometric data. The authors in [12] established the networked model of international talent migration, investigated the topic with complex network analysis, and identified factors to explain international talent migration flows by multiple linear regression. They said that the share of migrants in population was the major negative factor for international talent migration, and the factors of host countries were more significant than original countries. The work in [13] proposed a framework model of international top talent migration and gave two approaches to identify unknown parameters. The result could be employed to analyse the overall situation of international talent migration and predict its development. Furthermore, according to the analysis of a novel estimation procedure based on a pseudo-gravity model in [14], migration to non-OECD countries accounted for 20% of all high-skilled migration and this migration comprised relatively large numbers of individuals from low income and the least developed countries in many regions of the world. To provide an initial and exploratory contribution to the analysis of the factors driving the international migration of talents, the work in [15] adopted an empirical gravity model to describe and analyse new aggregate, bilateral data on international scientist migration. In [16], determinants of international scientific migratory inflows were quantified from multiple sources using panel-data analysis techniques. In [17], the authors estimated international migration models for OECD countries based on a dual approach: using conventional econometric approaches such as panel-data regression and network-based regression techniques such as multivariate regression quadratic assignment procedures.

Moreover, complex network is a kind of helpful tools to study some social problems similar to international migration of talents, such as population migration and international trade. In [18], the international migration was noted as a weighted-directed graph where nodes were countries and links accounted for the stock of migrants originated in a given country and living in another country at a given point in time. In [19], the proposed novel human migration model managed to construct the -clique overlapping community structuring the common statistical features observed from distinct real social network and achieved a good trade-off between complexity and reality. In [20], a network model of human migration with migration cost including movement/translocation and training is introduced. The proposed model not only permitted migration of a class across regions but also allowed for class transformations within a region or across regions.

In our opinion, the biggest challenge of studying international migration of scientific talents is to obtain confidential data suitable for evaluation and measurement of its characteristics and impact. Furthermore, the difficulty becomes more serious due to existing complex circulation immigration and tracking trajectories of talents across borders. According to immigration statistics of population and working people, traditional method can just portray the migration flow of top talents, and the collected data is incomplete in some cases.

With studies extend, many questions about international talent mobility are emerging and hinged on the policy agenda of governments. For example, what is the relationship between countries in talent migration? What is the evolvement of international talent migration in recent years? To what extent is the effectiveness of obstructs in the international scientific migration of talents? The answers to these questions are very important for governments to initiate policies in order to attract talents all over the world, so that they must be explored with great conviction.

To deal with these questions, in this paper, international scientific talent migration is investigated in convincing way with the help of quantitative analysis. Talent migration is viewed as complex network, and the stock of talents in each country is given by a discrete dynamics. In addition, based on the existing data of talent migration from bibliometrics, the unknown parameters of the talent migration networked model are identified. Moreover, the evolution of talent migration is discussed under the proposed model. Finally, we assume the pattern of talent migration in the absence of obstacles and study the effect of obstacles in talent migration.

The remainder of the paper is structured as follows. Section 2 describes the data source and the normalized data processing. Section 3 describes the relationship between talent migration and complex network with discrete dynamics and introduces the method to identify unknown parameters in the proposed discrete model. Section 4 discusses the effectiveness of obstacles in international talent migration. Finally, Section 5 concludes the paper.

#### 2. Data Source for Investigating Talent Migration

Considering it is very difficult to obtain confidential data describing bilateral flows of talents between countries, especially the annual data, we focus on inventor migration as captured in patent applications. It can overcome many limitations associated with migrant stock data. The grouping of patent-inventing talents is more targeted than the wide spectrum of tertiary educated talents. In addition, inventors arguably have special economic importance, as they create knowledge, which realizes technological and industrial transformation.

Consequently, migratory data of talents employed in the paper come from “patent applicants,” which are extracted from applications filed under the Patent Cooperation Treaty (PCT). In addition, the PCT data contains bilateral counts of cross-border movements of “migrant inventors” for a long time span, with an exhaustive list of “sending” and “receiving” countries.

The PCT is an international treaty administered by the World Intellectual Property Organization (WIPO), offering patent applicants an advantageous route for seeking patent protection internationally. The treaty came into force in 1978; and there were 146 PCT contracting states in 2012. The PCT filing data covers a large number of countries over a long time span (from 1978 to 2012). In 2010, around 54% of all international patent applications went through the PCT system. By December 31, 2012, the total number of PCT applications stood at 2361455. Each PCT application includes the names of the applicant(s), agents, inventors, common representatives, and special addresses for correspondence. Given our interest in studying the migratory history of inventors, we only focus on inventors and applicant inventor records. This subgroup accounts for exactly 6112608 records. We observe both the nationality and residence information for 4928076 of the 6112608 records, a coverage rate of 80.6%. Considering research significance, representativeness, and data integrity, we choose the time interval from 2000 to 2012 in the database.

The PCT patent applications contain information such as the names and addresses of the patent applicant(s) (generally, the owner), as well as the names and addresses of the inventor(s). What is unique about the PCT applications is that, in the majority of cases, they record both the residence and the nationality of the inventors. In sum, the PCT records offer good coverage of inventor nationality and residence information and represent a promising data source for migration research. More detailed and careful interpretation of International Migration of Inventors database is available in [21].

#### 3. Complex Network of Scientific Talent Migration with Discrete Dynamics

In this section, the dynamical discrete model of scientific talent migration is established, and the unknown parameters in the proposed model are identified. After that, we draw the topology of talent migration between countries and propose its characteristics and evolution.

##### 3.1. Model Framework

In this section, we establish the networked model framework of international talent migration with discrete dynamics. Based on discussion of the relationship between complex network and international talent migration from original countries to host countries, talent migration is represented as a kind of complex network.

Complex network, which is noted by a topology, is an abstract representation of a group of nodes. The relationship between nodes is known as directed edges. The international migration of talents can be viewed as a kind of information transmission among nodes. It is natural and convenient to model international talent migration among countries by directed weighted topology. The authors in [22, 23] gave a more detailed and thorough interpretation of complex network and its applications.

Considering that we investigate the migration in a series of time, the symbol is used to note time. To describe migration in quantization as a directed topology, nodes are set as countries. It means that country is noted as node in the directed topology. In addition, the migratory channel from country to country is viewed as the edge from node to node . Next, we define a directed path (directed migratory flow) as a sequence of successive nodes starting at node and ending at node so that successive nodes are adjacent. Moreover, in binary topology, the weight of any edge is if the edge exists; otherwise, the weight is 0.

The number of migratory talents (patent applicants) from country to country is denoted by , and is the number of talents residing in country according to the PCT database at time .

Since we aim to compare talent flows of different years to obtain the evolution of international talent migration, annual data should be normalized.

For normalizing the initial value , the following method to get the normalization value is given bywhich follows that

always holds at any time, where is the number of countries considered in the migration network. If there is no talent from country to country at time , the edge does not exist, and , . It is straightforward to see that always keeps in the range [0, 100].

Note that the number of talents in country at year is just relative with the number of talents in countries having talents immigrating to country at year . Moreover, it is also assumed that the number of talents of country at year is the accumulation of migratory talents from other countries immigrating to country at year , including staying in country . We have the following discrete dynamics:where is viewed as the joint factors to drive talents migrating from country to country at year . In addition, in the networked model with discrete dynamics (4), is noted as the weight of edge to be identified next. Moreover, based on the normalization equation (1), we obtain .

##### 3.2. Parameter Identification

In the above section, the model framework is established, but the weight is still unknown. These parameters are the key coefficients to get characteristics of discrete migration network. In this section, we introduce the way to identify these unknown parameters.

Considering the number of talents in each country is normalized, we have that

always holds. Without loss of generality, for one certain node , it is obvious that

because the talent flow coefficient from original country to host country just depends only on the factors of country and country . Therefore, the unknown parameter is given by

It follows that the parameter reflects the proportion of talents from country to country to all talents in country at year . And (6) is equivalent to .

Moreover, to switch the dummy variables and , we have

Following from (4), we obtain

It is straightforward to see . Noting , the column sum of is ; that is, at any time.

#### 4. Topologies and Evolution of International Talent Migration

In this section, the topologies of international talent migration between 37 countries, including the USA, the UK, Germany, France, Sweden, Japan, Australia, Switzerland, India, China, Brazil, Russia, Mexico, Indonesia, and South Africa and other major countries, are drawn from 2001 to 2013 according to a series of the identified matrices . The overall evolution of international talent migration is discussed with the help of indicators of complex network.

Based on the identified adjacent matrices , if the number of talents in a country is represented by the size of the corresponding node and the number of migratory talents from the original country to the host country is measured by the width of a directed edge, the network topologies of international talent migration are drawn in Figure 1.

**(a)**

**(b)**

**(c)**

**(d)**

**(e)**

*Remark 1. *Figure 1 just supports intuitive visualization of talent migratory network with qualitative analysis. The following credible results about talent migration are obtained according to evolutions of network indicators in the quantization.

Based on the proposed model with the help of some indicators of complex network, the characteristics and evolution of international talent migration can be discussed. Moreover, considering the characteristics and realities of international talent migration, the following statements about topology and complex network may be different from their general concepts.

The first and most crucial definition of complex network is degree, including in-degree and out-degree given byrespectively. Degree represents the ability of the chosen countries pulling or pushing talents. It is obvious that out-degree is the sum of columns of , so that always holds for any node. The evolutions of in-degrees for the chosen countries from 2001 to 2013 are given in Figure 2.

From Figure 2, the in-degree of the USA is the maximum value. Germany is in the second position, and Switzerland, the UK, Japan, and other developed countries with high GDP per capita also have high values of in-degrees. The values of these countries’ in-degrees are stable, and their in-degrees are far from the USA. It is concluded that developed countries receive a great number of talents, and the number of talents residing in these countries is also very high, but the attraction of developed countries, except the USA, is stable continuously. The USA is still at the summit of the number of scientific talents all over the world.

Another outstanding point of Figure 2 is that the in-degree of China increases from 0.6222 to 1.4759—double increasing. It means that the ability of China to attract talents is strengthened very significantly because of the development of economy and attractive policies. In addition, the in-degrees of other countries, such as Indonesia and Greece, are relatively low, which means that these countries with low GDP per capita cannot attract many talents, and the number of talents residing in these countries is also very limited.

The next important indicator to study international talent migration under complex network is distance from node to node , which is defined by the minimum sum of weights of edges from to through a direct path. It is employed to measure the efficiency of information delivery in the network. Based on this idea, considering that the weights of edges represent the ability of attracting talents of host countries, the efficiency of talent migration network is defined by average distance in the following:where is a directed path from to . For talent migration, it describes the efficiency or the convenience for talents to go aboard in general. The smaller means that it is more convenient for talents studying or working to other countries. The average distance evolution of the talent migration network is given in Figure 3.

It is obvious from Figure 3 that the average distance becomes smaller over time, and the decreasing is significant, over 50%. It means that the efficiency and convenience for talent migration were improved from 2001 to 2013. Then, we maintain that a great number of obstacles to talent migration are removed considerably, giving impetus to the transnational activities of talents for other countries.

In the theory of complex network and graph, a clustering coefficient is a measurement of the degree to which nodes in a network tend to cluster together. Clustering coefficient is a local measure. Therefore, clustering coefficient of a node under undirected network is calculated by using following formula:where is the degree of node and is the number of edges between the neighbours of node .

In this paper, to investigate the network of international talent migration, we redefine this concept. For undirected binary topology, clustering coefficient is given bywhere is the number of edges connected to node under the undirected binary topology. For directed weight topology, clustering coefficient is given bywhere are the weights of at time , respectively.

The clustering coefficient of the entire graph is the average clustering coefficient of the entire network, which can be employed to discuss the density of edges in average. It is given byThe evolutions of clustering coefficients under directed weighted topology and undirected binary topology are calculated and drawn in Figures 4 and 5.

According to Figure 4, it is seen that the clustering coefficient under directed topology of international talent migration is close to 0.01 with volatility in range between 0.0095 and 0.0110 from 2001 to 2013, so that it is relatively stable, which follows that the density of the weighted topologies does not change significantly. However, from Figure 5, it is obvious that the clustering coefficient under undirected binary topology is increasing from 0.8023 to 0.8325, and the trajectory is monotonously increasing in general. It means that growing numbers of talents immigrate to other counties directly and conveniently because of the increasing clustering coefficient under undirected binary topology, which is in agreement with the result according to average distance. However, there also exists the stable density under directed weighted topology, so that most of international talent migration has been only concentrated on a small number of countries.

Moreover, to analyse which countries are critical in the talent migration, the evolutions of clustering coefficients of these countries are drawn.

According to the representation of clustering coefficient in talent migration network, it is obvious that there are the greatest numbers of immigrating talents in the USA, and the USA plays the most critical role in structure of talent migration network.

To analyse the heterogeneity of international talent migration, network structured entropy is introduced. Network is nonheterogeneous or called homogeneous if all of nodes have the same importance approximately; otherwise, the network is heterogeneous. Network structured entropy for in-degree is given bywhere . Because of , network structured entropy for out-degree is 1 at all times.

It is obvious that . In addition, if is close to 0, the network is heterogeneous; if is close to 1, the network is nonheterogeneous. Network structured entropy can be used to estimate whether the characteristics of international talent migration depend on one country or a small group of countries.

It is seen from Figure 6 that the variation interval of entropy for in-degree is from 0.2085 to 0.1888, far from 1, and the monotonous decreasing of this value is obvious . According to its meaning, the international talent migration is nonhomogeneous severely. Combined with the in-degree distribution shown in Figure 2 and the clustering coefficient distribution in Figures 6 and 7, most of talents usually migrate among the countries with high GDP, and this trend is becoming more and more obvious from 2001 to 2013.

##### 4.1. Effectiveness of Obstructs to International Talent Migration

Despite a growing international competition for talents, scientific migration is affected by the factors of obstructs certainly, as various economic, political, and professional factors continue to play a significant negative role in shaping international migration of talents. To assess the roles of obstructs, we need to construct a hypothetical counterfactual, which will answer the following question: how would migration patterns look like if there were no barriers to migration?

Following [24], we use a random-utility framework to examine what it would look like if obstructs to international talent migration were equal to zero. This is similar to the analysis made by Head and Mayer in [25], except for the fact that we analyse international talent migration instead of trade flows.

Assume the individual’s utility from this choice presented by the parameter identified in the above section is given bywhere is the attraction of host country to all original countries, is the barrier factor for talents emigrating from country , and is the specific random component for talent migration from country to country .

To assess the role of obstructs, it is assumed that the associated items do not exist, that is assuming . After that, compared with the results within and without the item , the effect of obstructs is given.

Assuming there are no barrier factors, holds. Hence, it follows from (3) that the number of talents in country at time becomes

Considering with normalization equation (1), we have

An intuitive interpretation of the above equation is, in the absence of obstructs, the proportion of migratory talents emigrating from origin and immigrating to destination ; that is, would be equal to the share of the number of talents residing in destination in the total amount of talents all over the world at the same time. Next, we denote

to represent the effectiveness of obstructs. This value can represent the difficulty of talents immigrating to country . For most cases, is larger than 1 in reality, but may also exist. The reason of can be explained as the barrier factor does not play a blocking role but encourages people to emigrate from country

The average of all countries from 2001 to 2013 and its evolution are calculated and drawn in Figure 8.

Figure 8 reports that there is a decrease of average from 2001 to 2013, which means that the effectiveness of obstacles is reducing during the same time.

However, through the deep analysis, we find that the obstacles for different countries are different. To show the difference, we plot the evolution of average from 2001 to 2013 for each country by the average number of residing talents also counting by the PCT in Figure 9. The -axis of Figure 9 is chosen as the logarithm (base 10) of to make the result clear.

With the number of residing talents increasing, the difficulty for talents immigrating to host countries is reducing in general, but it also maintains that after a certain value of the number of residing talents, the difficulty increases slightly. It is believed that the number of residing talents has a positive relationship with innovation ability, so that countries with high innovative capabilities have attracted talents in the world, and talents have the strong will to immigrate to these countries. Nevertheless, the countries having the largest amount of talents, such as the USA, may restrict entry for normal talents because of enough talents, and these countries just welcome top ones.

For investigating the obstruct effectiveness of some typical countries, including the USA, China, the UK, Germany, Australia, and India, the time evolutions of of these countries are drawn in Figure 10.

Figure 10 shows the largest value of is for the USA, the most innovative countries, also having the largest amount of talents. It is analysed that it is not easy for foreign talents to study or work in the USA. As mentioned earlier, the reason may be that there are enough normal talents, and most of talents want to go to the USA for better job opportunities, attractive remuneration, or sophisticated laboratory conditions. There is similar decreasing of for China and India, emerging economies. The reason is that talents immigrate to the two countries more and more conveniently. On the one hand, China and India need an increasing number of talents for innovation and development; on the other hand, the talents have chosen them as better destinations because of the good working and living conditions that the countries support. For developed countries in west Europe and Oceania, the values of are stable between 2.5 and 3.5, and the range of these numerical variations is small because the attraction of these countries is not very high.

#### 5. Conclusions

In this paper, we investigated international scientific talent migration from 2001 to 2013 in the quantification. The data source is employed from patent applicants under the PCT. Next, international talent migration is abstracted as a kind of complex network with discrete dynamics for nodes, and the relationship between them is also discussed, so that the proposed analysis makes results more credible, and the unknown parameters of the discrete model are identified. A series of these quantitative topologies of international migration are established, and time evolution of international talent migration is given by calculating the values of some network indicators. In addition, with the help of hypothesis, the effectiveness of obstruct is discussed.

Based on the analysis, we maintain that the overall situation of international talent migration is relatively stable, but there exist some small fluctuations from 2001 to 2013. The USA has attracted a great number of talents from other countries. Emerging economies, such as China and India, have been from brain drain to brain gain. We also find that it becomes more convenient for scientific talents to immigrate to other countries. In addition, for host countries, obstructs of talents immigration is more and more weakened with the increasing number of residing talents. However, there exists a turning point, and, after the value, the effectiveness of obstruct is more negatively significant in the host countries.

The future work should focus on analysing the key determining factors affecting talent migration data, such GDP per capita, innovation, and R&D expenditure. We will also attempt to study different types of migration of specific talents, such as young talents and top talents, and compare the migration laws for different types of talents. According to these analyses, some targeted policies could be offered to governments for attracting a great number of talents. In addition, there exist many advanced quantitative methods, which could be explicated to study talent migration, such as entropy-based measurement in [26], symmetry distribution in [27], and supervised text classification with a networked model in [28].

#### Data Availability

All data included in this article are available upon request to the corresponding author. There are no restrictions on data access. Also, the original data from Ref. [21] have been employed by many other works.

#### Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

#### Acknowledgments

This work was supported by the Youth Program of President’s Foundation of NAIS (no. 2020yzjj-019).