Abstract

Mexico’s National Institute of Geriatrics (INGER) is the national research center of reference for matters related to human aging. INGER scientists perform basic, clinical, and demographic research which may imply different scientific cultures working together in the same specialized institution. In this paper, by a combination of text mining, coauthorship network analysis, and agent-based modeling, we analyzed and modeled the team assembly practices and the structure of the knowledge produced by scientists from INGER. Our results showed a weak connection between basic and clinical research and the emergence of a highly connected academic leadership. Importantly, basic and clinical-demographic researchers exhibited different team assembly strategies: basic researchers tended to form larger teams mainly with external collaborators, while clinical and demographic researchers formed smaller teams that very often incorporated internal (INGER) collaborators. We showed how these two different ways to form research teams impacted the organization of knowledge produced at INGER. Following these observations, we modeled, via agent-based modeling, the coexistence of different scientific cultures (basic and clinical research) exhibiting different team assembly strategies in the same institution. Three virtual experiments were run in our agent-based model. The three experiments kept similar values to the collaborating dynamics of INGER in terms of average team size and probabilities of choosing incumbents and external collaborators. The only difference among these experiments was the value of homophily defined as the trend to collaborate with research studies from the same field (14% corresponding to the 46% and 79%). The main result of these experiments is that by modulating just one variable (homophily), we could successfully reproduce the current situation of INGER (homophily of 79%) and simulate alternative scenarios in which interdisciplinary (46%) and transdisciplinary (14%) research could be done.

1. Introduction

The National Institute of Geriatrics (INGER), founded in 2009 in Mexico, became one of the national institutes of health of the Ministry of Health in 2012, as an answer to the challenge posed by the aging of the Mexican population [1]. INGER is the national research center of reference for matters related to human aging [1]. Aging is a complex multidimensional problem that would require the development of interdisciplinary research in order to satisfy the demand of knowledge that provides solutions to the several health problems related to aging [24]. However, according to its official website [5] and its specific organization manual [6], INGER exhibits a traditional departmentalized organization in which research is mainly performed by three separated departments: basic research, clinical epidemiology, and demographic epidemiology. It is important to point out that this is the common model for Mexican institutions in which medical research is performed [7, 8].

Currently, one of the biggest challenges in medical research is the translation of basic research discoveries into clinical practice and societal outcomes. However, this is not an easy task as clinical and basic are quite different research cultures with different jargon research instruments and standards of scientific quality [9]. One of the key elements of the knowledge translation challenge is the assembly of interdisciplinary (translational) research teams [10]. Importantly, team assembly mechanisms have been described as fundamental elements that determine the structure of the collaboration networks and the performance of the teams [11]. This model considers three parameters: “team size, the fraction of newcomers in new productions, and the tendency of incumbents to repeat previous collaborations.” [11] More recently, Bakshy and Wilensky developed an agent-based model of team assembly dynamics [12] which is based on the previous work of Guimera et al [11]. However, this model does not consider the coexistence of different research cultures, such as clinical and basic research, which could have different team assembly practices and, more importantly, their members could be reluctant to collaborate with scientists outside their disciplinary field.

In this investigation, we set out to analyze and model the team assembly dynamics of INGER researchers and their impact in the knowledge produced by the institution.

2. Methodology

The present study used as a source of information, the scientific production (a total of 178 papers published between 2012 and 2017) of 21 INGER scientists (3 officeholders and 5 junior and 13 senior scientists), and 8 former INGER researchers. By a combination of text mining and coauthorship network analysis, we analyzed the relation between team assembly practices and knowledge produced at INGER. Also, we developed an agent-based model that allowed us to recreate the dynamics of team assembly at INGER and visualize alternative scenarios. We followed the below-given steps:(1)KH Coder, the software for quantitative content analysis (text mining), [13] was used to perform bimodal (authors and terms) correspondence analysis of terms contained in the title and abstracts of the papers. Correspondence analysis is an explorative multivariate technique that provides information on the data structure by summarising a set of data in a two-dimensional graphical form. In the plot, the closer the variables (authors and terms), the more similar they are.(2)A bimodal network model of papers and authors (INGER researchers and their collaborators) was built and visualized using Cytoscape, the open-access software for visualization and analysis of networks [14]. Papers on the bimodal network model were classified as basic research if their content was about phenomena at the cellular or biomolecular level, whereas papers classified as clinical-demographical research are studies at the individual or population level. Researchers were classified according to the department they belong to except if they were former INGER researchers or officeholders.(3)The bimodal network model was transformed in a collaboration network of authors which was then divided in years so the research teams of each year could be visualized. The average of team size and the proportion of internal and external collaborators were obtained for basic and clinical research teams.(4)An agent-based model of team assembly [12] was previously built in NetLogo [15] by Bakshy and Wilensky which in turn was based on the work of Guimera et al. about the mechanism of team assembly that could determine the collaboration network structure and performance [11]. This model does not consider the coexistence of communities (scientific cultures) that exhibit different team assembly strategies like biomedical and clinical researchers at INGER. Therefore, we modified the original model in order to have two coexisting breeds of scientists (basic and clinical researchers) whose team assembly strategies (the probabilities of choosing incumbents and previous collaborators) can be modulated independently. Moreover, a new system variable, homophily, was added. In our model, homophily is the tendency of scientists to collaborate with members of the same scientific culture (basic and clinical research). By modulating homophily and team assembly strategies, different scenarios can be simulated. It is important to mention that in order to provide robustness to out model, we added randomness to the variables homophily and team size. The model, which can be run in the open-source software (GNU General Public License) NetLogo, the code, and a more detailed description can be downloaded here: http://modelingcommons.org/browse/one_model/5676.(5)Three virtual experiments were run 100 times. Each experiment corresponds to different percentages of homophily (14%, 46%, and 79%) in order to generate multidisciplinary and interdisciplinary scenarios. The three experiments reproduce the INGER scenario in terms of team size and probabilities of choosing incumbent (internal) and new collaborators (external) collaborators. That is, the experiments respond to the hypothetical question of what would happen if INGER researchers would have different levels of homophily. Each experiment lasts 1500 steps (ticks), and the evolution of the team size and composition of the team were plotted.

3. Results

3.1. Content of INGER Papers and the Structure of the Collaboration Network

The correspondence analysis performed on the title and abstract of the 21 current INGER researchers generated two components (dimensions 1 and 2) that, together, explain just 33.5 per cent of the relations among variables (terms in title and abstract). However, the analysis provides useful information on the organization of the research activities at INGER. The correspondence analysis (Figure 1) displayed clinical (from individual to population level) researchers forming a cluster near the origin while basic (subindividual level) researchers are scattered in the plot. Moreover, the plot shows that INGER papers have been by far mostly clinical-epidemiological research over the years. 2014 is the year, according to the correspondence analysis, with more proportion of basic research papers. On the contrary, the co-occurrence network of terms in the title and abstract of INGER papers showed a primacy of demographic-epidemiological terms (Figure 1).

On the contrary, the bimodal network of authors and publications of INGER confirmed the results of the correspondence analysis: clinical and demographic researchers are strongly connected to each other at INGER, while basic researchers are scattered (Figure 2). As a matter of fact, biomedical researchers at INGER are more strongly connected to external authors than to INGER authors (Figure 2). Another important observation is that there is a well-defined leadership among clinical researchers. That is, the two most productive authors at INGER are also the most collaborative, and they are located at the core of the network model.

3.2. Clinical and Basic Research Team Assembly Profiles

The annual slices of the collaboration network (2012–2017) showed that clinical-demographic researchers and basic researchers had different team assembly practices (Supplementary Material). Clinical-demographic teams tended to be smaller than basic teams (the average team size was 4.78 and 7.48, respectively), whereas basic teams showed a smaller proportion of internal collaborators than clinical researchers (the average proportion of internal collaborators per team was 0.22 and 0.45, respectively) As a matter of fact, just 4% of basic teams included at least two INGER researchers, whereas 43% of clinical-demographic teams were formed by at least two INGER researchers.

3.3. Modeling of the Current Team Assembly Dynamics in the INGER and Generation of Alternative Scenarios of Multidisciplinary and Interdisciplinary Research

We successfully developed an agent-based model of assembly of interdisciplinary teams which can be run in the open-source software (GNU General Public License), NetLogo. The model can be downloaded here: http://modelingcommons.org/browse/one_model/5676. A schematic representation of the dynamics of the model is provided in Figure 3.

The virtual experiments that corresponded to different percentages of homophily successfully reproduced the INGER current stage (79% of homophily; Figure 4) and generated multidisciplinary (homophily 46%; Figure 4) and interdisciplinary (homophily 14%; Figure 4) scenarios. The three scenarios exhibited stability over the time (steps of ticks) in terms of number of teams with size over the selected team size and the compositions of the teams (basic and clinical members; Figure 5). In the case of the simulated INGER, there can be observed a giant component of clinical researchers and dispersed teams of basic researchers (Figures 4 and 5). With a homophily of 46% emerged a giant component that was formed by two clusters of clinical and basic researchers that were united by a small region (Figures 4 and 5). This is quite similar to a multidisciplinary research scenario. Finally, with a homophily of 14%, clinical and basic researchers were blended in a giant component (Figures 4 and 5).

4. Discussion

As far as we know, this is the first time that a combination of text mining, network analysis, and agent-modeling is used to provide a systemic view of the current situation of a research institution (INGER) and to generate alternative scenarios. This combination of analysis tools and modeling can be a fundamental element for the knowledge management of research institutions. By modulating one of the system variables, homophily, while keeping fixed the others, the different behaviors could be obtained which fit well with the definitions and empirical observations of multidisciplinary and interdisciplinary (translational) research [16, 17]. However, our agent-based model of multi-interdisciplinary research can be adapted to a plurality of medical research institutions like the rest of the National Health Institutes of Mexico, which share a similar institutional design [6, 7], by modulating the full set of variables. Moreover, this set of methodologies can be applied to the analysis and management of intramural research performed in the United States National Health Institutes.

One possible extension of our model would be the inclusion of a third class of researchers, translational researchers that could establish the connection between basic and clinical research. Currently, we are working on building a model that considers the coexistence of several research communities.

It is important to mention that our results showed how team assembly strategies could impact the knowledge produced by a research institution (Figures 1 and 2). This could be explained by the phenomenon of homophily whose “pervasive fact of homophily means that cultural, behavioral, genetic, or material information that flows through networks will tend to be localized. Homophily implies that distance in terms of node information translates into network distance.” [18] Therefore, a good management of team assembly practices could lead a better quality of the produced knowledge.

Finally, it is important to discuss how to transfer the results of the analysis and modeling of the team forming practices at INGER in order to improve the quality of the research produced by the institution. The great challenge of INGER and similar institutions is to reduce the levels of homophily. That is, clinical and basic researchers at INGER exhibited complementary trends in their team formation dynamics. Clinical researchers tend to form smaller teams and to collaborate more with each other, while basic researchers form bigger and more interinstitutional teams. By promoting the collaboration between clinical and basic researchers at the INGER, more complex and productive collaborating networks would emerge. This is not an easy task as interdisciplinary collaboration requires the construction of a shared jargon, a common knowledge base and undergoing a complex negotiation process among the different research cultures. In the case of INGER, a first step would be the promotion of workshops with the participation of INGER researchers and authorities aimed at building consensus on the research priorities of the institution, the main problems to solve, and the way the INGER scientist could collaborate with each other. In this regard, it has been proposed that the assistance of learning strategies combined with the use of ontology engineering could facilitate the construction of the interdisciplinary agreements [19, 20]. The generation of multiple scenarios at the request of the workshop participants by taking advantage of the flexibility of our model in NetLogo would be useful to the planning of the research activities in the institution.

5. Conclusions

The combination of data science and agent-based modeling can be a fundamental tool for understanding of the current situation of research institutions and to project possible scenarios. Therefore, this set of methodologies is a key source of information for strategic decision-making.

Data Availability

The INGER researchers data used to support the findings of this study are included within the article. The “assembly interdisciplinary teams” model used to support the findings of this study has been deposited in the Modeling Commons Repository (model number: 5676).

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This project was supported by a grant from the Secretaría de Ciencia, Ciudad de México SECITI 042/2018 “Red colaborativa de Investigación Traslacional para el Envejecimiento Saludable de la Ciudad de México (RECITES)”.

Supplementary Materials

Collaboration network model of INGER authors sliced by years (2012–2017) is shown. The authors were equally colored as in Figure 2. (Supplementary Materials)