Abstract
Injection molding is one of the most significant material processing methods for mass production of plastic products. It is widely used in various industry sectors, and its products are ubiquitous in our daily life. The settings and optimization of the injection molding process dictate the geometric precision and mechanical properties of the final products. Therefore, sensing, optimization, and control of the injection molding process have a crucial influence on product quality and have become an active research field with abundant literature. This paper defines the concept of intelligent injection molding as the integral application of these three procedures—sensing, optimization, and control. This paper reviews recent studies on methods for the detection of relevant physical variables, optimization of process parameters, and control strategies of machine variables in the molding process. Finally, conclusions are drawn to discuss future research directions and technologies, as well as algorithms worthy of being explored and developed.
1. Introduction
The Society of the Plastics Industry (SPI) has reported that, in the United States, the plastics industry is the third largest manufacturing industry. In 2017, the US plastics industry accounted for $432.32 billion in annual shipments and directly employed nearly one million people [1]. Hence, the plastics industry has a large contribution to the nation’s economy. Plastics are now among the most widely used materials and their use covers the entire spectrum of industries worldwide [2]. Injection molding is regarded as the most important and efficient process used to manufacture plastic products. It accounts for approximately 80% of the plastic merchandise in the modern plastics industry [2], and as such, it is one of the important pillars of the manufacturing industry. As a highly complicated process, plastic injection molding can be divided into three stages: filling, packing, and cooling. During the entire process, the polymer is subject to large and dynamic changes in pressure and temperature. The whole process is complicated because the process variables are strongly coupled and hard to analyze accurately. For now, manufacturing of qualified products by injection molding still mainly relies on manual operation and trial-and-error methods [3–5]. Obviously, this traditional approach has disadvantages of low production efficiency, poor reliability and repeatability, and dependence on prior experience. Therefore, it is imperative and crucial to develop an advanced injection molding method that is science based and technology oriented.
Recently, much attention has been focused on intelligent (smart) manufacturing, which represents an in-depth integration of next-generation artificial intelligence (AI) technology and advanced manufacturing technology. It runs through every link in the full life cycle of design, production, product, and service [6]. Intelligent injection molding refers to the production process that employs AI technology—such as extracting information from production, computer optimization methods, and control strategies—to develop an online production optimization system. Through the comprehensive use of sensing, optimization, and control methods, the intelligent injection molding production process can increase production efficiency and product quality.
During the production process, when material and mold are preselected, an intelligent injection molding method is required to obtain high quality and stable production. As shown in Figure 1, intelligent injection molding contains three phases—sensing, optimization, and control—all of which are interrelated. Process sensing is first needed to realize real-time detection of variables from the injection-molding process and further on to diagnose and guide the manufacturing process. Moreover, optimal process parameters should be determined by process optimization since it is the key to obtaining high-quality injection molded products with high precision. Finally, sufficient machine control accuracy and repeatability—which means robust control over machine parameters—are required. Furthermore, advanced control strategies will help to conserve energy in manufacturing [7]. Integrating these three phases into an effective online quality control model, which is the section in the dotted-line frame in Figure 1, is one of the major goals of intelligent injection molding. It has attracted extensive attentions from many researchers in the field of injection molding.

In a 2005 review article, injection molding control [8] after process setup was classified into three levels—machine control, process control, and quality control. Due to the lack of quality sensors and the process and quality relationship model, there is a potential opportunity for advancement in the optimization and control of the product quality in injection molding. Moreover, with further development of science and technology in injection molding, and the fact that there has been very little review of process sensing, this paper aims to review and summarize the research on intelligent injection molding in recent years. It will focus on sensing, optimization, and control so that readers will be able to obtain useful information and an overview of the intelligent injection molding process. This paper is outlined as follows. Section 1 provides a brief introduction of intelligent injection molding. Sections 2, 3, and 4 discuss sensing, optimization, and control methods for the injection molding process and present up-to-date developments in these fields. The last section provides a summary of and future directions for intelligent injection molding.
2. Process Sensing
The sensing of the injection molding process is focused on the temperature, pressure, position, speed, etc., which reflect the physical state of the process. Temperature and pressure are the two fundamental physical variables that are the most important in the injection molding process. Furthermore, among various sensing technologies, temperature and pressure sensors are the most well developed and the most widely utilized. By means of sensing the temperature and pressure, one can obtain abundant information about the mold and polymer melt. With continuing technology development, some new methods have emerged in the field of injection molding. Some of them are capable of characterizing more variables besides temperature, pressure, position, and speed. This section will briefly introduce the conventional and emerging methods used in injection molding processes and the up-to-date research progress. Finally, a conclusion and analysis will be put forward and discussed.
2.1. Conventional Methods
Temperature and pressure are the most important variables in the injection molding process. As such, they have received the most attention. Therefore, traditional pressure and temperature sensors are maturely developed and have been the most important tools in measuring the melt state and in characterizing product quality (cf. Figure 2(a)).

(a)

(b)

(c)
2.1.1. Pressure Measurement
Pressure sensors come in mainly two types—resistance and piezoelectricity—which generate resistance changes and voltage changes, respectively, in response to the different pressure levels to characterize the value of the pressure. Commercial pressure sensors have advanced rapidly. In recent years, commercial pressure sensors have been widely used to study the correlation between melt pressure and various parameters of the injection molding process [12–15]. Furthermore, pressure sensors have also been utilized to study the correlation between the cavity pressure curve during the injection/packing stage and the final deformation of the molded part [16]. The results indicated that the pressure difference of the melt at two locations along the radial flow path was related to the part deformation, which could then be used to characterize the warpage of the molded product. Mao et al. [17] proposed a novel automatic feature learning method using cavity pressure data. The results showed that the proposed method could achieve higher classification accuracies and offer more optimal solutions for process monitoring. Based on these studies, online cavity pressure curves can be used to characterize injection process parameters and product quality. Since the value of cavity pressure that the pressure sensor detects usually turns out to be a one-dimensional (1D) value (in the normal direction), Heinle and Drummer [18] therefore proposed a standard three-way force sensor, which was stiffly connected to the measurement element in the cavity to measure the 3D force in a cavity. However, installing direct pressure sensors sometimes introduces product defects in the form of surface sensor marks.
In recent studies, researchers have put forward some alternative methods for cavity pressure measurements in order to maintain the integrity of the part and mold. Gim et al. [10] installed indirect pressure sensors under the lens core to monitor filling imbalance and short shots without causing product defects (cf. Figure 2(b)). Tsai and Lan [19] investigated the correlation between melt pressure and cavity pressure in different runner positions and identified the runner position where the local runner pressure represented the cavity pressure. In addition, product quality can be monitored by installing sensors at different runner positions. Guan and Huang [20] used a surface strain sensor and pressure sensors installed at the cavity and surface of the mold, respectively, and found that the cavity pressure was proportional to the mold surface strain through a data-based regression method. Using this correlation, the precise cavity pressure distribution could be obtained indirectly via mold surface strain data. Furthermore, some quality monitoring methods using other pressure sensors have also been proposed. Zhang et al. [21] proposed a statistical quality monitoring method for injection molding, which used only hydraulic pressure and screw position data obtained from built-in machine sensors. With the present method, the rate of successful fault detection was excellent. Zhou et al. [11] established a quality prediction model based on polymer melt properties to monitor product weigh variation online. The pressure integral, which was obtained by using the machine’s built-in injection pressure sensor, was used as an effective process variable to predict product weight variation (cf. Figure 2(c)).
2.1.2. Temperature Measurement
Thermocouples and infrared temperature sensors are the two most common sensors for online temperature detection in the injection molding process. A thermocouple is a kind of sensor that transforms thermal potential difference into electrical potential difference. According to the selected conductive material, it can be classified into S, B, K, E, T, and J types. Considering the cost and the working environment, J-type and K-type thermocouples are usually used. Thermocouples are widely used in the development and operation of rapid heat cycle molding (RHCM) [22–27], which is a process that rapidly changes the surface temperature of the mold during the injection and cooling stages for the purpose of producing products with special features, such as high-gloss surfaces, without prolonging the cycle time. Temperature sensors can feasibly monitor the temperature of the mold, cavity surface, and heat transfer medium (e.g., the coolant).
However, when detecting the melt temperature, the thermocouple can only measure the contact temperature at the contact point between the material (mold or melt) and the sensor, and this measurement is unlikely to be characteristic of the bulk temperature of the melt. Johnston et al. [28] developed an analysis that used the data from an in-mold thermocouple to predict the bulk melt temperature of a plastic. The heat flux through the mold steel was taken into account to calculate the overall melt temperature and the result was verified by an infrared (IR) temperature sensor. In another study, by measuring the instantaneous melt temperature, Yang et al. [29] investigated the crystallization and solidification kinetics during injection molding, which represented another usage of a thermocouple in injection molding.
Injection molding is known as a process that is accompanied by drastic changes in both temperature and pressure. In order to reveal more online process information during injection molding, the comprehensive use of various sensors has become a commonly used research approach. Gao et al. [30, 31] obtained online melt viscosity and speed through the combined use of a variety of sensors (thermocouples, infrared temperature sensors, pressure sensors, and multitemperature and pressure sensors, cf. Figure 3). The viscosity and speed were calculated from the measured values from those sensors. Mendibil et al. [32] monitored pressure and temperature transducers located in a runner system and microfeatured cavity and further achieved online quality inspection and quality test results from a confocal microscope.

2.2. Emerging Methods
To obtain melt information other than temperature and pressure, some emerging methods have been employed for injection molding process sensing. Such emerging methods include ultrasound, visualization molds, X-ray computed tomography, capacitive sensors, and magnetic levitation detection. These methods will be introduced as follows.
2.2.1. Ultrasound
Ultrasound waves are mechanical waves whose frequencies are greater than 20 kHz [33]. Ultrasonic technology is a promising method for the characterization of polymer processes as it can be applied in real time and online, as well as being nondestructive and environmentally friendly [34–36]. Ultrasound waves reflect at the surface of the media, and the attenuation and speed of the waves change according to the properties of the media. By studying the relationship between the attenuation and speed of ultrasound pulse-echoes and the physical properties of the polymer, this method has been applied in many studies for offline detection of injection molded parts [37, 38]. Similarly, ultrasound can be applied online for quality detection during the production process [39].
Based on the principle of the ultrasonic method, online detection of the material in the barrel and nozzle during injection molding can be realized. For example, the solid bed to melt pool ratio has been quantified using a noninvasive ultrasound system based on reflection measurements [40]. Automated analyses of the reflected pulses were also realized, which means that the plasticization process can be monitored online at different axial positions along the barrel. Praher and Steinbichler [41] measured the clearance between screw and barrel in the plasticization unit of an injection molding machine, where the wear status of the tribomechanical screw-barrel system and unsuitable process conditions could be detected. Moreover, besides the melt content and melt condition in the plasticizing process being detected by ultrasound, a noninvasive ultrasound tomography system was also proposed by Praher et al. [42, 43]. An ultrasound transmitter and five ultrasound receivers were employed, and five individual sound paths through the polymer melt were achieved (cf. Figure 4(a)). Thus, the temperature distribution in the polymer melt could be calculated by analyzing the transmission times of the ultrasound pulses through different sound paths. Ultrasound probe design strategies were also discussed, given that conventional ultrasound transducers cannot endure high temperature environments, such as the injection barrel.

(a)

(b)
Ono et al. made a great contribution in this area. Specifically, a high temperature ultrasonic transducer was invented and applied in microinjection molding [45], coinjection molding [46], and gas-assisted injection molding processes [47] for online polymer characterization. By installing high temperature ultrasound transducers at the nozzle location, Wu et al. [48, 49] revealed the flow characteristics of the polymer melt and the flow conditions at the nozzle. The melt flow characteristics at the nozzle, including the dynamic flow speed and density of the polymer melt, can be monitored online.
The detection of in-mold information by ultrasound reveals abundant information about the process and the properties of the melt during the injection molding process. The microcellular injection molding (MIM) process was also characterized by an in situ ultrasonic method [50] (cf. Figure 5). In this study, cell size, surface roughness, and skin layer thickness of the products were characterized by ultrasonic echo signal. Moreover, the detection of longitudinal waves characterized the injection process data and polymer morphological changes in real time, including the melt front arrival time, injection time, filling and packing stages, polymer solidification process, and morphological changes in the polymer crystallization [44, 51]. Shear waves in real time diagnosed Young’s modulus and shear storage modulus, as well as the anisotropic properties of the polymer in the injection molding process [44] (cf. Figure 4(b)). Although the methods and theory used here are quite similar to those used to detect plasticization, this part of the measurement is an online, nonisothermal process that is quite difficult to implement. The data is difficult to analyze as well.

(a)

(b)

(c)
Our research group proposed an ultrasonic methodology to measure cavity pressure during the injection molding process via the Gaussian process [9, 52]. Recently, we employed an ultrasonic method to measure the stress on the tie bars of an injection molding machine, therefore monitoring the service situation of the machine [53, 54]. We also established a physical model to estimate the correlation between the cavity pressure and stress on the tie bars and as such achieved an indirect measurement of the cavity pressure via the stress on the tie bars [55].
2.2.2. Visualization Molds
Because the molten polymer behavior during the injection molding process is carried out in a closed cavity, the molding process cannot be observed by the naked eye. Therefore, researchers often develop a visualization mold combined with a high-speed camera to observe the molding process [56–60]. Guerrier et al. [56, 57] developed a visualization mold by replacing part of the mold steel with glass to observe and verify the accuracy of a 3D numerical simulation of the injection molding process carried out by computer molding and obtained ideal results (cf. Figure 6). Jong et al. [58] added aluminum glitter powder to the melt to observe and track the flow trajectory of the particles in the melt to observe the fountain flow effect at the melt front. Jiang et al. also developed a visualized experimental device to observe the flow of the polymer melt of coinjection molding and ultrasonic-assisted injection molding process [59, 60].

2.2.3. X-Ray Computed Tomography
For injection molding parts, the researchers paid most attention to the mechanical properties of the products. However, the surface replicability and internal structure of the parts could be more concerned in a considerable part of the situation. These properties of the products have yet been difficult to be characterized and there is rarely related research. The application of computed tomography (CT) in industry is rapidly increasing. CT scanning is capable of evaluating the voxel-based and surface-based characteristic [61]. As being voxel based, it can serve as a visualization method and give the images of the inner structure of the work pieces. It is widely applied in the defect detection and fiber orientation distribution observation [62–64] of the injection molded parts (cf. Figure 7(a)). As being surface based, such as digitization and dimensional metrology [65], it can measure the surface profile and deviation of injection molded parts (cf. Figure 7(b)). Certainly, it requires accuracy and traceability. In addition, CT scanning still has its limitations; the faster cycle time and better economy need to be achieved for wider manufacturing applications.

(a)

(b)
2.2.4. Capacitive Sensors
Capacitance is a useful physical phenomenon for sensor design. It has been successfully applied for measuring the distance, area, volume, force, humidity, etc., in industry [66]. If two metal plates can be separately installed on the two mold halves and isolated from each other, they can form the two electrodes of a capacitor. Such a capacitive sensor for the measurements of the melt front position and melt front velocity has been proposed for injection molding process monitoring [66–69]. Due to the filling of the polymer melt, the capacitance between the two electrodes changes, resulting in a change of voltage on the capacitor. The principle and installation scheme of a capacitive transducer are shown in Figure 8(a). The ratio of polymer to air causes variations in the dielectric properties of the capacitor, further resulting in a change of voltage on the electrodes.

(a)

(b)

(c)
Capacitive sensors can detect not only the polymer melt front position and melt flow rate [69], but also the start and end of mold filling, the time of gate freezing, and overpacking [66]. The sensor outputs can be further correlated to online part weight prediction and also the measurement of the part solidification rate [68]. Combined with data-based regression analysis, this type of sensor could be used for online monitoring, part weight prediction, and check-ring failure detection for injection molding [67].
Based on the same principles as dielectrics, a rheodielectric sensor, also called a planar capacitor sensor, has been developed with interdigitated electrodes deposited on the surface of the substrate adjacent to the dielectric material [70]. Interdigitated electrodes are electrode strips arranged like the interlocking fingers of two hands folded together (cf. Figures 8(b) and 8(c)). This kind of sensor can be mounted on the cavity surface, where the voltage variation on the electrodes reflects the dielectric constant of the material, which is related to the shear stress and orientation of the material [71–73]. Therefore, the planar capacitor sensor is able to measure the shear stress and orientation of the material in the injection molding process.
2.2.5. Magnetic Levitation
Quality detection of final parts is an important component of injection molding production. In the field of materials science, there are several characterization methods including wide angle X-ray diffraction (WAXD) [74–79], nanocalorimetry [80], and X-ray tomography [62, 63]. These methods are generally and widely utilized for measuring physical properties of final products. They have excellent characterization accuracy, but the cost of equipment is usually very high.
A magnetic levitation detection method has been proposed by Mirica and colleagues in 2009 for measuring density [81], which also is called the MagLev method. The device consists of two identical permanent magnets set 45 mm apart, with identical poles facing each other. The sample is immersed in paramagnetic media and is then levitated at an equilibrium position along the centerline of the device (cf. Figure 9(a)). According to the levitation height, the density of the sample can be calculated. The accuracy of the device is estimated as 0.0002 g/cm3. Many researchers focus on developing MagLev technology and have done many works and achievements on this particular field [81, 82, 85–87]. By flattening the device (cf. Figure 9(b)), the ability to measure samples with large densities was achieved [83, 88, 89]. Moreover, our research group employed ring magnets to MagLev [90, 91] ensuring better visibility, accessibility, and operability of the samples.

(a)

(b)

(c)

(d)
The research by Subramanian and colleagues found that the levitation posture of a sample was correlated with the sample’s shape [92]. Hence, Hennek’s work explored the possibility of using the MagLev method for quality control of plastic parts [93]. Recently, our research group further analyzed the correlation between the position of the defects and the levitation posture of the sample [94, 95]. This method is further applied in the evaluation of washers [82] and injection molded lens [96] and subtle differences could be detected (cf. Figure 9(c)). Besides, based on its high accuracy, the MagLev method was also studied in separating and recycling different polymers via small differences in density [84, 97, 98] (cf. Figure 9(d)).
In summary, ultrasound is an outstanding and integrated method for online process sensing of injection molding as it is a nondestructive method that provides rich information. CT scanning can provide abundant product quality features. MagLev is a good emerging method for product defect detection. It has excellent measurement accuracy at a low cost, making it worthy of pursuing.
3. Optimization of Process Parameters
The injection molding process involves dozens of process parameters such as injection speed and pressure, packing duration and pressure, mold temperature, and cooling time. The effects of these parameters are coupled to each other, which makes it very difficult to set the process parameters and often relies heavily on the long-term experience of skilled operators.
Many studies have been conducted for the optimization of injection molding processes in order to reduce the time to market and to obtain consistent quality molded parts, and some methods have been put forward. Generally, these optimization methods can be classified into three categories. The first category involves identifying optimal process settings by noniterative analysis based on experimental or simulated results. It also involves analysis based on past experience. The second category establishes relationships between process parameters and product quality through experiments based on the approximate mathematical function obtained by a surrogate model. The third category uses a global search process in a given parameter area, as established by a surrogate model, using intelligent optimization algorithms that follow some rules in nature. These three categories are usually integrated when solving a practical problem and will be introduced in this section.
3.1. Noniterative Optimization Methods
Noniterative methods are generally based on experimental results and practical experiences. They can provide initial process settings or give a direction or route toward optimization. Noniterative optimization methods are easy to implement and require less computation. They have been widely used in research, including the following methods: case-based reasoning (CBR) [3], expert system, fuzzy system [99], and the Taguchi method [100–116].
When optimizing process parameters, implementing a set of real injection molding experiments [111] is a reliable way to gain product quality information under certain parameter settings. However, due to time, expense, and other reasons, a method using simulation analysis [101–104, 107–110, 117–120] is more likely to be used in research to characterize the impact of various levels of various parameters on the product quality. Some common software packages for simulation include Moldflow, Moldex3D, HsCAE, and ANSYS.
3.1.1. Taguchi Method
The Taguchi method is an effective approach for optimizing the throughput in various manufacturing-related processes. It has been extensively used in the engineering design and analysis of optimal manufacturing parameters [121, 122]. It provides an optimization framework according to experiments and data analysis. Its concrete steps can be divided into (1) determination of the experimental parameters and their levels, (2) design of experiments, (3) implementation of experiments, and (4) data analysis. By analyzing the quality of injection molded products with different parameters, the influence of the process parameters on product quality can be accessed by means of analysis of variance (ANOVA), range analysis, and analysis of signal-to-noise ratio (S/N). Finally, relatively optimal parameter settings can be determined.
Applying the Taguchi method, Kuo and Liao [100] and Lin and Hsieh [101] investigated the influence of parameters on the accuracy of lens geometry. Many other researchers have also applied the Taguchi method to optimize residual stress, volume shrinkage, and warpage properties [102–104]. Xie et al. [106] took four parameters into account and investigated the effect of key injection molding process parameters on the density and strength of a specimen. Wang et al. [108] extended the range of variables investigated to the number of gates, gate size, mold temperature, polymer temperature, fill volume conversion, V/P switch over, and cooling time. Analysis of the signal-to-noise ratio (S/N) and analysis of variance (ANOVA) were conducted and the best process parameters were found. Using this method, the compression strength increased 12% compared to the average of the experiment. Sánchez-Sánchez et al. [114] employed the Taguchi method to achieve the optimum level of ultrasonic-assisted injection molding parameters and to maximize the tensile strength of the ultrahigh molecular weight polyethylene (UHMWPE)/graphite composites; the tensile strength was increased by 8.8%.
The foundation of designing an experiment with the Taguchi methodology is the orthogonal array. By using an orthogonal array, a better design on sets of samples can be achieved and results can be obtained by running the simulation code or performing corresponding experiments.
An orthogonal array is a typical design of experiments (DOE) method. Besides the Taguchi method, it is widely applied in other optimization solutions for designing sets of samples. The practical process usually contains multiple influencing factors. The full-factor design includes all possible combinations of all factors and levels and as such it requires a large number of trials that take a lot of time and is too costly to implement. DOE, however, can significantly reduce the number of experiments needed while still revealing the main effects. DOE are almost exclusively used in iterative and noniterative methods. The orthogonal experimental design (OED) used in the Taguchi method is the most extensively used DOE method. In addition, there are many other DOE methods such as CCD, BBD, and LHD. Table 1 exhibits the most common DOE methods used in the literature.
3.1.2. Intelligent Methods
Case-based reasoning (CBR) is a procedure that uses an old solution to solve new problems and to evaluate new solutions for further application. CBR is a good method to determine the initial process parameters. The difficulty lies mainly in the description and definition of new problems and finding old solutions to match. The optimization results are strongly dependent on the case library.
Expert systems (ESs) are a subbranch of artificial intelligence that solve problems and give advice using the knowledge of human experts within a specific domain. ESs employ expert-level knowledge to achieve expert performance for the sake of solving specific problems. Their knowledge is both theoretical and practical; that is, the human experts who provide the system’s knowledge have generally augmented their own theoretical understanding on the problem domain.
Fuzzy systems are an approximate inference methodology based on the concept of fuzzy set theory. The fuzzy rules and their membership functions compose the system’s knowledge base [3, 4]. The fuzzification procedure converts the input variables into fuzzy values, while the defuzzification procedure transforms fuzzy results into crisp output variables. The fuzzy inference engine uses fuzzy rules during each evaluation cycle.
Expert systems and fuzzy reasoning are methods of modifying process parameters, combined with process parameters and product defects, based on some refined knowledge of the human experience. These methods can guide optimization [132] and optimize some product defects that cannot be quantitatively described, such as flash, flow lines, and pores.
Zhou et al. [4] adopted the idea of case-based reasoning (CBR) to determine the initial process parameters and discussed four case-adaptation strategies. The ideology of fuzzy inference based on expert knowledge and practical experience was utilized for defect correction and process parameter optimization. The fuzzy rules and membership function were discussed. Finally, an integrated intelligent optimization system based on CBR and fuzzy reasoning was constructed. A CBR adaptation strategy was employed to simplify the model into a combination of several basic geometries, and a simplified simulation model was applied to the CBR method to obtain initial parameter settings [5]. Furthermore, based on the previous work, Zhao et al. [3] introduced the empirical model (EM) into the optimization system, combined with the CBR method, to give the initial process parameters, and fuzzy logic algorithm was then combined to optimize the defects, and this system was effectively applied in practical production.
3.2. Surrogate Models
Surrogate models are not an independent method for optimization. They were originally used as a tool for fitting data and can be applied to the optimization process. The optimization results obtained by a noniterative method are often only a local optimal combination of parameters. In order to obtain a global optimal result, some surrogate models are needed to correlate process parameters and objectives because it is expensive and impractical if the techniques involve a time-consuming simulation for each function evaluation. Commonly used surrogate models include the response surface method (RSM), artificial neural network (ANN), support vector regression (SVR), Kriging model, and the Gaussian process. A survey of surrogate models for injection molding optimization in recent years is shown in Table 2. Some surrogate models, such as RSM and ANN, can be utilized in the noniterative method as an experimental data processing method, where a reasonable interval of parameters can be given as a process window. Generally, surrogate models serve as an approximation function for each iteration of the intelligent algorithm, which will be introduced in the next section.
The polynomial models generated by RSM are often referred to as response surface (RS) models in the literature. RS is a model building technique based on the statistical design of experiment (DOEe) and least-square-error fitting methods. When constructing an RSM model, regression analysis is performed. If the user has prior knowledge about the objective problem, the proper model can be chosen. Otherwise, several models need to be constructed from first to third or fourth order, with the best one being selected among the models according to the predicted errors. Heidari et al. [125] constructed an RSM to optimize the shrinkage and warpage of three polylactic acid (PLA)based bone screws. Using this method, they successfully obtained the minimum shrinkage and warpage of the product.
ANNs are widely accepted as a type of technology that offers an alternative way of simulating complex and ill-defined problems. They have been used in diverse applications and are particularly useful in system modeling similar to RSM. An ANN is made up of one or more hidden layers placed between the input and output layers. It has a computational structure that consists of a number of highly interconnected processing units called neurons. The neurons sum up weighted inputs and then apply a linear or nonlinear function to the resulting sum in order to determine the output. The ANN learns to approximate the functions through a training process and adjusts their weights and biases until a performance index reaches a preset threshold value. Kitayama et al. [128, 129] developed an injection mold with a conformal cooling channel, combined with computer simulation, using a radial basis function network for multiobjective optimization of the injection process. The cycle time improved by 26%, and the warpage improved by 14% using an optimal combination of process parameters. Jou et al. [112] and Oliaei et al. [113] optimized the injection parameters using response surface methodology and neural network methods, respectively, and a lower product shrinkage was achieved. Also, the Taguchi method was applied and compared with those surrogate models, and the results were in good agreement.
There are many other surrogate models that are utilized for constructing approximation functions based on experiments or simulation such as support vector machines (SVM) [143] and the Kriging model. They have similar roles in optimization and will not be specifically introduced here. These surrogate models serve an objective function for the characterization of the iteration results when conducting iterate optimization strategies, thus making the evaluation of iteration results less time-consuming [123, 130, 131, 133, 135, 136, 141, 144, 145].
3.3. Iterative Optimization Methods
As mentioned above, a simple optimization algorithm based on experiments and data analysis is unlikely to give a global optimal solution. In order to obtain better optimization results, iterative optimization algorithms are often utilized to obtain a convergent global optimal combination of parameters. Iterative optimization approaches can be classified into deterministic and stochastic approaches according to the methods of improving the optimal point within each iteration [132]. Stochastic approaches have been mostly employed for injection molding process parameter optimization and are generally referred to as intelligent optimization algorithms. These algorithms include the genetic algorithm (GA) [123, 133, 134, 146–155], particle swarm optimization (PSO) [99, 131, 136, 147, 151–153, 156, 157], simulated annealing, and hill climbing. These algorithms start with random variables and iterate for the next variables according to their own iterative principles until the optimal points are found. Iterative directions are determined similar to the evolution of natural phenomena. Surrogate models are applied to characterize the quality of the products under the parameters settings after every iteration. Global optimal parameter settings are expected to be found after several runs or iterations, and thus, they become a feasible approach for the optimization of injection molding parameters. However, the probability of finding the global optimal solution decreases as the problem size increases.
Genetic algorithms are search algorithms designed to mimic the principles of biological evolution in a natural genetic system. GAs can be used to solve difficult problems in terms of objective functions that possess “bad” properties, such as those that are multimodal, discontinuous, and nondifferentiable. These algorithms maintain and manipulate a population of solutions and implement their search for better solutions based on a “survival of the fittest” strategy. This provides an implicit as well as explicit parallelism that allows for the exploitation of several promising areas for a given solution space at the same time. Applying the genetic algorithm combined with an RSM model, Park and Nguyen [133] solved the multiobjective optimization problem of energy consumption and product quality. Li et al. [123] minimized warpage for fiber-reinforced composite injection molding by combining a GA with a BP neural network. Zhao et al. [146] employed the improved effective global optimization (IEGO) algorithm to approximate the nonlinear correlation between process parameters and quality indicators of a final product, and then the nondominated sorting genetic algorithm II (NSGA-II) was developed to find the combination of process parameters for the smallest warpage and volume shrinkage of plastic products.
The particle swarm optimization methods were first inspired by the biological and social behavior of bird flocks and fish schools [158]. This behavior was collectively referred to as swarming. When a swarm looks for food, its individuals will spread out in the environment and move around independently. Each individual has a degree of freedom or randomness in its movements, which enables it to find food. Sooner or later, one of them will find something digestible and, as being social, will announce this to its neighbors [159]. In PSO, the potential solutions, called particles, are analogous to birds or fish. These particles fly according to their own flying experiences and their companions’ flying experiences.
Applying PSO, Kramar and Cica [99] maximized the tensile strength of molded products. Zhang et al. [131] carried out a multiobjective optimization of injection molding process parameters of diesel engine oil cooler covers based on the ellipsoid basis function neural network. Xu et al. [136] developed a back-propagation neural network model to reflect the complex nonlinear correlation between process parameters and product mechanical properties. They found that the particle swarm optimization algorithm significantly improved the mechanical properties—and hence the performance—of the product.
There are many other evolutionary algorithms, such as simulated annealing algorithms and ant colony algorithms, which share many similarities with other GA and PSO models. They both initialize a population in a similar manner and search for an optimal solution by updating generations. They have been successfully applied in many research and industry applications.
Ideal injection molding process optimization results always come from a combination of using a variety of methods. This is summarized in Figure 10. Generally, it starts with a noniterative method like CBR to give an initial process setting and then the design of experiments (DOEe) technique is used to provide sets of samples. After performing the experiments and analyzing the influence of different parameters, a local optimal combination of parameters can be given like a process window. Furthermore, surrogate models can be employed to correlate process parameters and objectives. Hence, one can optimize the process parameters and obtain an optimized product quality with the application of intelligent iterative algorithms such as genetic algorithms and particle swarm optimization models.

For example, the GA approach was combined with BPNN [149], RSM [134, 150], and the Kriging model [148] as surrogate models. The optimal combination of process parameters was obtained and became stable. Also, the PSO algorithm was applied with BPNN to optimize the flash and volume shrinkage of injection molded parts [157]. Moreover, a GA/PSO hybrid approach was developed to serve as an iterative method [147, 152, 153] and with the assistance of BPNN or RSM the warpage was reduced by 38.6%.
The model-free optimization (MFO) algorithm is also an effective method. Johnston et al. [160] and Yang et al. [161] developed an online, model-free process optimization method and obtained ideal convergent results. A systematic approach that combined digital image processing (DIP) and model-free optimization (MFO) was proposed to solve optimization problems [161]. MFO uses measurements like DIP as feedback to determine the optimal settings, which is a typical online intelligent optimization approach.
4. Process Control
The control target of the machine includes the injection speed, shot size, screw rotation speed, cylinder pressure, injection pressure, packing pressure, packing pressure, barrel temperature, and coolant temperature. These quantifiable and easily measured variables are the most common control variables in modern industrial injection molding. The precise closed-loop control of these variables ensures high repeatability of the injection molded process and the resultant product.
The injection molding process is a typical cyclic batch process. Obviously, the control algorithm for the batch process is different from the continuous process. Due to the nonlinear and time-varying complex dynamic characteristics of the batch process, it is much more difficult than a continuous process. Therefore, the development of injection molding process control has gradually transferred from the traditional continuous-control strategy to control strategies that focus on batch-direction characteristics. The research group of Gao F has made great advances in this field [144, 162–175]. This section will introduce the application of two typical strategies in injection molding—traditional feedback control and iterative learning control—and summarize and analyze them. Finally, the latest relevant studies will be introduced.
4.1. Feedback Control
4.1.1. Proportional-Integral-Derivative (PID) Controller
The PID controller is the most standard feedback control algorithm and measures the controlled variable, calculates the error between the output and set point, and generates the controller output based on the proportional, the integral, and the derivative of the errors. In brief, a PID controller makes use of the error to adjust the output to obtain the set variables. Researchers have achieved control of mold temperature [176], servohydraulic systems [177], and the stability of product quality [143] using PID controllers.
However, traditional PID control does not work well for the injection molding process due to its batch operation nature and the nonlinear and time-varying characteristics of the process. Adaptive control is a good alternative for this kind of control. Its parameters are adapted in a certain way to conform to the nonlinear or time-varying process dynamics and provide good control performance. Yang and Gao proposed a self-tuning adaptive controller [162] for injection velocity control, and robustness and tracking performance were improved.
4.1.2. Model Predictive Control
Model predictive control (MPC) can obtain the control signal by optimizing future behavior through minimizing a cost function and making explicit use of a prediction model. It predicts the process output of future time over a certain period, called the prediction horizon. MPC calculates a set of future input for certain steps by minimizing a cost function, called the control horizon, so that the predicted output reaches the set point in an optimal manner. It is suitable for batch processes since the reference trajectories of the process settings are known before the cycle starts. This controller has been successfully applied in the injection molding process to control the screw speed and hydraulic back pressure [178, 179], coolant flow, coolant temperature [180], and cavity pressure [181, 182]. The group of Zhang et al. [166, 167, 170, 173, 183] designed fault-tolerant control through a predictive functional control framework for a batch process with unknown disturbances and partial actuator failures to achieve the desired closed-loop response.
Based on the fuzzy logic and neural network systems, establishing a precise model in process control can be avoided. If constructed from human knowledge in the form of fuzzy IF-THEN rules, it can easily perform multi-input, single-output, nonlinear mapping from a multidimensional vector to a scalar. This method has been used to control the injection speed and repeatability [184, 185]. Kanagalakshmi et al. [186] developed a multimodel control strategy that comprehensively included a variety of approaches, such as PID, fuzzy, and adaptive neurofuzzy inference system (ANFIS) control. The weighted average of these three strategies was utilized to control the barrel plasticizing temperature.
4.2. Learning Control
Injection molding is a batch process with a repetitive property. It is different from continuous process control where past control information can be used to improve current and future cycles. This kind of cycle-to-cycle improvement is similar to the human learning process; therefore, it is called learning control.
4.2.1. Conventional Iterative Learning Control
The batch process is usually described by two dimensions—the time dimension and the cycle dimension (batch dimension). The repetitive nature of the batch dimension can be used to guide the control output of the system. Generally, there are two ways to improve the learning control performance along the cycle direction: controller parameter learning and iterative learning control. Controller parameter learning adjusts and updates the controller parameters from cycle to cycle. Control signal learning control, also called iterative learning control (ILC), adjusts the controller output directly. The ILC method has been widely used in injection molding to achieve stable tracking of the time and batch sequence of the set point trajectory so as to improve process accuracy and repeatability [164, 169, 187, 188].
4.2.2. Two-Dimensional (2D) Control
Conventional ILC is only cycle-wise feedback control, with cycle-wise integral action that ensures performance improvement along the cycle direction. Two-dimensional feedback control is the strategy to ensure performance improvement along both time and cycle directions. The integral action along these two directions forms a 2D integrator cascade for eliminating the control error within and along the time and cycle directions.
The research on this control approach was also mainly studied by Gao’s group. They proposed a robust 2D feedback control combined with an iterative learning control (FILC) method for batch processes [165, 171, 172]. They further proposed iterative learning fault-tolerant control (ILFTC) for uncertain disturbances and fault conditions [168]. Then, a novel 2D predictive functional iterative learning control (ILC) scheme [175, 189] was developed, thus introducing a new model formulation and error compensation approach. They also proposed some hybrid methods that combined conventional MPC with ILC to yield two-dimensional time model predictive control [174], combined with ILC and predictive function control (PFC) [163].
Due to the nonlinear and time-varying characteristics of the injection molding process, conventional feedback control strategies are not very effective for batch process control. This is because the information in the batch direction is an important guidance for control strategies. The iterative learning control method is the earliest effective strategy. After that, by introducing the feedback control of the time dimension, one can achieve faster convergence and better tracking performance. Using control strategies involves a lot of expertise in control discipline. Readers who are interested can consult more detailed references [2, 190] in this field.
4.2.3. Reinforcement Learning Control
Reinforcement learning (RL) is the learning from the environment to behavior mapping, which aims at allowing the agent to learn how to behave in an environment through the interaction with the environment. Intelligent system through interaction with the environment achieves the maximum reward signal. Recently, with the combination of artificial neural networks (ANNs), RL has attracted more attention in solving practical control problems [191]. Reinforcement learning control (RLC) can effectively handle nonlinear and highly uncertain systems. It is a powerful algorithm for solving optimal control problems.
Ruan et al. [192] first proposed applying RLC to the field of injection molding control. They proposed a hybrid method that combined RLC with ILC to solve the motion control problem in injection molding. RLC is ideal for optimal control, but it is not suitable for precise tracking control. Therefore, by combining with other tracking control algorithms, such as iterative learning control, it can effectively solve the motion control problems in injection molding.
5. Conclusions and Prospects
This review introduces methods and strategies on the sensing, optimization, and control of intelligent injection molding and summarizes recent studies in these three areas. As for process sensing, conventional methods like pressure and temperature sensors are still indispensable, while emerging methods like ultrasound can obtain other important physical variables to enrich the information extracted from the process. In terms of parameter optimization, intelligent algorithms combined with surrogate models have gradually become the widely used optimization method and have delivered increasingly excellent results. In machine control, feedback control and learning control are currently the most effective control algorithms for typical batch processes. At present, these three areas have achieved considerable development. Furthermore, it can be concluded that the main research trend in the future lies in the integration of these three areas and in achieving reliable and real-time control of product quality in a more efficient and intelligent manner. However, there are still some problems that need to be further studied in order to achieve intelligent injection molding, including the following aspects.
5.1. Perception and Reconstruction of Multiple Physical Fields
There are many methods for detecting or diagnosing the physical parameters of the molding process. However, it can be found in the literature that the methods can only detect physical variables at some finite points. Some papers have suggested a data-fitting method to obtain the distribution of certain physical fields in the cavity. There is also an ultrasound “tomography scan” to construct the temperature field. But such methods are indirectly calculated and there are considerable errors. How to develop advanced sensing technology in the future and realize the perception of the physical field will continue to attract much interest, such as using a pressure detection film to detect the pressure distribution of the entire domain. In addition, it is still quite difficult to detect some microscopic physical variables online. A number of investigations applying ultrasound and dielectric methods to detect microphysical variables have emerged, but they are still in the early stages. Moreover, it is worth looking forward to the advancement in physics and modeling; it could directly enhance the observability of many physical variables in what are otherwise confounded states (e.g., barrel temperature and melt pressure). In this field, there are many scientific and engineering problems worth exploring.
5.2. Autonomous Learning Driven by Big Data
With the development of information technology, more and more data can be collected in the current industrial production process, including the process parameters and product quality of each batch. If autonomous learning based on big data analysis can be realized, higher efficiency in the optimization of process parameters can be achieved. It is without a doubt though that this process contains many difficulties, such as data standards and protocols, the accurate acquisition of data samples, and the integration of various complicated methods. It is believed that future technological breakthroughs in this area will bring fruition to industrial integration and production.
5.3. Definition of Product Quality Characteristics
It is well known that many types of defects in injection molded parts cannot be accurately described under certain circumstances. Some process optimization methods already have the capability of giving a better combination of process parameters, but these methods are often offline. To realize the combination of online detection and process optimization, a definition of quality characteristics of injection molded parts is very important. The approach may be to estimate their physical valuables per area. At present, the online detection mode for product weight, density, warpage, etc., is insufficient, and a direct quality feedback control system has yet to be developed. Hopmann and Heinisch [185] have made a great attempt in this direction even though they characterized the viscosity of the melt rather than the part quality. In addition, there are some product defects such as flow marks and bubbles that cannot be characterized quantitatively, thus rendering quantitative optimization methods inadequate. Therefore, how to properly characterize defects and further promote them will be a new trend in future optimization studies.
5.4. Model Identification Based on the Learning Method
For a batch process like injection molding, the development of feedback control and learning control reduces the dependence of the algorithm on the model. However, for model-based control algorithms, an accurate model has a crucial influence on the control result. And a good model can still accelerate the convergence of the algorithm and improve the tracking performance. During the injection molding process, the batch-direction data can be considered as a basis for system modeling and the characteristics of the batch can be further exploited. In addition, owing to changes in molds and materials, previous algorithms cannot necessarily be continued, which also puts forward new requirements for the modeling of the process.
Conflicts of Interest
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Acknowledgments
The authors would like to acknowledge the financial support of the National Natural Science Foundation Council of China (nos. 51875519, 51635006, and 51821093), the Zhejiang Provincial Natural Science Foundation of China (no. LZ18E050002), and the Key Research and Development Plan of Zhejiang Province (no. 2020C01113).