Abstract

Determining the homogeneity of material mixing in real time during product processing is critical for quality control. According to the Kubelka–Munk (K-M) function of diffuse reflectance absorption spectrum, absorbance (A) is approximately linear with the content of the components when the sample scattering coefficient (S) is in a certain range. The S is determined by the particle size of powder samples. Therefore, this study determined particle size ranges that satisfy linear additivity in near-infrared diffuse reflectance spectroscopy (NIRDRS). Thus, the proposed NIRDRS analysis technique can be used to determine the homogeneity of material mixes or analyze the percentages of the components in the mixture. In this study, vitamin B3 and vitamin C were used for preparing mixed samples with varying percentages. The experimental results revealed that linear additivity is satisfied when the powder particle size is in the range of less than 280, 280–450, and 450–900 μm. When the confidence level is 0.01, the actual mixed spectra are not significantly different from the “simulated mixed spectra” constructed by linear addition, with their relative deviations less than 1.08%. The absolute errors of the actual and analytic percentages were within 2.98% for each component in the mixtures. The above conclusions also hold for sorghum, which has a complex material composition. Statistical models cannot analyze the percentages of components in the mixture. In contrast, linear addition and direct calibration approach avoids the use of a large number of samples for statistical modeling and analyze the percentages of mixed samples. Meanwhile, it can be used to discriminate and analyze the material mixing uniformity by building a mechanistic model.

1. Introduction

Near-infrared (NIR) diffuse reflectance spectroscopy can provide rapid analysis and the simultaneous determination of multiple components, and it does not require chemical reagents. Considerable progress has been made in using this technique in the quality and safety evaluation of medicine, as well as food and feed [15]. Medicine and agricultural products are high-value commodities that considerably affect people’s lives. The formulation design and mixing uniformity of the products are critical indicators for evaluating their quality. If the mixture is not uniform, the percentage and effectiveness of the product is severely affected. Vitamin B3 and vitamin C are the essential nutrients for maintaining the normal functions and metabolic activities of the body [69]. The homogeneity of the drug powder mixture considerably influences the consistency and stability of medicine quality. Testing is required to determine whether the actual content is consistent with the labeled content. Sorghum is used in brewing, food processing, and breeding [1012]. However, japonica and glutinous sorghum are used for different purposes. Raw material proportioning and mixing uniformity in processing and production considerably affect product utilization and nutritional value.

Image analysis [1315], UV-visible absorption spectrophotometry [1621], and high-performance liquid chromatography (HPLC) [2226] are typically used for determining mixing uniformity. The image analysis method requires particles of various colors and distinguishing between components of similar colors. UV and HPLC methods require a laboratory environment, which increases the difficulty and cost of detection. Determining the mixing uniformity method requires the establishment of predictive models in conventional quantitative NIR spectroscopy techniques. The accuracy of the technique depends on the established quantitative model. Furthermore, model maintenance and update become complex at a subsequent stage [2729]. Therefore, the development of a rapid and accurate quality monitoring and detecting method is critical for ensuring the uniform mixing of agricultural products and medicine.

Yan et al. [30] verified the additivity of NIR diffuse reflectance spectra and stated that for transmission spectroscopy, the absorbance (A) and the component content are proportionally linear according to Beer’s law. Therefore, transmission spectrum exhibits the additivity of spectral absorbance because diffuse reflectance spectroscopy can be converted based on the K-M function. A and K/S can be approximated by a straight line with intercept not equal to zero when the sample concentration range is small and S is within a certain range. Thus, K is proportional to the component content, such that A is approximately linear with the component content. Therefore, the diffuse reflectance spectrum of the solid exhibits a linear additive property.

This study combined the linear additivity of NIR diffuse reflectance with the formulation regression algorithm. The particle size range satisfying the linear additivity was verified experimentally. Furthermore, a method to determine the homogeneity of a mixture was proposed. Comparing the absolute error between the analytic and actual percentages or the difference between the simulated mixture homogeneity spectrum and the actual mixture spectrum can reveal the homogeneity status of the mixture. This technique is a low-cost determination method that does not require statistical modeling of multiple samples.

2. Materials and Methods

2.1. Material
(i)Starch-containing vitamin (vitamin B3 and vitamin C) tablets were procured from Tianjin Lisheng Pharmaceutical Co., Ltd. In this experiment, five particle size ranges of >900 μm, 450–900 μm, 280–450 μm, 180–280 μm, and <180 μm were obtained by grinding and sieving. All the mixed samples were prepared according to the mass percentage of 50%–50%. The particle sizes of the samples in the mixture are presented in Table 1:(ii)In this experiment, glutinous sorghum obtained from Shandong, China, Yungu sorghum and japonica sorghum obtained from Liaoning, China, and Farmer sorghum were used as raw materials. Raw powder material was obtained by grinding and sieving from 280 to 450 μm. Five groups of mixture test samples were prepared as presented in Table 2.
2.2. Instruments and Measurement Methods

The preparation process of the mixed samples is as follows: the two single samples were laid flat and then stirred for 2 mins to ensure a uniform mixture. The sample is placed in the experimental sample cup and naturally compacted with weight. An S450 grating-type NIR spectrometer (Shanghai Lengguang Technology Ltd., China) was used to obtain the spectra of all samples. The spectra were collected between the wavelengths of 1000–2500 nm, with a resolution of 8 nm at the diffuse reflection mode. Data collected were in absorbance and were obtained by scanning for 90 s. The single and mixed samples were resampled three times each, three NIR spectral data were measured, and the average spectrum was considered to be the final spectrum.

TMS is calculated by the linear addition of single sample spectra according to the mixing percentage. In the legend, the simulated mixture spectra are suffixed with the suffix “simulated spectrum” after the name of the sample group. The actual mixture spectrum is added with the suffix “actual spectrum” after the name of the sample group.

2.3. Experimental Method
2.3.1. Simulated Mixture Spectrum

In this study, the simulated mixture spectrum, is obtained by the linear addition of the spectra of each single sample in the mixture according to their respective percentages. The formula is as follows:where i denotes the ith single material, denote the spectral data measured corresponding to the single material, is the mass percentage corresponding to the single material in the mixture. For example, VB + VC(1)−simulated spectrum = VB spectral data×50% + VC spectral data × 50%.

2.3.2. Class of Constrained Regression (Formulation Regression)

In this study, the formulation regression algorithm is used to analyze the percentages of the components of a mixture. This algorithm, which was proposed by the famous Chinese mathematician Kai-Tai Fang, is a regression algorithm with non-negative regression coefficients of sum (1). This method has been applied in the tobacco industry [31, 32], and its algorithmic model is described as follows [33, 34]:

In the aforementioned PR model, Y is a n × 1 matrix, X is a n × matrix, β is a × 1 matrix, ε is a n × 1 matrix, and is a matrix with all 1 s in the data. Under the constraints of (1b) (1c), the least squares can be used to estimate β as follows:

If β ≥ 0 is obtained, the solution with the smallest β matrix is the unique solution of the PR model. Otherwise, the elimination change method is used to ensure the components of β remain non-negative.

In this study, independent variable Y constitutes the mixed sample spectral data vector, and the dependent variable X constitutes a single samples spectral data matrix. The percent vector β, which is solved according to the non-negative linear regression algorithm, is composed of the percentage coefficients, which can be used for quantitative analysis. For the binary quantitative analysis, two percentage coefficients exist. The sum of the two coefficients is 1, and the actual percentage coefficient is set as the criterion for accuracy.

2.3.3. Relative Deviation of the Spectrum

In this study, the difference between the actual mixed spectra and the simulated mixed spectra is detailed by comparing the relative deviation between the two spectra and whether the F and T tests pass. Here, the relative deviation is defined as follows:where n is the number of spectral wavelength points, are the actual spectral data, and are the simulated spectral data. These data were measured by NIR diffuse reflectance. The basic assumption of this method is that the relative deviation is small when the actual mixed spectra coincide with the simulated mixed spectra. Thus, the particle size range of this sample group satisfies linear additivity.

3. Results and Discussion

3.1. Analysis of Small Differences in the Particle Size Range

Two starch-containing vitamin samples of the same particle size were prepared according to the procedure presented in Table 1.

The single sample spectra, the actual mixed spectra, and the corresponding simulated mixed spectra of samples VB + VC (1) −VB + VC (4) are displayed in Figure 1.

The “VB + VC (1)-simulated spectrum” and “VB + VC (1)-actual spectrum” of 1750–1850 nm in Figure 1(a) are enlarged in Figure 2.

The F-value, the T-value, and relative deviation between the actual mixed spectra and the simulated mixed spectra of the vitamin samples VB + VC (1)−VB + VC (4) are presented in Table 3.

The actual percentages, the analytic percentages, and the error between them of the vitamin samples VB + VC (1) −VB + VC (4) are presented in Table 4.

Figures 1(a)1(b) and Figure 2 reveal that when the wavelength is in the range 1750–1850 nm, it belongs to the scattering region. No absorption of chemical groups is observed, which is defined as the chemically invalid absorption band. Analyzing the full spectrum affects the calculation accuracy. Therefore, the wavelengths in 1000–1749 nm and 1851–2500 nm were considered to be the characteristic bands. The scattering area becomes smaller as the particle size of the mixed sample expands, which effectively reduces the effect of scattering, as displayed in figures 1(c)1(d).

In Figure 1 and Table 3, the actual mixed spectra and the corresponding simulated mixed spectra are highly consistent. No significant difference was observed between them at α = 0.01, and the relative deviations are small, all within 1.03%. Table 4 reveals that the percentage of each component in the mixture analyzed by formula regression is close to the actual mixing mass ratio. The absolute error between the analytic percentage and the actual percentage is within 2.98%. Thus, the simulated mixed spectra obtained according to the linear addition principle in the range of less than 180, 180–280, 280–450, and 450–900 μm are consistent with the actual situation. The additivity of the diffuse reflectance spectra is feasible in the aforementioned range.

3.2. Analysis of the Expanded Particle Size Range

To determine the NIR diffuse reflectance linear additivity and gradually expand the range of mixed sample particle sizes, the sample sets of vitamin mixtures with various particle sizes are prepared according to Table 1.

The single sample spectra, the actual mixed spectra, and its corresponding simulated mixed spectra of the vitamin samples VB + VC (5)−VB + VC (8) are displayed in Figure 3.

The F-value, T values, and the relative deviation between the actual mixed spectra and the simulated mixed spectra of the vitamin samples VB + VC (5)−VB + VC (8) are presented in Table 5.

The actual percentages, the analytic percentages, and error between them of the vitamin samples VB + VC (5)−VB + VC (8) are presented in Table 6.

Figure 3, Table 5, and Table 6 reveal that as the particle size range of the sample group is expanded, the spectrograms and the corresponding data analysis of the sample group VB + VC (5) are in accordance with the diffuse reflection linear additivity. However, the actual mixed spectra of VB + VC (6), VB + VC (7), and VB + VC (8) the sample groups differed significantly from the constructed simulated mixed spectra with large relative deviations, and the T-test failed. Thus, the scattering coefficients varied considerably in this particle size range, and the diffuse reflectance spectra were not additive. Combining the sample groups under the same particle size, we have the following: the linear additivity can be satisfied when the particle size is in the range of <280 μm, 280–450 μm, and 450–900 μm.

3.3. Diffuse Reflectance Additivity of Sorghum Samples

To verify the aforementioned findings, five samples of japonica sorghum and glutinous sorghum with various gradient mass ratios were prepared according to Table 2.

The single sample spectra, actual mixed spectra, and corresponding simulated mixed spectra of sorghum samples are displayed in Figure 4.

The F-value, T-values, and relative deviation between the actual mixed spectra and the simulated mixed spectra of sorghum samples are presented in Table 7.

The actual percentages, the analytic percentages, and the error between them of sorghum samples are presented in Table 8.

As displayed in Figure 4, Table 7, and Table 8, the actual mixed spectra and the simulated mixed spectra overlap considerably with insignificant differences and relative deviations less than 0.84%. Thus, they passed the F-test and T-test. The absolute error between actual and analytic percentage were within 4.16% when the powder particle size was in the range 280–450 μm. Thus, the linear additivity of NIR diffuse reflectance is feasible at various mass gradients.

3.4. Application of the Determination of Mixing Uniformity

In case the percentage of each component in the mixture is known, the homogeneity of a material mixture can be detected when the material particle size range satisfies the NIR diffuse reflectance linear addition conditions. Two discrimination methods are listed as follows:(i)The percentage of each component in the sample can be analyzed using the known complex material system spectra combined with the formulation regression algorithm. The error in the homogeneity of the material mixture was determined from the analytic percentage and the actual percentage. In medical research, researchers determine the homogeneity of a mixture by measuring the amount of a component in the mixture (comparing the percentages in the formulations) [35]. The principle of this method is similar to the determination method proposed in this study.(ii)The NIR diffuse reflectance spectra of complex material systems are in accordance with the additive property of the spectral absorption values in that premise. The spectra of each component are added in corresponding percentages to produce a simulated mixed homogeneous spectrum. This spectrum is calculated as shown in equation (4). This spectrum represents the state in which the blend is uniformly mixed. F-test and T-test were applied to determine whether there is a significant difference between this spectrum and the actual mixture spectrum to evaluate the homogeneity of the mixture.

The flowchart for determining the mixing uniformity is displayed in Figure 5.

3.5. Application of Percentage Analysis

When the mixture is known to be homogeneous, a formulation regression algorithm is used to analyze the percentages of the components. In textiles, the content of cotton, polyester, polyamide fibers, and other fabrics is required to be estimated in blended fabrics. In petrochemicals, the composition of gasoline blending components, such as olefins and octane values, vary considerably. Therefore, the percentages of various gasoline blending components can be analyzed.

The flowchart of the percent analysis is displayed in Figure 6.

3.6. Standard Normal Variate Spectrum

The data for the vitamin and sorghum samples pretreated after standard normal variate (SNV) are presented in Tables 9 and 10, respectively. The table includes the analytic percentage, the actual percentage, the absolute error of both, the relative deviation between the actual mixed homogeneous spectrum, and the simulated mixed spectrum.

To eliminate the effect of uneven particle size and particle distribution on the measured spectrum, in this study, an SNV transformation preprocessing method was applied to verify whether the linear addition range can be extended. However, Table 9 reveals that the SNV pretreatment increases the relative deviation between the simulated mixed spectra and the actual mixed homogeneous spectra. The absolute error between the analytic and actual ratios also increases considerably. Even in the homogeneous state of the mixture, these differences were aggravated by the pretreatment for the SNV-processed sample groups. Therefore, the range of near-infrared diffuse reflectance linear addition and use was still within the range of <280 μm, 280–450 μm, and 450–900 μm.

However, Table 10 reveals that for the sorghum samples, SNV pretreatment reduced the absolute error in the actual and analytic percentages. The relative deviation between the simulated mixed spectra and the actual mixed homogeneous spectra also reduced. SNV can effectively eliminate the effect of scattering on the spectra in sorghum sample measurements.

4. Conclusion

This study revealed that the particle size range of <280 μm, 280–450 μm, and 450–900 μm satisfies the condition of linear addition. On this basis, we propose the use of NIR analysis technique to determine the homogeneity of material mixing. By building a mechanical model, a discriminative analysis of the material mixing situation can be achieved. As a simple and fast method to identify medicine and design agricultural product formulations, the method has a broad application prospect. The linear addition based on spectra and direct calibration approach and the percentages of the components in a mixture can be analyzed. It does not need to use a large amount of data to build statistical models. This method only requires the use of a single spectrum of each component of the mixture and a mixture spectrum.

However, the SNV pretreatment method was not effective. Therefore, further research on spectral pretreatment method is critical for reducing the scattering effect. The range of particle sizes that satisfy the linear addition property should be expanded. The materials used in this study exhibit similar density. Therefore, the mass percentage is used in the experiments. For samples with large density differences, further verification is still required.

Data Availability

The spectral data used to support the findings of this study are included within the supplementary materials.

Conflicts of Interest

The authors declare that there are no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Acknowledgments

The study was supported by the National Key Research and Development Program (Grant no. 2016YFD0700304).

Supplementary Materials

The supplementary material contains the spectral data that was used to support the findings of the manuscript. In the data file, column A gives sample group, column B gives sample’s name, and the other columns show the spectral data. The first row presents the wavenumber value. (Supplementary Materials)