Journal of Spectroscopy

Journal of Spectroscopy / 2014 / Article
Special Issue

Spectroscopy in Materials Chemistry

View this Special Issue

Research Article | Open Access

Volume 2014 |Article ID 901310 | 5 pages |

Analysis of the Oil Content of Rapeseed Using Artificial Neural Networks Based on Near Infrared Spectral Data

Academic Editor: Qingrui Zhang
Received09 May 2014
Accepted02 Jun 2014
Published23 Jun 2014


The oil content of rapeseed is a crucial property in practical applications. In this paper, instead of traditional analytical approaches, an artificial neural network (ANN) method was used to analyze the oil content of 29 rapeseed samples based on near infrared spectral data with different wavelengths. Results show that multilayer feed-forward neural networks with 8 nodes (MLFN-8) are the most suitable and reasonable mathematical model to use, with an RMS error of 0.59. This study indicates that using a nonlinear method is a quick and easy approach to analyze the rapeseed oil’s content based on near infrared spectral data.

1. Introduction

Infrared absorption spectroscopy is a common approach for analyzing food composition [13]. For a certain characteristic absorption frequency, Lambert’s law provides the following equation [4, 5]: where represents incident light intensity, represents transmission light intensity, represents the attenuation coefficient, represents the distance the light travels through the material, and represents concentration.

Equation (1) is widely used for determining food composition. However, because the wavelengths in the infrared absorption spectrum are diverse and the force of penetration is tiny, infrared absorption spectroscopy can only be used for analyzing transparent liquids. It is of great difficulty to analyze the oil content of rapeseed using infrared absorption spectroscopy. Therefore, to solve this problem, this study instead uses a nonlinear approach to analyze near-spectral data to determine the oil content of rapeseed.

2. Artificial Neural Networks

2.1. Fundamental of ANN Models

Artificial neural networks (ANN) model is composed of an interconnected group of artificial neurons. In most circumstances, an artificial neural network is an adaptive system that is equipped to be adapting continuously to new data and learning from the accumulated experience and noisy data [6, 7]. Apart from that, the system structure can be changed based on external or internal information that flows through the network during the learning phase. Meanwhile, essential information can be abstracted from data or model complex relationships between inputs and outputs [810].

As can be seen from Figure 1, the main structure of the artificial neural network (ANN) is made up of the input layer and the output layer. The input variables are introduced to the network by the input layer [11]. Also, the response variables with predictions, which stand for the output of the nodes in this certain layer, are provided by the network. Additionally, the hidden layer is included. The type and the complexity of the process or experimentation usually iteratively determine the optimal number of the neurons in the hidden layers [12].

2.2. Model Development

Gu and Wang [12] have accomplished a series of researches from correlative precision instrument from which we could obtain data of rapeseeds’ near infrared spectroscopy by analyzing absorbance under different wavelengths. We defined (%) as the percentage composition of the oil in rapeseed. Data of 29 rapeseed samples are shown on Table 1.

Sample (%)Wavelength (μm)


In order to confirm the most suitable and robust ANN model in analyzing the oil content of rapeseed, 21 models were established including linear prediction model, general regression neural networks (GRNN) [14] and multilayer feed-forward neural networks (MLFN) [15, 16]. Into that matter, nodes of MLFN models were set to be from 2 to 20, so that the most robust MLFN model could be found. The independent variables are the absorbancies under the wavelength of 1.68 μm (reference wavelength), 1.73 μm (characteristic absorption wavelength of fat), 1.94 μm (characteristic absorption wavelength of water), 2.10 μm (characteristic absorption wavelength of starch), and 2.18 μm (characteristic absorption wavelength of protein), respectively, while the dependent variable is the percentage composition of the oil in rapeseed. Training set is consist of 24 samples while the rest of the samples are considered to be the testing set. To ensure the accuracy of the experiments, we did the training process repeatedly. The composing of trained samples and tested samples is different in each experiment. Results of the 21 models were obtained by correlative software, which are shown in Table 2.

ModelTrained samplesTested samplesRMS errorStopped reason

Linear prediction2450.60Autostopped
MLFN 2 nodes2451.04Autostopped
MLFN 3 nodes2451.23Autostopped
MLFN 4 nodes2450.88Autostopped
MLFN 5 nodes2451.29Autostopped
MLFN 6 nodes2451.31Autostopped
MLFN 7 nodes2452.20Autostopped
MLFN 8 nodes2450.59Autostopped
MLFN 9 nodes2453.39Autostopped
MLFN 10 nodes2451.93Autostopped
MLFN 11 nodes2450.83Autostopped
MLFN 12 nodes2451.54Autostopped
MLFN 13 nodes2451.10Autostopped
MLFN 14 nodes2451.57Autostopped
MLFN 15 nodes2453.02Autostopped
MLFN 16 nodes2451.67Autostopped
MLFN 17 nodes2451.07Autostopped
MLFN 18 nodes2451.72Autostopped
MLFN 19 nodes2450.85Autostopped
MLFN 20 nodes2452.79Autostopped

Results presented by Table 2 imply that the lowest RMS error of testing exists in the MLFN model with 8 nodes (MLFN-8), which is 0.59, lower than those generated by linear prediction model and GRNN model. And the accuracy rate of the testing is 100% with the permission error. Therefore, the MLFN-8 model is proved to be an accurate and robust model.

3. Results and Discussion

3.1. Training Results of MLFN-8

Training and testing results of MLFN-8 model were extracted from the experiments. For more intuitionistic, six figures described by data are used to portray the training and testing results, which are shown in Figures 2 to 7.

In training process, the comparison result between predicted values and actual values is depicted by Figure 2. The regulation between predicted values and actual values implies that the training process is precise.

Figure 3 depicts the relationship between residual values and actual values during training process, showing that the residual values are relatively concentrated.

Different from Figure 3, Figure 4 depicts the relationship between residual values and predicted values during training process. Similar to the result shown in Figure 3, the residual values present the same phenomenon as Figure 3, which indicates that the training process is precise.

In general, Figures 2, 3, and 4 depict the results of training process, showing that the values are concentrated and correspond with the normal training process of MLFN-8 model. It is worth mentioning that the residue values are generally tiny and close to zero, which implies that the training process is correct and precise.

3.2. Testing Results of MLFN-8

To analyze the testing process, three figures were used to present the average values of testing results, which are shown in Figures 5 to 7.

In testing process, as shown in Figure 5, the comparison between predicted values and actual values is also close to linear situation, which means that the MLFN-8 model is precise while predicting.

In order to confirm the robustness of comparison between residual values and actual values as well as the comparison between residual values and predicted values, we plotted the comparison between residual values and these two kinds of values, which are shown in Figures 6 and 7.

Figures 5, 6, and 7 depict the average testing process of the MLFN-8 model. All the values shown in the three figures are the average values, from which we can draw a conclusion that the model is accurate and robust.

According to the results presented above, MLFN-8 model is proved to be a suitable and rational model in determining the oil content of rapeseed.

3.3. Discussion

There are several previous studies that are relative to the field we studied [12, 1720]. Gu and Wang [12] analyzed the oil content of rapeseed by multiple linear regression based on near spectral data, which is the chief inspiration of our work. In contrast, our work has a higher robustness and precision since the core we paid attention to is the well-fitted nonlinear function. Besides, Madsen [17] established a quick determination approach of oil content in rapeseed by a commercial nuclear magnetic resonance spectrometer. Tkachuk [18] utilized a near infrared reflectance technique to determine oil, protein, chlorophyll, and glucosinolate content in whole rapeseed kernels. In addition, Velasco and relative coworkers [19] used near-infrared reflectance spectroscopy to estimate the seed weight, oil content, and fatty acid composition in intact single seeds of rapeseed. Shafii and his coworkers [20] analyzed the interaction effects on the winter rapeseeds yield and oil content. These researches can analyze the oil content and other properties of rapeseeds effectively, which can be seen as the great references. However, these analytical approaches still need complex manual operation and the process is intricate to some extent. Our study has successfully proved that the oil content of rapeseed can be analyzed by artificial neural networks, which is a quick and easy method that can be calculated automatically by computer.

In the field of food science and analytical chemistry, oil content of rapeseed reveals the yield of the relative products in practical applications. Taking one of the production steps as an example, people should estimate and evaluate the oil content of the rapeseed samples before mass run. Therefore, using artificial neural networks can achieve this step in a high effective way.

4. Conclusion

Oil content of rapeseed is a crucial aspect on practical applications of food science and chemistry. In this paper, instead of using traditional analytical methods, we successfully used artificial neural networks (ANNs) method to analyze the oil content of 29 rapeseed samples based on near spectral data with different wavelengths. Results show that the multilayer feed-forward neural networks with 8 nodes (MLFN-8) are the most suitable and reasonable mathematical model during experiments. In future research, we will aim at looking for the explicit nonlinear functions of near spectral data in the analysis of rapeseed’s oil content.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.


This work was funded by the National Marine Public Welfare Research Project (nos. 201305002 and 201305043) and the Natural Science Foundation of Dalian (no. 2012003219).


  1. K. Motobayashi, K. Minami, N. Nishi et al., “Hysteresis of potential-dependent changes in ion density and structure of an ionic liquid on a gold electrode: in situ observation by surface-enhanced infrared absorption spectroscopy,” The Journal of Physical Chemistry Letters, vol. 4, no. 18, pp. 3110–3114, 2013. View at: Google Scholar
  2. L. V. Brown, K. Zhao, N. King, H. Sobhani, P. Nordlander, and N. J. Halas, “Surface-enhanced infrared absorption using individual cross antennas tailored to chemical moieties,” Journal of the American Chemical Society, vol. 135, no. 9, pp. 3688–3695, 2013. View at: Publisher Site | Google Scholar
  3. Y.-T. Su, Y.-H. Huang, H. A. Witek, and Y.-P. Lee, “Infrared absorption spectrum of the simplest criegee intermediate CH2OO,” Science, vol. 340, no. 6129, pp. 174–176, 2013. View at: Publisher Site | Google Scholar
  4. J. M. Parnis and K. B. Oldham, “Beyond the Beer-Lambert law: the dependence of absorbance on time in photochemistry,” Journal of Photochemistry and Photobiology A: Chemistry, vol. 267, pp. 6–10, 2013. View at: Publisher Site | Google Scholar
  5. K. Fuwa and B. L. Vallee, “The physical basis of analytical atomic absorption spectrometry: the pertinence of the Beer-Lambert law,” Analytical Chemistry, vol. 35, no. 8, pp. 942–946, 1963. View at: Google Scholar
  6. N. Gupta, “Artificial neural network,” Network and Complex Systems, vol. 3, no. 1, pp. 24–28, 2013. View at: Google Scholar
  7. H. Li, X. F. Liu, S. J. Yang et al., “Prediction of polarizability and absolute permittivity values for hydrocarbon compounds using artificial neural networks,” International Journal of Electrochemical Science, vol. 9, no. 7, pp. 3725–3735, 2014. View at: Google Scholar
  8. Y. Wang, H.-C. Han, J. Y. Yang, M. L. Lindsey, and Y. Jin, “A conceptual cellular interaction model of left ventricular remodelling post-MI: dynamic network with exit-entry competition strategy,” BMC Systems Biology, vol. 4, no. 1, article S5, 2010. View at: Publisher Site | Google Scholar
  9. Y. Wang, T. Yang, Y. Ma et al., “Mathematical modeling and stability analysis of macrophage activation in left ventricular remodeling post-myocardial infarction,” BMC Genomics, vol. 13, supplement 6, article S21, 2012. View at: Google Scholar
  10. T. Yang, Y. A. Chiao, Y. Wang et al., “Mathematical modeling of left ventricular dimensional changes in mice during aging,” BMC Systems Biology, vol. 6, supplement 3, article S10, 2012. View at: Google Scholar
  11. C. H. Aladag, A. Kayabasi, and C. Gokceoglu, “Estimation of pressuremeter modulus and limit pressure of clayey soils by various artificial neural network models,” Neural Computing and Applications, vol. 23, no. 2, pp. 333–339, 2013. View at: Publisher Site | Google Scholar
  12. W. Z. Gu and Y. X. Wang, “Analysis of rapeseed oil by linear regression using near spectral data,” Journal of the Chinese Cereals and Oils Association, vol. 10, no. 2, pp. 57–64, 1995 (Chinese). View at: Google Scholar
  13. Ö. Polat and T. Yıldırım, “FPGA implementation of a General Regression Neural Network: an embedded pattern classification system,” Digital Signal Processing, vol. 20, no. 3, pp. 881–886, 2010. View at: Publisher Site | Google Scholar
  14. C. H. Chen, T. K. Yao, C. M. Kuo et al., “Evolutionary design of constructive multilayer feedforward neural network,” Journal of Vibration and Control, vol. 19, no. 16, pp. 2413–2420, 2013. View at: Google Scholar
  15. S. Mirjalili, S. Z. Mohd Hashim, and H. Moradian Sardroudi, “Training feedforward neural networks using hybrid particle swarm optimization and gravitational search algorithm,” Applied Mathematics and Computation, vol. 218, no. 22, pp. 11125–11137, 2012. View at: Publisher Site | Google Scholar
  16. R. Pahlavan, M. Omid, and A. Akram, “Energy input-output analysis and application of artificial neural networks for predicting greenhouse basil production,” Energy, vol. 37, no. 1, pp. 171–176, 2012. View at: Publisher Site | Google Scholar
  17. E. Madsen, “Nuclear magnetic resonance spectrometry as a quick method of determination of oil content in rapeseed,” Journal of the American Oil Chemists Society, vol. 53, no. 7, pp. 467–469, 1976. View at: Publisher Site | Google Scholar
  18. R. Tkachuk, “Oil and protein analysis of whole rapeseed kernels by near infrared reflectance spectroscopy,” Journal of the American Oil Chemists' Society, vol. 58, no. 8, pp. 819–822, 1981. View at: Publisher Site | Google Scholar
  19. L. Velasco, C. Möllers, and H. C. Becker, “Estimation of seed weight, oil content and fatty acid composition in intact single seeds of rapeseed (Brassica napus L.) by near-infrared reflectance spectroscopy,” Euphytica, vol. 106, no. 1, pp. 79–85, 1999. View at: Publisher Site | Google Scholar
  20. B. Shafii, K. A. Mahler, W. J. Price et al., “Genotype X environment interaction effects on winter rapeseed yield and oil content,” Crop Science, vol. 32, no. 4, pp. 922–927, 1992. View at: Google Scholar

Copyright © 2014 Dazuo Yang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

More related articles

1121 Views | 547 Downloads | 6 Citations
 PDF  Download Citation  Citation
 Download other formatsMore
 Order printed copiesOrder

Related articles

We are committed to sharing findings related to COVID-19 as quickly and safely as possible. Any author submitting a COVID-19 paper should notify us at to ensure their research is fast-tracked and made available on a preprint server as soon as possible. We will be providing unlimited waivers of publication charges for accepted articles related to COVID-19. Sign up here as a reviewer to help fast-track new submissions.