Abstract

In order to improve the data retrieval and mining ability of agricultural information management system, an agricultural information management data model based on cloud computing and semantic technology was proposed. Fuzzy C-means algorithm is used for adaptive fusion and clustering of semantic association features of distributed data in large-scale information management systems; feature compressors are used to reduce the storage space dimension of large-scale information management system to improve the target data mining capability and adaptive scheduling capability of information management system. The registration rate improves than the traditional method by 12.46% and computational cost by 23.76%. This method has higher accuracy in storage data mining and retrieval for large information management systems and superior performance than the traditional method. The introduction of the concept and technical methods of related data is the best practice to realize the fine disclosure, standardized description, semantic organization, and in-depth integration of massive agricultural science and technology information resources, which will play an important role in improving the visibility, visibility, and accessibility of agricultural information resources. It is innovative in the construction of the multidimensional semantic-related model and field knowledge service system driven by related data.

1. Introduction

As the basic industry of China’s economy, agriculture’s development level has a profound impact on the quality of life of Chinese people [1]. At present, China’s agriculture is in the stage of developing to informationized agriculture, and the foundation of agricultural informationization is agricultural spatial information [2]. The construction of cloud portal for agricultural spatial information management is studied, and the research background of this study is expounded from the perspectives of theory, technology, and application. Since the 21st century, with the geometric growth of data, cloud computing technology has become one of the hotspots in the field of Internet high-tech research [3]. Cloud computing belongs to distributed computing, while cloud GIS, based on the basic theory and technology of cloud computing, realizes the storage, management, processing, and analysis of geo-spatial information, thus changing the traditional GIS structure and application mode [4]. The cloud GIS technology can realize the storage and processing of massive agricultural spatial data and provide the platform with more efficient agricultural spatial data computing and agricultural space application analysis capabilities; it greatly improves the management and sharing efficiency of agricultural spatial information and is a powerful technical support for the construction of agricultural information management platform in the future. With the continuous development of modern society, human beings have entered an era of information explosion. With the increasing number of application users, increasing demand for computing power, and increasing security requirements, enterprises have to increase their investment in hardware equipment to meet the growing needs. At the same time, in order to maintain the safety and reliability of the hardware equipment, the system operation and maintenance requirements are also increasing day by day. More importantly, this growth model is conducted exponentially. SMEs and individuals are not only discouraged from the cost of software development but also difficult for large companies. Therefore, cloud computing technology, which is easy to use and easy to obtain resources, has gradually entered people’s sight.

In the network information environment, the era of “BigData” has come, and the “fourth paradigm” of scientific research characterized by data-intensive computing is rising. Existing information organization and service means are difficult to adapt to the development and change of agricultural scientific research environment and scientific research methods; due to the lack of orderly integration, organization, standard control, mining, and utilization of massive agricultural scientific and technological information resources, agricultural researchers are in the “information ocean;” however, it is often faced with the dilemma of “excessive information and lack of knowledge and medical treatment,” making it increasingly difficult for people to find and obtain information accurately, quickly, and comprehensively [4]. The large information management system is built in the database setting based on the calculation and information retrieval algorithm and combined with the information scheduling technology, conducts the overall design framework and database development of the information management system, improves the information retrieval and integrated information processing ability, and studies the data mining technology of the large information management system, which is of great significance to improve the distributed retrieval and scheduling ability of the information management system. In large-scale information management systems and cloud database media, effective data mining methods are needed to access and schedule distributed data in large-scale information management systems to improve data utilization efficiency and resource-sharing degree; in traditional methods, semantic feature mining of large-scale information management system is based on statistical analysis and feature extraction of distributed data bit information flow in large-scale information management system; typical data mining methods include association rule mining, semantic ontology model mining, spectrum feature extraction, and fuzzy K-means mining; the fuzzy clustering center of big data feature distribution in large-scale information management system is constructed, combined with spectral feature extraction and semantic analysis method, data mining is carried out, and some research results have been obtained [5]. The construction of agricultural information management data model can significantly improve the visibility, visibility, accessibility and scientific value of agricultural science and technology information resources will play an important role and has an important research value and practical significance for promoting agricultural scientific research and innovation.

According to the above problems, this study proposes a data mining technique for a large information management system based on semantic correlation feature extraction. This section first briefly analyzes some new trends and characteristics of the environment of agricultural scientific research innovation and information needs of scientific research users under the background of data-intensive computing and the rise of the “fourth paradigm” of scientific data and the arrival of the era of “big data.” It is proposed to introduce new service concepts and technical means and other related data to innovate service content and functions and improve the role and contribution of knowledge service to scientific and technological innovation [3, 6].

2. Research Methods

2.1. Analysis of Data Storage Structure and Characteristics of Large-Scale Information Management System
2.1.1. Data Storage Structure Model

In the cloud computing and semantic technology environment, especially from the traditional file network (Web of Document) to the structured and semantic-rich data network (Web of Data) and traditional knowledge organization systems (such as narrative lists, topic vocabulary, and taxonomy) have regained their attention and attention and also play an increasingly important role in organizational management, mining, and analysis and development and utilization of massive information resources. The database storage basic block of large-scale information management system is ; combined with the closed frequent item set fusion method, the target data semantic training set of large-scale information management satisfies , ; let be the distributed amplitude of the data structure; the probability of occurrence of target number in large-scale information management system is represented by , ; the interval unit of big data sampling is obtained. The time window of data mining is expressed as:where represents the characteristic component of data association rules, while η represents the mutual information, the data storage element is input into the storage link layer of the large-scale information management system, the steady-state characteristic quantity of information scheduling is , and the transfer function of distributed storage of large-scale information management system can be obtained as follows:

The model of associated rule distribution for large information management systems (Figure 1).

2.1.2. Feature Analysis of Semantic Correlation Dimension

The information management system is optimized and reorganized by combining the feature recombination method of big data information flow; the feature quantity of semantic association dimension of information management distributed data is extracted from the topology structure of the reorganized information management system. The decision statistics of association rule information fusion are as follows:where represents the big data sampling sequence in the large-scale information management system, d represents the data sampling interval, , is the mean, and represents the weight of the evaluation index, which is

Combining with the binary semantic weight analysis method, the binary semantic decision matrix is obtained as follows:

By constructing the evaluation decision matrix and combining with the adaptive information fusion method, the decision criteria of data mining of large-scale information management system are obtained as follows:

The ontology clustering center was built according to each evaluation object and the optimal scheme, and the absolute error of and was calculated; the decision threshold value of data mining of large-scale information management system is , when

At the confidence level of 95%, the accurate distribution probability of semantic association feature mining can be obtained as follows:

Type: ; when , the target data mining of the information management system is aggregated with a probability of 95% in the semantic ontology model, and the semantic association feature extraction results are

2.2. Optimization of Data Mining Algorithm
2.2.1. Semantic Relevance Feature Extraction

The integrated scheduling and data mining of the information management system are carried out with the semantic association feature quantity as the training sample set; fuzzy C-means algorithm is used for adaptive fusion and clustering of semantic association features of distributed data in large-scale information management systems; distribution data distribution sample attribute set is , and its equivalent similarity transitive mapping relationship satisfies

The semantic mapping relationship between ontology is established, for the target data containing n samples, where the correlation characteristic distribution vector of sample , iswhere refers to the envelope amplitude of potentially useful information of distributed data in the information management system to be large scale, is the statistical average value of data, and is an interference term, and each concept in the ontology model can also be called a class. Semantically, when φ: , an embedded high-dimensional phase space exists for ; two groups of very similar ontology fragment data sample sequence , are obtained; thus, semantic mapping relationship between ontology of information management system is obtained:where represents the information flow vector of distributed data input to the large-scale information management system, is the embedding dimension of phase space, is the correlation order of semantic information, and is the data sampling interval. In the topology structure of reorganized information management system, the feature quantity of semantic association dimension of information management big data samples is calculated, and information scheduling and association rules mining are carried out by combining data clustering algorithm [7, 8].

2.2.2. Information Management System Target Data Mining

Considering two groups of ontology fragments with the similar structure, the objective function of semantic association feature fusion of distributed data in large-scale information management system is constructed:where is the ontology segment in the distributed data information of the massive large-scale information management system that contains the target attribute set of mining. P is the measure distance, expressed as equation (14), by Euclidean distance:

According to the semantic mapping SM-Context, the output information management data target sample set meets the following requirements:

Combined with the context feature optimization method of the concept, the target data scheduling of the information management system is carried out, and the optimal output solution is obtained:

The semantic components contained in the concept are clarified, and then, the feature quantity of semantic association dimension of distributed data in the large-scale information management system is extracted, and the iterative formula of feature extraction is obtained:

The semantic association feature quantity is taken as the training sample set to carry out the integrated scheduling and data mining of the information management system, and the data classification mining is carried out in the logical form of intersection. The output results of data mining are as follows:where ω is the initial weight of semantic mapping, is the joint characteristic quantity of the disjunctive relation, and represents the difference between the semantics of a concept and the elements around it. The semantic association feature is input into the fuzzy C-means classifier, and the feature compressor is used to reduce the dimension of the storage space of the large-scale information management system, so as to improve the target data mining ability and the adaptive scheduling ability of the information management system.

2.3. Cloud Computing Technology
2.3.1. Features of Cloud Computing

Since 2007, cloud computing has gradually appeared in people’s field of vision. With the rapid development of computer technology, cloud computing has also begun to shift from academic theoretical research to practical applications and has brought convenience to people’s lives. Compared with other computing modes such as traditional grid, cloud computing, as a new computing mode, has the following characteristics:(1)Virtualization: cloud computing uses virtualization technology to encapsulate hardware and software resources into services, and users can access and use these services on demand through the Internet. These services execute somewhere deep in the “cloud,” where the user does not know and need not know where they are running. Users only need a cloud terminal device, such as a laptop, to easily obtain services in the cloud, even including powerful computing tasks.(2)It can be dynamically expanded and has The ability to provide functionality quickly and elastically to scale and to quickly add or release resources to scale down. For cloud users, the resources available for rent are virtually endless, and any number of purchases can be made at any moment.(3)Universality: the application of cloud computing has nothing to do with the physical platform. All operations are unified on the virtual platform built by the cloud, and various applications can be run on its platform.(4)On-demand deployment: because the “cloud” uses virtualization technology, the deployment of its platform is very flexible. Cloud computing can change capacity according to user needs and deploy applications into virtual computing pools. When a user submits a task, virtual machine resources are deployed according to the actual number of user tasks.(5)Pay on demand: charges for features use pay-per-use services or use an advertising-based charging model to increase cloud computing resource usage. For example, storage, computing, and bandwidth resources are measured and charged monthly based on the number of users used. An enterprise’s private cloud may increase the cost of the enterprise, and the internal personnel of the enterprise may use physical currency, or the private cloud may be used for free.(6)High reliability: the user’s services and computing will not be affected by the failure of a certain machine in the cloud. Data redundancy technology, virtual machine dynamic migration technology, etc., are used to ensure the smooth completion of user tasks.(7)High cost performance: cloud data centers are composed of a large number of cheap server clusters. The configuration of a single server may be low, but after they are unified to form a huge computing resource pool, its computing power will far exceed that of a mainframe.

2.3.2. Classification of Cloud Computing

At present, there are many classifications of cloud computing in the industry, but there are two generally recognized classification methods, namely, by service type.

Classification and Classification by Service:

(1) Classification by Service Type. Cloud computing can provide users with a variety of services. By using these services, users can obtain corresponding resources. According to the type of service, it can be roughly divided into three categories from the bottom up, namely, Infrastructure as a Service (Infrastructure as a Service) (IaaS), Platform as a Service (PaaS), and Application Software as a Service (SaaS).

(2) Classification by Service Mode. As an emerging computing model, although cloud computing has many advantages that existing computing models do not have, there are still some problems. The first is the issue of security. For companies with high security requirements, such as banks, operators, and the military, once security is compromised, the consequences will be disastrous. The second is the issue of supervision. Some enterprises have very strict internal management and operation and maintenance systems and do not want to be grasped and interfered by related industries outside the company. Although cloud computing can provide protection for enterprises and users through security isolation measures, it still cannot meet the needs of all users. Therefore, in response to these problems, cloud computing is divided into public cloud, private cloud, community cloud, and hybrid cloud according to the relationship between service providers and users.

3. Result Analysis

The construction of the large-scale information management system is based on the database design and information retrieval algorithm, combined with the information feature extraction and optimization technology, which carries out the overall design framework and database development of the information management system and improves the information retrieval and integrated information processing ability. The design of a platform is to design platform objectives, specify design rules, and conduct actual platform construction according to the design rules. Only in the process of building the platform, strictly observing the specified design rules and improving the functions according to the design rules can the platform that meets the needs and purpose be made. This study designs the following rules for the built agricultural space information management cloud portal. The distributed data in the information management system includes 32 concept sets and 80 attribute sets and contains 100 semantic information instance sets [9]. The initial position of semantic mapping (0, 0.25), the correlation coefficient is 0.23, the inertia weight acceleration factor of information management system data retrieval is 0.56, and the initial sampling frequency of sample data is , terminating the sampling frequency , the simulation duration is 200 ms, the management data of the information management system contains 32 concepts, 62 attributes, and 100 instances, and the mapping of the semantic relationship is shown in Table 1.

According to the semantic ontology association mapping in Table 1, the semantic mapping results between the reference ontology and the first 10 ontologies are given, as shown in Table 2.

The data mining of the information mining system obtains the time-domain and frequency domain waveforms of the original data samples collected in the large information management system. Taking sample data as the test object set, semantic correlation feature extraction and big data mining can obtain feature extraction results. The analysis results show that the data mining conducted by this method has strong anti-interference feature extraction and good data amassation. In order to compare the performance of different mining algorithms, the present method and the traditional method conduct the target data mining and retrieval of the information management system under the same conditions. The check rate comparison results of the test data mining are shown in Table 3, and the data mining accuracy comparison results are shown in Figure 2.

According to the analysis of Table 3 and Figure 2, the greater the number of iterations, the higher the registration rate of this method is increased by 12.46%, and the calculation cost is decreased by 23.76%. The accuracy of storage data mining and retrieval of large information management systems is higher, and the performance is superior to the traditional method. However, the method designed in this study has big data and low accuracy of data scheduling, so the next step needs to be improved in this respect [10]. The construction of a large information management system is based on the database design and information retrieval algorithm, combined with information feature extraction and optimization, to carry out the overall design framework and database development of the information management system and improve the information retrieval and integrated information processing ability.

4. Conclusions

Through the above research work, we have mastered the core technical principles of associated data, the construction method of semantic association model, and the principles, processes, and key technical methods of associated data from construction, publication to consumption, and application. An agricultural information management data model based on cloud computing and semantic technology is proposed. The cloud storage model is constructed for big data distributed storage design in large information management system, extracts the semantic correlation dimension features of information management distribution data in reorganized information management system, uses fuzzy C mean algorithm for the adaptive integration and clustering of distributed data in large information management system, uses the storage space dimension reduction processing of large information management system, and improves the target data mining ability and adaptive scheduling ability of information management system. Significantly improving the availability, visibility, availability, and scientific value of agricultural science and technology information resources will play an important role, which has important research value and practical significance for promoting the innovation of agricultural scientific research. Methods: the data mining of large-scale information management system has good accuracy and strong anti-interference ability.

Although the massive data contains a large amount of valuable information, the vast majority of them are semistructured or unstructured and isolated data, lacking normative control, deep disclosure, and semantic correlation, making it difficult for computers to process these data automatically. To this end, people carried out the construction of semantic network research and launched the associated open data movement; more and more data providers and application system will release their own data in the Internet in the form of the associated data, implanted more standard semantic information in the data and interconnection with other data sources, formed a huge data network rich in semantics, and laid the foundation for computer accurate understanding and intelligent processing of the data.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflicts of interest.