Abstract

The traditional data automatic office system has limited mining and computing capabilities. Due to the iterative complexity of data mining algorithms, it is difficult to discover the relationships and rules existing in the Internet of Things data as well as impossible to advance the efficiency of the office system based on the existing Internet of Things data. This paper combines cloud computing and machine learning to construct an intelligent network office system, realizes large-scale IoT data processing through the combination of IoT data mining technology and cloud computing framework, and constructs the functional module structure of the intelligent network office system through demand analysis. On this basis, this paper conducts system performance verification and conducts experimental design based on network intelligent system demand. The experimental results show that the system constructed in this paper has certain practical effects, which can provide theoretical reference for subsequent related research.

1. Introduction

In the early stage of the development of the information industry, information technology was not advanced enough. Software products were scarce resources, and the software market was a seller’s market. Technical factors played a dominant role in software development. After successful software development, the corresponding market demand could be found. The key link of the informatization project at this stage is the writing of software code, and there is basically no need for targeted demand analysis. The success of software development indicates the success of the project. With the rapid development of information technology, the continuous expansion of the information industry, the shortening of the life cycle of information projects, and the introduction of personalized products, software systems have entered the buyer’s market. Customers’ requirements for software systems are constantly increasing, and their requirements for personalization, specificity, and customized software systems continue to increase. At this time, informatization projects enter the buyer’s market, and customized software is the industry’s development trend. Whether developers can develop products that meet customer needs and satisfy customers is the criterion for testing the success of software development projects [1].

At present, in the daily office management applications, there are a large number of process management and process processing issues, including resource coordination, task allocation, work scheduling, and control [2].

Due to insufficient process support capabilities, the existing office management system cannot truly realize the electronic management of official documents in the government. Electronic writing can replace manual writing, which solves the problems of slow transmission of official documents, untimely information, unsynchronized information, and difficult access and realizes official documents, electronic drafting, review, countersignature, issuance, registration, approval, circulation, and other tasks. The existing office management system cannot realize tasks such as automatic processing feedback, reminder, process tracking and statistics, query, and archiving of task personnel and business processes and cannot be flexible. Setting the official document process and conducting full-text search, the work quality and efficiency cannot be effectively tracked and implemented.

The main goal of the office automation system is to use a variety of information and communication technologies to realize the coordination and solidification of daily office management information resource sharing and management and office management processes, improve daily office management efficiency and daily office management quality, and assist office management decision-making [3].

This article introduces the background status and actual needs of intelligent network office, studies the current literature research status of cloud computing and intelligent network office, and analyzes the existing research problems. On this basis, this paper proposes a cloud computing application of intelligent clustering algorithm. With the support of the cloud computing clustering algorithm, an intelligent network office system is constructed, and the performance of the system is verified, which verifies the effectiveness of the system in this paper.

In terms of the depth of theoretical research in the above literature, the introduction of new technologies, and the promotion of mature products, especially in the application of workflow technology in office management systems, more domestic research is still theoretical, and there are relatively few practical application products. Workflow is the practical application of the office management system that needs further research. This article builds an intelligent network office system with the support of cloud computing technology and machine learning to improve the efficiency of intelligent office.

Internet of Things was called the sensor network in the early days, and the sensor network technology was rated as the top ten science and technology that will change the future life of mankind [4]. In the era of Internet of Things, any object can be connected to each other through a communication network. However, Internet of Things has been a broad concept since it was proposed. Because it involves complex technologies such as information, wireless networks, and intelligent control, it is difficult to give it a clear and unified definition [5]. At present, the definition accepted by the mainstream is it refers to a network concept of the GPS, laser scanner, and other information-sensitive sensing devices that connect any item to the Internet according to an agreed protocol for information exchange and communication to realize intelligent identification, positioning, tracking, monitoring, and management [6]. At present, countries all over the world pay attention to the informatization development of science as well as technology. China, the United States, Japan, and South Korea have included Internet of Things in their national strategic plans [7]. At present, many experts and scholars are committed to the core technology research and application system development of Internet of Things so as to establish a technical standard system supported by independent intellectual property rights [8]. Cloud computing is a new computing method based on the Internet, which provides users with on-demand computing through heterogeneous and autonomous services on the Internet [9]. Cloud computing has created countless new ways of working and business models. The Internet of Things realizes the interconnection and intercommunication of data and information between things, and things as well as the Internet, and cloud computing uses its powerful computing and storage capabilities to mine the knowledge laws contained in massive data [10]. The collaborative application of the Internet of Things and cloud computing is conducive to the comprehensive collection and intelligent processing of time data and improves the level of information management in the field [11].

Smart office, also called cloud office, is a new office model that uses cloud computing technology to intelligently manage software and hardware equipment required for office business and realizes the unified deployment and delivery of enterprise application software [12]. Smart office supports the secure remote access of multiple terminal devices such as PCs, mobile phones, and tablet computers, which can enhance the security, ease of use, and scalability of the corporate office environment, improve collaboration and sharing of resources, and comprehensively improve management efficiency and optimization business process, reducing operating costs [13]. Smart office has a wide range of development space and application prospects in the future. Smart office systems make up for most of the shortcomings of traditional office systems and bring office automation to a new level [14]. Daily office applications required by enterprises into virtual desktops, unified delivery, and management are integrated. No matter when and where, as long as users log in with their own account, they will get their own customized operating environment [15]. Cloud technology is used to build a new office application service. Users are provided with an operating experience of “desktop as a service, ready to use” [16]. It is proposed that the various functions provided by the smart office system can not only effectively improve the internal communication management process of the enterprise but also solve the common problems in the enterprise informatization process [17]. With the help of the smart office system, the unified management, rapid deployment, and flexible expansion of each system can be realized, which can effectively reduce the maintenance and management costs [18]. It is proposed that smart office meets the needs of new office applications such as mobile office. At the same time, the smart office system has advantages in reducing enterprise management costs, improving management level and efficiency, etc. [19]. It is proposed that smart office can reduce the IT cost caused by PC updates and can provide users with flexible configuration that cannot be achieved by ordinary PCs. The parameters of smart office desktops support independent setting of high-end or low-end configurations and are equipped on demand [20]. The centralized management of enterprise IT is realized through smart office. Centralized management and online support tools make enterprises no longer need to deploy IT staff in new locations. This can reduce costs, and enterprises can achieve easy, fast, and flexible development. This can reduce the cost of enterprises and accelerate the business process, and enterprises can achieve easy, rapid, and flexible development [21]. It is proposed that smart office can improve the security of enterprise data assets. All data are unified to run on the server side, and the client side only displays its changed images, and there is no need to worry about illegally stealing data through the client side [22]. It is proposed that smart office can realize flexible and convenient mobile office, create a new office model, and realize mobile office through cloud computing technology, and office staff can handle anything related to business at any time and any place. The flexible and intelligent mobile office will become a powerful driving force for companies to gain a competitive advantage; they can work efficiently with other employees in multiple different locations, no matter where they are, and transfer the low-value tasks of high-skilled employees to low-cost employees to complete, which not only can reduce costs but also improve efficiency [23].

From the high analysis, we can see that the depth of theoretical research in the above literature, the introduction of new technologies, and the promotion of mature products, especially in the application of workflow technology in office management systems, are still more theoretical research in China, and there are relatively few practical application products. The practical application of workflow in the office management system needs further study.

3. Cloud Computing Smart Office Data Clustering Processing

As a new generation of data processing and storage technology, cloud computing has realized rapid data processing and mobile applications. As the problem of in-depth mining of massive data information has become an urgent problem to be solved, a clustering algorithm based on cloud computing is designed and applied to the massive data processing of the intelligent office to improve the operating efficiency of the intelligent office system. Clustering refers to reusing as much as possible the manual identification work done by the predecessors to improve work efficiency. It is not difficult to solve the formal data mining clustering problem. The easiest and direct way is to develop a converter between various basic data formats. Because there are not many popular data formats and the conversion rules are clear, this is a problem that can be solved as long as a certain amount of manpower is invested. However, semantic data mining clustering is more complicated, so this article conducts research on cloud computing-based data mining clustering algorithms.

Cluster analysis is to classify datasets based on similarity principles. Clustering is an algorithm in data mining, and it is a pivotal branch in data mining. Researchers have always regarded clustering algorithms as an important research direction, and clustering algorithms have been applied to many fields in major companies. Moreover, it plays an important role in text clustering, user group classification, biology, and finding outliers. The clustering algorithm can efficiently discover the internal laws of things, so it is an important step in data development. In addition, the clustering algorithm can divide the dataset into several classes according to certain rules and select the appropriate similarity measurement formula; thereby, the data belonging to the same class are similar to each other, but the data between the classes are not similar. The flow of the clustering algorithm is shown in Figure 1.

In machine learning, the clustering algorithm is an unsupervised learning algorithm, that is, there is no need to know the type of data before clustering. The purpose of clustering is to classify similar objects in the dataset into the same category. Generally, the evaluation of the clustering effect is determined by the corresponding clustering algorithm, sample approximate distance, and evaluation function. When the number of iterations reaches a certain value and the judgment function tends to infinity, the clustering achieves the best effect [24].

3.1. Sample Similarity Measurement

The following are several commonly used distance measurement methods:(a)Euclidean distance: it is a distance calculation method used to find the distance between two points in the Euclidean space, that is, the straight-line distance between two points. It is usually suitable for the case where the standard of each vector is unified. The formula is(b)Manhattan distance: it is to calculate the distance between city blocks (one intersection to the next intersection), also known as “taxi distance,” and it can also calculate the sum of absolute wheelbases of coordinate axes between data points. The formula is(c)Chebyshev distance: it detects the maximum value of the difference between the data points of two sample records in each coordinate axis. It can also be described as a one-dimensional attribute and is used to determine which cluster an object belongs to. The formula is(d)Cosine similarity: it can be described as the angle between two object attributes in space. The formula is

This paper uses the Euclidean function to measure the similarity between data objects in the cluster.

3.2. Evaluation of Clustering Effect

(a)Davies–Bouldin indexIn the above formula, n represents the number of clusters, represents the centroid of the cluster x, is the average distance from all data points in the x cluster to the centroid , and represents the distance between the centroids and . The smallest Davies–Bouldin index is considered to be the best algorithm based on this standard.(b)Dunn index is used to evaluate clusters with close and easily distinguishable data. It is defined as the ratio between the smallest distance between clusters and the largest distance within clusters. The formula is calculated as follows:In the above formula, represents the distance between the clusters i and j as well as is the intracluster distance of cluster k. The intercluster distance between two clusters can be any number of distance measures. Similarly, the distance within the cluster can be measured in various ways, such as the maximum distance between any pair of elements in the cluster k. Since internal standards seek clusters with high intracluster similarity and low intercluster similarity, it is more desirable to generate an algorithm for clusters with a high Dunn index.(c)Rand index: the Rand index calculates how similar the cluster is to the benchmark classification. In addition, the Rand index can also be as a measure of the percentage of correct decisions made by an algorithm. The formula is as follows:TP represents the number of true examples, TN represents the number of true and negative examples, FP represents the number of false-positive examples, and FN represents the number of false-negative examples. One problem with the Rand index is that false positives and false negatives are equally weighted. For some cluster applications, this may be an undesirable feature.(d)F-measure

where represents the precision rate and R represents the recall rate. We can calculate the F-measure using the following formula:

It should be noted that when , . In other words, at , the recall rate has no effect on the F-measure.

According to the type of data and the purpose of clustering, researchers have pioneered the use of appropriate clustering algorithms to execute data of different dimensions and shapes so as to make the clustering results more accurate. Traditional clustering methods include categories: density-based clustering, probability distribution-based clustering, and hierarchical clustering.

The idea of the K-means Algorithms 1 and 2 is as follows: in the first step, k values are randomly selected from the dataset as the cluster center; in the second step, the distance from each element to the center point is calculated, and these objects select the closest point to themselves as their own class; in the third step, the average value of all objects in all classes is calculated in order, and the average value is used as the center of the new class; in the fourth step, the average error and the E value are calculated until it is less than the specified threshold; otherwise, the second step is followed and iteration is continued.

Input: k value and dataset .
Output: clustering results.
(a)This algorithm randomly selects k data in the dataset D as the initial cluster center point.
(b)This algorithm finds distance from all remaining data points to the k cluster center points and selects the closest point to itself as its cluster center point.
(c)The algorithm calculates the center points of all classes in turn and uses these center points as the new cluster centers.
(d)The algorithm iterates steps b and c repeatedly until the error average and value are less than the specified threshold. The algorithm ends.
Input: intercept .
Output: clustering results.
(a)The algorithm finds the distance between all data.
(b)The algorithm calculates the local density of all data points.
(c)The algorithm calculates the high-density minimum distance of all data.
(d)The algorithm finds the cluster center point in the decision graph.
(e)The algorithm clusters the remaining points to the nearest center point.
(f)The algorithm removes noise points.

Randomly selecting k data points as the initial center points of the clustering will make the clustering effect unstable. For datasets that are not convex, the randomness of the initial centers will usually cause a local optimal solution. Then, this function has many local optimal solutions, and only one of the many local optimal solutions is the global optimal solution. As shown in Figure 2, because the initial center points of the clustering are different, the path of the clustering objective function will follow four different paths to reach the local optimal solution. Among the four local optimal solutions, only one is the global optimal solution C is obtained through -path iteration. Therefore, if the initial clustering center point selection is not appropriate, the K-means clustering algorithm will usually not find the global optimal point and stay at the local optimal point.

In addition, the number of clusters must be specified before K-means clustering. However, many datasets to be classified have unknown characteristics. If the number of dataset classes cannot be clearly indicated, it is necessary to perform multiple clustering tests on the number of different clusters k, and k is selected with the best clustering effect as the number of clusters. This number is not taken randomly. Experiments show that this number must be obtained from so as to obtain the optimal solution for the evaluation clustering effect function. However, the K-means algorithm will be run multiple times in the process of finding the best k value.

LDC is a novel clustering algorithm. The main ideas of the algorithm are as follows: (1) calculating the local density values of all data and arranging them in descending order, (2) selecting the value with high local density as the clustering center point, and (3) clustering the noncentral point data to the nearest class. Several parameters of LDC are defined as follows.

Definition 1. Local densitywhere represents the distance between data points i and j and is the cutoff distance and is an adjustable parameter. represents the number of data points in the range with i as the center and as the radius. The LDC algorithm is more sensitive to the value of , so the value of the cutoff distance has a great influence on the clustering effect.

Definition 2. High-density minimum distancewhere represents the distance between data points i and j and the high-density minimum distance represents the distance from data point i to the nearest point j among all the points with a local density greater than data point i. For the point with the highest local density, it is defined asBelow, we have provided an example to analyze the LDC algorithm in detail.(1)Decision diagram: LDC is a two-dimensional algorithm. The algorithm performs clustering according to Figure 3. Figure 3(a) shows the distribution diagram of the data, and Figure 3(b) shows the decision diagram of the data in Figure 3(a). The labels of the sample points are arranged in descending order according to the local density. Different colors represent different clusters, and black dots are noise dots.The judgment of this clustering center is mainly through the human eyes. From the decision diagram, it is easy to find that point 1 and point 10 are obviously different from other data points, and they have a relatively large local density and a relatively large high-density minimum distance . Therefore, they are used as the center of the cluster when clustering. Although points 26, 27, and 28 have larger high-density minimum distance , the local density is smaller. Therefore, they are deleted as noise points.(2)Classification of data points: after determining the cluster center point in the decision diagram, the remaining data is clustered to the center point closest to itself. The algorithm is simple and clear and has low complexity and no iteration. After the algorithm establishes the center point, clustering is performed directly.(3)Noise filtering: noise points will affect the clustering effect. The algorithm removes noise points by a boundary density. If the local density of the data satisfies , it is defined as a noise point and deleted from the dataset.Aiming at the problem that the clustering results cannot reach the global optimum caused by the random selection of the initial center point of K-means and the influence of noise points on the clustering, this paper proposes the LDCK-means algorithm and conducts clustering experiments to verify its effectiveness. The K-means algorithm also has the following shortcomings: the quality of clustering is sensitive to the random selection of initial points, the selection of k value should be specified, and clustering is also more sensitive to existing noise points. Next, this article analyzes these shortcomings in detail, then makes corresponding optimizations, and finally parallelizes the optimized algorithms to improve the ability to process large-scale data.
The definition of some parameters of the AK-means algorithm is as follows:

Definition 3. Intercluster dispersion degreeDisp represents the intercluster dispersion degree, means the position of the i-th cluster center point, and k means the number of clusters.

Definition 4. Intracluster aggregation degreeAggr represents the intracluster aggregation degree, k represents the number of clusters, and represents the average distance from all elements in the i-th cluster to the center.

Definition 5. Clustering evaluation valuewhere means the k-th intercluster dispersion degree, means the k-th intercluster dispersion degree, means the k-th intracluster aggregation degree, means the -th intracluster aggregation degree, and E is the cluster evaluation value.
To determine the effectiveness of AK-means, we obtain data from the UCI database and choose Iris and Wine as the experimental dataset. Their data volumes are 150 and 178, respectively, the data attributes are 4 and 3, respectively, and the data types are all 3.
To show the differences of the evaluation values E as well as in the clustering process of Iris and Wine, a line graph is drawn as shown in Figure 4.
In Figure 4, while the value of k is larger than 2, the value of E has a small alteration, and the value of is less than 1, which indicates that the algorithm can continue to iterate as well as the combination is reasonable. While the value of k alters from 3 to 2, the value of E suddenly increases, and the value of is larger than 1. This indicates that the merging is unreasonable, so the number of clusters in both datasets is 3. In addition, these experimental results are lined in the real number 3 of the two datasets. This demonstrates the effectiveness of AK-means.

4. Construction of the Intelligent Network Office System Based on Cloud Computing and Machine Learning

The overall system architecture design adopts a service-oriented (SOA) construction idea, that is, using a mature and stable multilayer architecture, adopting an overall plan, and adopting a step-by-step implementation. The overall architecture is shown in Figure 5.

Based on the abovementioned overall architecture, the smart office application system platform is designed according to the following steps: (1) the system adopts original intelligent desktop technology to create a new user experience; (2) the system uses cloud service technology to build smart applications of “desktop as a service”; (3) the system relies on the cloud computing platform to realize application services. Through multiple types of interactive terminals (computers, notebooks, smart phones, and tablet computers), a multiterminal application interconnected and shared smart service is formed.

The system technical architecture is designed according to the SOA design guiding ideology and multilayer architecture. The technical architecture is shown in Figure 6. Using Java mature SSH framework and development technology, it is developed according to the technical requirements of high cohesion, low coupling, pluggability, and componentization to realize the requirements of development of overall applications, high reusability, and flexible expansion and integration.

Private clouds are built for a customer to use alone to provide the most effective control over data, security, and service quality. The unit owns the infrastructure as well as can control the applications deployed on this infrastructure. Private clouds can be deployed in the firewall of university data centers, or they can be deployed in a secure hosting location. In this study, the private cloud is deployed in the university firewall, and the design of basic cloud services must comply with the following principles: (1) It needs to focus on the first phase and consider future scalability. (2) It needs to deploy cloud architecture flexibly to facilitate management and maintenance. (3) It can be observed at any time. (4) Ensure data security. (5) All processes are automated. Figure 7 is a specific planning model for the basic cloud service design.

In the planning model, the logical structure of the corresponding basic cloud service design is shown in Figure 8.

5. Performance Evaluation of Intelligent Network Office System Based on Cloud Computing and Machine Learning

This article combines cloud computing and machine learning technology to build an intelligent networked office system and combines demand analysis to evaluate the performance of the intelligent networked office system. The office system requires a large amount of data fusion work, and it needs to have a strong humanization and convenient network office effect, so this article conducts system performance verification with the support of cloud computing and machine learning technology. First, this article evaluates the data fusion effect of the system, and the results are illustrated in Table 1 and Figure 9.

Compared with the literature [13], the method proposed in this article has more advantages in data fusion. The data fusion in this article is based on the first line of cloud computing, so the speed of data fusion is more advantageous.

From the analysis of Figure 9 and Table 2, we can see that the system constructed in this paper performs well in terms of data fusion and can basically meet the needs of daily smart office. On this basis, this paper conducts a satisfaction survey on the office system constructed in this paper, and the operating effect of the system can be reflected through the satisfaction degree. The statistical results of satisfaction are illustrated in Table 2 and Figure 10.

Compared with traditional paperless office methods and existing intelligent office methods, the method in this article has a certain improvement in office intelligence and office efficiency.

From the above statistical results, we can see that the intelligent office system is highly satisfactory. It can be seen that the effect of the intelligent office system constructed in this article meets the expected requirements.

6. Conclusion

The traditional office automation system focuses on basic office transaction processing functions, and its ability to support process management is weak. Moreover, the part of the workflow is realized manually, so the work intensity is high and the work efficiency is low, and end-to-end closed-loop management cannot be realized. Aiming at the problem of insufficient traditional office management and process management capabilities, this article combines cloud computing and machine learning to construct an intelligent network office system. At the same time, this paper uses the combination of IoT data mining technology and cloud computing framework to realize large-scale IoT data processing and proposes functional and nonfunctional requirements for an office automation management system based on an intelligent workflow. After that, according to the functional and nonfunctional requirements of the office automation management system, this article carries out the overall design of the office automation management system based on an intelligent workflow. Finally, this article combines the actual needs to evaluate the performance of the intelligent network office system. From the actual research situation, the intelligent network office system constructed in this paper has certain effects.

Data Availability

The labeled dataset used to support the findings of this study is available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

Acknowledgments

This study was sponsored by Taiyuan University.