Abstract

Automated reasoning is a challenging research area in artificial intelligence, and automated inference based on lattice-valued logic is an important foundation of intelligent information processing. It is of great academic significance for handling incomparability and linguistic-valued information and for formally verifying the correctness of the corresponding software and hardware systems in various applications. Building on the extensive work on α-resolution automated reasoning in linguistic truth-valued lattice-valued logic, this paper discusses the structure of generalized literals and their α-resolvability. To further improve efficiency, a topology graph deduction method is proposed, together with its soundness, completeness, and a universal algorithm. The core idea of this deduction is distributed computing and resource sharing: in the decision process, as soon as any subtask finds a satisfiability solution, the decision result can be returned directly, shortening the decision time. The algorithm reduces the time complexity from factorial to quadratic.

1. Introduction

The satisfiability problem (SAT), as the first problem proven NP-complete [1], is a cornerstone of computational complexity theory and a fundamental problem in many areas. It plays a prominent role in many domains of computer science and artificial intelligence due to its importance in both theory and applications [2]. The SAT problem is fundamental to solving many practical problems in combinatorial optimization [3], statistical physics [4], circuit verification [5], and computing theory [6], and SAT algorithms have been widely used in real-world applications such as computer algebra systems [7], core graphs [8], real-time scheduling [9], gene regulatory networks [10], automated verification [11], model-based diagnosis (MBD) [12], and machine induction [13]. Therefore, it is of great significance to study the SAT problem and improve its solving speed.

At present, most mainstream SAT solvers are based on the Davis–Putnam–Logemann–Loveland (DPLL) algorithm framework [14]. In the past two decades, scholars have proposed many DPLL-based heuristic algorithms [15] and implementation techniques [16], out of which the now-dominant conflict-driven clause learning (CDCL) framework [17] was formed, and CDCL-based solvers [18] have developed by leaps and bounds. In the DPLL framework, the size of the search space is directly determined by the choice of decision variables. For decades, many heuristic branching decision algorithms have been proposed, mainly including Jeroslow–Wang (JW) [19], Böhm [20], maximum occurrences in minimization (MOM) [21], dynamic literal individual sum (DLIS) [22], variable state independent decaying sum (VSIDS) [15], learning rate branching (LRB) [23], and various improved variants. The core idea of these algorithms is to attach an evaluation score to each decision literal and, at each branching point, select the literal with the maximum (or minimum) score as the new branch direction. After each conflict, these algorithms consume a large amount of CPU and memory to recalculate the scores of the affected variables.
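
The common core of these scoring schemes can be illustrated with the classical Jeroslow–Wang rule, in which each literal l accumulates a weight of 2^(-|c|) over the clauses c containing it, so that short clauses dominate. The following minimal Python sketch shows the idea; the DIMACS-style encoding of literals as signed integers is our own illustrative assumption, not a detail of the cited solvers.

from collections import defaultdict

def jeroslow_wang_pick(clauses):
    # Pick the literal maximizing J(l) = sum over clauses c containing l
    # of 2^(-|c|); shorter clauses contribute exponentially more.
    scores = defaultdict(float)
    for clause in clauses:
        weight = 2.0 ** (-len(clause))
        for lit in clause:
            scores[lit] += weight
    return max(scores, key=scores.get)

# Example: clauses over variables 1..3; negative integers are negated literals.
clauses = [[1, -2], [2, 3], [-1, 2, -3], [2]]
print(jeroslow_wang_pick(clauses))  # the unit clause makes 2 the top-scoring literal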

The Davis–Putnam–Logemann–Loveland (DPLL) algorithm [14], proposed in 1962, primarily employs the unit propagation rule, the pure-literal rule, and the splitting rule to explore the solution space by depth-first search. However, due to the nature of the SAT problem, the DPLL algorithm has exponential time complexity in the worst case. The international SAT competitions organized by research institutions around the world were first held in 1996, and the 22nd competition will be held in Lisbon. The SAT competitions have greatly boosted the efficiency of SAT solvers and have also promoted the wider practical application of SAT. At present, most state-of-the-art complete SAT algorithms derive from DPLL, the most important being the conflict-driven clause learning (CDCL) algorithm [24], an extension of DPLL that incorporates effective search techniques such as clause learning, heuristic decision strategies, restart techniques, and lazy data structures. In 1996, the GRASP solver [24], the prototype of the CDCL algorithm, introduced nonchronological backtracking and a learnt clause scheme, reducing the search space to a large extent. In 1997, the SATO solver [25] employed the head/tail list data structure and vastly enhanced the efficiency of Boolean constraint propagation (BCP). In 2001, the Chaff solver [15] integrated the watched-literal data structure and the low-cost variable state independent decaying sum (VSIDS) decision strategy. In 2002, the BerkMin solver [26] added a learnt clause database deletion strategy to avoid memory explosion caused by the growing number of learnt clauses. In 2004, the Zchaff solver [27] improved the implementation of Chaff, further increasing solving efficiency. In 2005, the MiniSat solver [28] achieved a strong search capability by optimizing the code structure, and a majority of solvers entering later SAT competitions were based on MiniSat. In 2009, the authors of the Glucose solver [29] proposed a new learnt clause reduction method based on the literal block distance (LBD) and a dynamic restart strategy; Glucose won first prize in the application track of the 2011 SAT competition. The Lingeling solver [30], also based on the CDCL algorithm, won the application track of the 2013 and 2014 SAT competitions. In 2016, Liang et al. designed the MapleCOMSPS solver [23, 31], which utilized a new decision branching method, learning rate branching (LRB), and won first prize in the main track of the 2016 SAT competition and second prize in the main track in 2017. In 2017, the solver Maple_LCM_Dist [32], which used a learnt clause minimization approach, won first prize in the main track of the 2017 SAT competition, and many competing solvers in the 2018 SAT competition were based on it [33].

Logic is now widely used in computer science, in logic circuit design, program analysis, security protocol verification, and artificial intelligence. Logic systems not only provide language tools for knowledge representation but also provide mechanized algorithms for knowledge reasoning. Research results in this area form the theoretical basis for automating calculation and reasoning. In addition, the rapid development of computer science, especially artificial intelligence, has provided a broad background and realistic demand for both the theoretical study and the practical application of logic. Lattice-valued logic is an important nonclassical logic that can describe not only totally ordered information but also uncertain information that is not totally ordered (i.e., incomparable).

On the basis of lattice-valued logic deduction, a topology deduction algorithm is proposed in this paper. Conflict analysis, backtracking, restart mechanisms, and per-literal evaluation scores are no longer used in this algorithm. According to the deduction principle of lattice-valued logic, each subtask generates a single-layer branch topology. The main task summarizes the results of all subtasks and generates a shared library file, from which each topology subtask generates its topology view. As soon as any topology subtask obtains a satisfying solution, the whole task is stopped. This method greatly improves solving efficiency and reduces the time complexity from factorial to quadratic.

2. Methods

2.1. Preliminaries

In machine learning, rules usually refer to rules in the narrow sense: semantically clear statements that describe objective regularities or domain concepts implied by the data distribution and that can be written as logical rules of the form "IF…THEN…." Rule learning is to learn from training data a set of rules that can be used to discriminate unseen examples. Compared with "black-box models" such as neural networks and support vector machines, rule learning has better interpretability. In addition, mathematical logic has strong expressive power, and the vast majority of human knowledge can be described and expressed concisely through mathematical logic. The ability of logical rules to express abstractions is a significant advantage in some highly complex AI tasks; in systems where there may be an infinite number of possible answers, abstract expression or reasoning based on logical rules is a great convenience.

Each rule in a rule set can be viewed as a submodel, and the rule set is an ensemble of these submodels. When the same example is covered by multiple rules with different discriminant results, a conflict is said to have occurred, and resolving it is called conflict resolution. Common conflict resolution strategies include voting, ordering, meta-rules, and so on. In terms of the expressive power of the formal language, rules can be divided into propositional rules and first-order rules. A propositional rule is a simple declarative sentence composed of atomic propositions and the logical connectives "∧," "∨," "¬," and "→." The basic components of first-order rules are atomic formulas that describe the properties or relations of objects.

The goal of rule learning is to produce a rule set that covers as many examples as possible. The most direct way to learn rules is sequential covering: for each rule learned on the training set, the training examples covered by that rule are removed, the remaining examples form a new training set, and the process is repeated. Because only one part of the data is processed at a time, this is also known as a divide-and-conquer strategy. An exhaustive search is infeasible when there are many attributes and candidate values, owing to combinatorial explosion, so generate-then-test and data-driven methods are the two common strategies for generating rules. The former is much more robust to noise than the latter, so it is often used for propositional rules; the latter is often used for complex tasks such as first-order rule learning, where the hypothesis space is large. Considering only one optimal literal at a time during rule generation is often too greedy and tends to fall into local optima. To mitigate this problem, relatively mild measures such as beam search can be adopted.
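
As a concrete illustration of the sequential-covering loop, the following simplified Python sketch greedily grows one rule at a time over discrete attributes and removes the covered examples; the precision-based literal selection and the data encoding are illustrative assumptions, not the exact procedure of any particular algorithm.

def covers(rule, x):
    # A rule is a list of (attribute, value) literals, conjunctively read.
    return all(x[a] == v for a, v in rule)

def learn_one_rule(data):
    # Greedily add the (attribute, value) literal that maximizes precision.
    rule, pool = [], data
    while any(y == 0 for _, y in pool):
        candidates = {(a, x[a]) for x, y in pool if y == 1 for a in x}
        candidates -= set(rule)
        if not candidates:
            break
        def precision(lit):
            hit = [(x, y) for x, y in pool if covers(rule + [lit], x)]
            return sum(y for _, y in hit) / len(hit) if hit else 0.0
        rule.append(max(candidates, key=precision))
        pool = [(x, y) for x, y in pool if covers(rule, x)]
    return rule

def sequential_covering(data):
    rules = []
    while any(y == 1 for _, y in data):
        rule = learn_one_rule(data)
        if not rule:
            break
        rules.append(rule)
        # Divide and conquer: drop the examples the new rule covers.
        data = [(x, y) for x, y in data if not covers(rule, x)]
    return rules

data = [({"color": "red", "size": "big"}, 1),
        ({"color": "red", "size": "small"}, 0),
        ({"color": "blue", "size": "big"}, 0)]
print(sequential_covering(data))  # e.g. [[('color', 'red'), ('size', 'big')]]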

Rule generation is essentially a greedy search process and requires mechanisms to mitigate the risk of overfitting, the most common of which is pruning, including prepruning and postpruning. Whether pruning is needed is usually evaluated by comparing the performance of the rule set before and after adding or removing literals according to some performance metric; pruning can also be carried out by means of statistical significance tests. For example, when the Clark and Niblett (CN2) algorithm preprunes, it requires that the prediction based on the rule set be significantly better than the prediction based on the posterior probability distribution of the training sample set. For ease of calculation, CN2 uses the likelihood ratio statistic (LRS). Let m₊ and m₋ represent the numbers of positive and negative examples in the training sample set, respectively, and let m̂₊ and m̂₋ represent the numbers of positive and negative examples covered by the rule set, respectively. Then

LRS = 2 · (m̂₊ · log₂((m̂₊/(m̂₊ + m̂₋)) / (m₊/(m₊ + m₋))) + m̂₋ · log₂((m̂₋/(m̂₊ + m̂₋)) / (m₋/(m₊ + m₋)))).
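
For reference, the statistic can be computed directly from the four counts; the following small Python function is a plain transcription of the reconstructed formula above.

from math import log2

def lrs(m_pos, m_neg, mhat_pos, mhat_neg):
    # m_pos, m_neg: positives/negatives in the training set;
    # mhat_pos, mhat_neg: positives/negatives covered by the rule set.
    covered = mhat_pos + mhat_neg
    total = m_pos + m_neg
    term_pos = mhat_pos * log2((mhat_pos / covered) / (m_pos / total)) if mhat_pos else 0.0
    term_neg = mhat_neg * log2((mhat_neg / covered) / (m_neg / total)) if mhat_neg else 0.0
    return 2.0 * (term_pos + term_neg)

# A rule set covering 90 of 100 positives and 10 of 100 negatives:
print(round(lrs(100, 100, 90, 10), 2))  # 106.2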

In practical tasks with large amounts of data, the CN2 algorithm usually stops only when the LRS is very large. The most common postpruning strategy is reduced-error pruning: the examples are split into a training set and a validation set, rule set R is learned from the training set, and multiple rounds of pruning are then performed against the validation set. Pruning operations include deleting a literal from a rule, deleting the last literal of a rule, deleting several literals at the end of a rule, and deleting an entire rule. It is often better to combine pruning with other postprocessing steps to optimize the rule set. The well-known rule learning algorithm RIPPER, for example, generalizes better than many decision tree algorithms and learns faster than most of them; the trick is precisely to combine pruning with postprocessing optimization.
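
The following minimal sketch illustrates reduced-error pruning of a single rule against a validation set, reusing the rule and data encoding of the sequential-covering sketch above; it is an illustrative simplification, not the RIPPER procedure.

def accuracy(rule, val_data):
    # A rule predicts positive iff it covers the example.
    hits = sum((all(x[a] == v for a, v in rule)) == bool(y) for x, y in val_data)
    return hits / len(val_data)

def prune_rule(rule, val_data):
    # Repeatedly delete the literal at the end of the rule while validation
    # accuracy does not decrease.
    while len(rule) > 1:
        candidate = rule[:-1]
        if accuracy(candidate, val_data) >= accuracy(rule, val_data):
            rule = candidate
        else:
            break
    return rule

val = [({"color": "red", "size": "big"}, 1),
       ({"color": "red", "size": "small"}, 1),
       ({"color": "blue", "size": "big"}, 0)]
print(prune_rule([("color", "red"), ("size", "big")], val))  # [('color', 'red')]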

Definition 1 (see [34]). Let (L, ∨, ∧, O, I) be a bounded lattice with an order-reversing involution ′, where I and O are the greatest and smallest elements of L, respectively, and let →: L × L → L be a mapping. (L, ∨, ∧, ′, →, O, I) is called a lattice implication algebra (LIA) if the following conditions hold for any x, y, z ∈ L: (I₁) x → (y → z) = y → (x → z); (I₂) x → x = I; (I₃) x → y = y′ → x′; (I₄) x → y = y → x = I implies x = y; (I₅) (x → y) → y = (y → x) → x; (L₁) (x ∨ y) → z = (x → z) ∧ (y → z); (L₂) (x ∧ y) → z = (x → z) ∨ (y → z).

Definition 2 (see [34]) (Łukasiewicz implication algebra on a finite chain Lₙ). Let Lₙ = {a₁, a₂, …, aₙ} be a finite chain with a₁ < a₂ < … < aₙ, O = a₁, and I = aₙ. Define, for any aᵢ, aⱼ ∈ Lₙ: aᵢ ∨ aⱼ = a_max(i,j); aᵢ ∧ aⱼ = a_min(i,j); (aᵢ)′ = a_(n−i+1); aᵢ → aⱼ = a_min(n−i+j, n). Then (Lₙ, ∨, ∧, ′, →, a₁, aₙ) is an LIA.
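
Definition 2 is directly computable; the following Python sketch encodes the chain elements a₁, …, aₙ by their indices 1..n and checks two LIA identities on the five-element chain. The integer encoding is an illustrative assumption.

def make_chain_lia(n):
    # Truth values a_1 < ... < a_n are represented by the integers 1..n.
    join = lambda i, j: max(i, j)           # a_i v a_j = a_max(i,j)
    meet = lambda i, j: min(i, j)           # a_i ^ a_j = a_min(i,j)
    neg = lambda i: n - i + 1               # (a_i)' = a_(n-i+1)
    imp = lambda i, j: min(n - i + j, n)    # a_i -> a_j = a_min(n-i+j, n)
    return join, meet, neg, imp

join, meet, neg, imp = make_chain_lia(5)
assert imp(3, 3) == 5                           # x -> x = I (the top element a_5)
assert imp(4, imp(5, 2)) == imp(5, imp(4, 2))   # x -> (y -> z) = y -> (x -> z)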

Definition 3 (see [35]). Let L₂ = {b₁, b₂} and Lₙ = {a₁, a₂, …, aₙ} be two Łukasiewicz implication algebras. The Hasse diagram of Lₙ × L₂ is depicted in Figure 1, and for any (aᵢ, bⱼ), (aₖ, bₗ) ∈ Lₙ × L₂, define (aᵢ, bⱼ) ∨ (aₖ, bₗ) = (aᵢ ∨ aₖ, bⱼ ∨ bₗ), (aᵢ, bⱼ)′ = ((aᵢ)′, (bⱼ)′), and (aᵢ, bⱼ) → (aₖ, bₗ) = (aᵢ → aₖ, bⱼ → bₗ). Then Lₙ × L₂ is an LIA, denoted by L_(n×2).

Definition 4 (see [35]). Let AD = {h₁, h₂, …, hₙ} be a set with n linguistic modifiers and h₁ < h₂ < … < hₙ, let MT = {f, t} be a set of meta truth values with f < t, and denote L_V(n×2) = AD × MT. Define a mapping g: L_(n×2) → L_V(n×2) as g((aᵢ, b₁)) = (h_(n−i+1), f) and g((aᵢ, b₂)) = (hᵢ, t). Then g is a bijection, and its inverse mapping is denoted by g⁻¹. For any x, y ∈ L_V(n×2), define x ∨ y = g(g⁻¹(x) ∨ g⁻¹(y)), x ∧ y = g(g⁻¹(x) ∧ g⁻¹(y)), x′ = g((g⁻¹(x))′), and x → y = g(g⁻¹(x) → g⁻¹(y)); then L_V(n×2) is called a linguistic truth-valued lattice implication algebra, and g is an isomorphic mapping from L_(n×2) to L_V(n×2).

Definition 5 (see [22]). Let X be a set of propositional variables and T = L ∪ {′, →} be a type with ar(′) = 1, ar(→) = 2, and ar(a) = 0 for every a ∈ L. The propositional algebra of the lattice-valued propositional calculus on the set X of propositional variables is the free T algebra on X and is denoted by LP(X).

Remark 1. In particular, when the valuation field of LP(X) is an L_V(n×2), this specific LP(X), i.e., L_V(n×2)P(X), is a linguistic truth-valued lattice-valued propositional logic system. Similarly, the truth-valued domain of LₙP(X) is a Łukasiewicz implication algebra Lₙ.

Definition 6 (see [22]). A valuation of LP(X) is a propositional algebra homomorphism γ: LP(X) → L.
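
As a small illustration of Definition 6, the following sketch evaluates formulas homomorphically over the five-element Łukasiewicz chain from the sketch above; the tuple encoding of formulas and the restriction to the connectives ′ and → are illustrative assumptions.

def evaluate(formula, assignment, n=5):
    # A valuation: variables get chain elements (integers 1..n), and the
    # connectives are interpreted by the chain operations of Definition 2.
    neg = lambda i: n - i + 1
    imp = lambda i, j: min(n - i + j, n)
    if isinstance(formula, str):        # propositional variable
        return assignment[formula]
    if isinstance(formula, int):        # truth-value constant a_i
        return formula
    op, *args = formula
    if op == "not":
        return neg(evaluate(args[0], assignment, n))
    if op == "imp":
        return imp(evaluate(args[0], assignment, n),
                   evaluate(args[1], assignment, n))
    raise ValueError(op)

# v(p) = a_2, v(q) = a_4; then v(p -> q') is computed homomorphically.
print(evaluate(("imp", "p", ("not", "q")), {"p": 2, "q": 4}))  # 5, i.e., a_5 = I
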
Implication is a basic logical connective that plays an important role in all kinds of logical systems. Implication operators have been discussed in many articles and have been taken as the research object in a large number of studies on approximate reasoning in recent years. However, most of this research focuses on semantics and little on construction. For this reason, the concept of fuzzy implication algebra was introduced by Wu [36] as an algebraic abstraction of implication connectives in [0,1]-valued logic. In order to describe incomparability and to axiomatize implication operators, the concept of lattice implication algebra was proposed by Y. Xu in 1993 by combining lattices and implication algebras, and the properties of lattice implication algebras were then studied by him and his students. Taking lattice implication algebras as the truth-value range, several lattice-valued logic systems were established, mainly including the lattice-valued propositional logic system LP(X) and the corresponding lattice-valued first-order logic system LF(X), as well as the dynamic hierarchical L-type lattice-valued propositional logic system and the corresponding L-type lattice-valued first-order logic system. The lattice-valued propositional logic system considered here is obtained by improving LP(X) in order to discuss the problem at hand. On this basis, Xu et al. [37] discussed resolution in LP(X) and LF(X), proposed the concept of the α-resolution principle, and proved its soundness and completeness. The α-resolution principle and automated inference based on it were further studied by Liu et al.
The resolution principle [38] is one of the most important methods for judging the unsatisfiability of logical formulas. Because of its soundness and completeness, it is widely used in modern automated theorem-proving systems. To improve reasoning efficiency, many scholars have reduced the clauses involved in resolution by restricting clauses and literals while preserving completeness, yielding logical deduction methods such as ordered linear resolution, semantic resolution, and lock resolution.

2.2. Algorithm

Suppose an undirected graph G = ⟨V, E⟩, where V is a nonempty finite set, called the vertex set, whose elements are called vertices or nodes, and E is a finite subset of the unordered product V × V, called the edge set, whose elements are called undirected edges, or edges for short.

Suppose a clause set S = {C₁, C₂, …, Cₙ}, where clauses C₁ to Cₙ correspond to the nodes of the undirected graph. If there are resolvable literals between two clauses, add an undirected edge between the nodes represented by these two clauses, and label the edge with the first resolvable literal.
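
The construction can be sketched directly; in the following Python fragment, the resolvability test is the classical complementary-literal check, standing in for the α-resolvability test of the lattice-valued setting, and the signed-integer literal encoding is an illustrative assumption.

def build_clause_graph(clauses):
    # Nodes are clause indices; an edge joins two clauses containing a pair
    # of complementary literals and is labeled by the first such literal.
    edges = {}
    for i in range(len(clauses)):
        for j in range(i + 1, len(clauses)):
            for lit in clauses[i]:
                if -lit in clauses[j]:      # first resolvable literal
                    edges[(i, j)] = lit
                    break
    return edges

clauses = [[1, 2], [-1, 3], [-2, -3]]
print(build_clause_graph(clauses))  # {(0, 1): 1, (0, 2): 2, (1, 2): 3}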

In graph-theoretic terms, finding satisfying solutions by logical deduction amounts to finding Hamiltonian paths in the undirected graph. Before searching for a Hamiltonian path, it is unknown whether there are edges between nodes, so choosing the starting node is the key problem of many algorithms. Some algorithms choose a short clause as the starting node; some choose a clause with fewer branches; some calculate the correlation or weight of literals or clauses according to a strategy and select the clause with the highest correlation or weight; and some try to extend local satisfiability solutions to global ones. Most of these algorithms adopt branching decisions and require frequent backtracking and restart mechanisms.

Utilizing distributed computing is the core idea of this paper, and it is well suited to finding satisfying solutions. The algorithm is described as follows.

Step 0 (Initialization). Given a set of clauses S = {C₁, C₂, …, Cₙ}, where n is the length of the clause set, there is a conjunctive relationship between the clauses.

Step 1. Start n subtasks and determine the starting clause for each subtask.

Step 2. In each subtask, find matches for the literals in the starting clause according to resolution deduction, and generate a one-layer branch topology from the matching results.

Step 3. Start a general task, which collects the topology results of all subtasks and forms a shared library file; then end the n subtasks.

Step 4. Start n topology subtasks. Each generates a multilayer topology according to the shared library file and judges whether the topology contains a Hamiltonian path.

Step 5. If any topology subtask finds a Hamiltonian path, it reports back to the general task, which notifies all subtasks to terminate. It can then be determined that a satisfiability solution exists and the clause set is satisfiable.

Step 6. If all topology subtasks end without finding a Hamiltonian path, it can be determined that there is no satisfiability solution and the clause set is unsatisfiable.
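
Putting Steps 0 to 6 together, the following single-process Python sketch simulates the whole procedure; a genuinely distributed version would launch each starting clause as a separate subtask and let the general task terminate the remaining subtasks as soon as one reports a Hamiltonian path. The classical complementary-literal resolvability test and the signed-integer clause encoding are illustrative assumptions standing in for the lattice-valued α-resolvability test.

def one_layer_topology(k, clauses):
    # Step 2: subtask k lists the clauses resolvable with clause k.
    return [j for j in range(len(clauses)) if j != k
            and any(-lit in clauses[j] for lit in clauses[k])]

def shared_library(clauses):
    # Step 3: the general task merges all one-layer topologies.
    return {k: one_layer_topology(k, clauses) for k in range(len(clauses))}

def hamiltonian_from(start, adj, n):
    # Steps 4 and 5: depth-first search for a Hamiltonian path from start.
    def extend(path, seen):
        if len(path) == n:
            return path
        for nxt in adj[path[-1]]:
            if nxt not in seen:
                found = extend(path + [nxt], seen | {nxt})
                if found:
                    return found
        return None
    return extend([start], {start})

def topology_deduction(clauses):
    adj = shared_library(clauses)
    for start in range(len(clauses)):       # each start node = one topology subtask
        path = hamiltonian_from(start, adj, len(clauses))
        if path:                            # Step 5: stop as soon as a path is found
            return path
    return None                             # Step 6: no Hamiltonian path anywhere

clauses = [[1, 2], [-1, 3], [-3, 2], [-2]]
print(topology_deduction(clauses))          # [0, 1, 2, 3]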

2.3. Examples and Results

Given a set of clauses S = {C₁, C₂, …, C₇}, where there is a conjunctive relationship between the clauses, the literals in these clauses read as follows:

Given a set of tasks, start seven subtasks T₁ to T₇, and take one subtask as an example. Clause C₁ is chosen as the starting one. According to the principle of resolution, matching literals of C₁ are sought in the other clauses, and the resolvable clauses are identified as resolution clauses. Then the one-layer branch topology of C₁ is generated (see Figure 2). Literals on the edges represent the first resolvable literals involved.

The branch topologies of the other clauses can be obtained by the same method (see Figure 3).

The general task collects the topology result of each subtask and forms a shared library file, after which subtasks T₁ to T₇ can be ended. Given a set of topology tasks, start seven topology subtasks, and take one topology subtask as an example. C₁ is chosen as the starting node. According to the shared library file, the multilayer topology of C₁ is generated. The multilayer topologies of the other subtasks can be obtained by the same method (see Figure 4).

In the judgment process, if a satisfying solution is to be found in the result, the multilayer topology should be pruned further. According to the principle of resolution, the literals on the upper and lower edges of a middle-layer node cannot be matched with each other, so branches that do not meet this criterion are removed. The pruned multilayer topology is shown in Figure 5.

According to the pruned topology, we can judge whether a satisfiability solution exists. If there is no satisfiability solution, the local maximum-length satisfiability solution can still be obtained: in this example, two of the clause sets constitute the local maximum-length satisfiability solutions, and the corresponding literal sets constitute the local maximum-length satisfiability literals.

3. Conclusion and Discussion

Classical logic is the basis of the reasoning mechanisms used in artificial intelligence, fuzzy control, and other fields, and formal methods are adopted to describe uncertain reasoning problems. Nonclassical logic theory is currently developing rapidly, and many open problems limit its further successful application in control engineering. Based on the latest research results in this field, this paper studies several problems of nonclassical logic, specifically lattice implication algebras, lattice-valued logic, and lattice-valued model theory, and presents a topology graph algorithm based on the principles of lattice-valued logic. It provides a basis for finding the satisfiability solutions of clause sets. Further research will concentrate on combining this method with other traditional resolution methods and algorithms and on analyzing the complexity of the related algorithms. Formal verification is a direct application of automated reasoning in software engineering; how to apply this method to reason about and validate the correctness of programs represented by logical formulas is worth studying. Transportation control systems are safety-critical systems; the formal verification of software and hardware in intelligent transportation systems, and of related high-confidence software in other specific areas, is also a worthwhile research topic. Hence, how to translate the related procedures into formulas of specific logic systems and further validate their correctness with our existing resolution method is an important direction.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no potential conflicts of interest in this study.

Authors’ Contributions

All the authors have seen and approved the manuscript to be published.

Acknowledgments

This paper was supported by the National Natural Science Foundation of China under Grant no. 62172135.